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NEISSERIA MENINGITIDIS ANTIGENS AND COMPOSITIONS 

This application is a continuation-in-part of the following U.S. Provisional Patent 
applications, from each of which priority is claimed, and each of which is incorporated by 
reference in its entirety: 60/083,758 (filed May 1, 1998); 60/094,869 (filed July 31, 1998); 
60/098,994 (filed September 2, 1998); 60/099,062 (filed September 2, 1998); 60/103,749 
(filed October 9, 1998); 60/103,794 (filed October 9, 1998); 60/103,796 (filed October 9, 
1998); and 60/121,528 (filed February 25, 1999). 

This invention relates to antigens from the bacterial species: Neisseria meningitidis 
and Neisseria gonorrhoeae. 

BACKGROUND 

Neisseria meningitidis is a non-motile, gram negative diplococcus human pathogen. It 
colonizes the pharynx, causing meningitis and, occasionally, septicaemia in the absence of 
meningitis. It is closely related to N. gonorrhoea, although one feature that clearly 
differentiates meningococcus from gonococcus is the presence of a polysaccharide capsule 
that is present in all pathogenic meningococci. 

N. meningitidis causes both endemic and epidemic disease. In the United States the 
attack rate is 0.6-1 per 100,000 persons per year, and it can be much greater during outbreaks, 
(see Lieberman et al. (1996) Safety and Immunogenicity of a Serogroups A/C Neisseria 
meningitidis Oligosaccharide-Protein Conjugate Vaccine in Young Children. JAMA 
275(19):1499-1503; Schuchat et al (1997) Bacterial Meningitis in the United States in 1995. 
NEngl J Med 337(14):970-976). In developing countries, endemic disease rates are much 
higher and during epidemics incidence rates can reach 500 cases per 100,000 persons per 
year. Mortality is extremely high, at 10-20% in the United States, and much higher in 
developing countries. Following the introduction of the conjugate vaccine against 
Haemophilus influenzae, N meningitidis is the major cause of bacterial meningitis at all ages 
in the United States (Schuchat et al (1997) supra). 

Based on the organism's capsular polysaccharide, 12 serogroups of N. meningitidis 
have been identified. Group A is the pathogen most often implicated in epidemic disease in 
sub-Saharan Africa. Serogroups B and C are responsible for the vast majority of cases in the 
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United States and in most developed countries. Serogroups W135 and Y are responsible for 
the rest of the cases in the United States and developed countries. The meningococcal vaccine 
currently in use is a tetravalent polysaccharide vaccine composed of serogroups A, C, Y and 
W135. Although efficacious in adolescents and adults, it induces a poor immune response and 
short duration of protection, and cannot be used in infants [eg. Morbidity and Mortality 
weekly report, Vol.46, No. RR-5 (1997)]. This is because polysaccharides are T-cell 
independent antigens that induce a weak immune response that cannot be boosted by repeated 
immunization. Following the success of the vaccination against H.influenzae, conjugate 
vaccines against serogroups A and C have been developed and are at the final stage of clinical 
testing (Zollinger WD "New and Improved Vaccines Against Meningococcal Disease". In: 
New Generation Vaccines, supra, pp. 469-488; Lieberman et al (1996) supra; Costantino et al 
(1992) Development and phase I clinical testing of a conjugate vaccine against 
meningococcus A and C. Vaccine 10:691-698). 

Meningococcus B (menB) remains a problem, however. This serotype currently is 
responsible for approximately 50% of total meningitis in the United States, Europe, and South 
America. The polysaccharide approach cannot be used because the menB capsular 
polysaccharide is a polymer of a(2-8)-linked N-acetyl neuraminic acid that is also present in 
mammalian tissue. This results in tolerance to the antigen; indeed, if an immune response 
were elicited, it would be anti-self, and therefore undesirable. In order to avoid induction of 
autoimmunity and to induce a protective immune response, the capsular polysaccharide has, 
for instance, been chemically modified substituting the Af-acetyl groups with N-propionyl 
groups, leaving the specific antigenicity unaltered (Romero & Outschoorn (1994) Current 
status of Meningococcal group B vaccine candidates: capsular or non-capsular? Clin 
Microbiol Rev 7(4):559-575). 

Alternative approaches to menB vaccines have used complex mixtures of outer 
membrane proteins (OMPs), containing either the OMPs alone, or OMPs enriched in porins, 
or deleted of the class 4 OMPs that are believed to induce antibodies that block bactericidal 
activity. This approach produces vaccines that are not well characterized. They are able to 
protect against the homologous strain, but are not effective at large where there are many 
antigenic variants of the outer membrane proteins. To overcome the antigenic variability, 
multivalent vaccines containing up to nine different porins have been constructed (eg. 
Poolman JT (1 992) Development of a meningococcal vaccine. Infect. Agents Dis. 4: 13-28). 
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Additional proteins to be used in outer membrane vaccines have been the opa and opc 
proteins, but none of these approaches have been able to overcome the antigenic variability 
(eg. Ala'Aldeen & Borriello (1996) The meningococcal trans ferrin-binding proteins 1 and 2 
are both surface exposed and generate bactericidal antibodies capable of killing homologous 
and heterologous strains. Vaccine 14(l):49-53). 

A certain amount of sequence data is available for meningococcal and gonoccocal 
genes and proteins (e.g. EP-A-0467714, W096/29412), but this is by no means complete. 
Other men B proteins may include those listed in WO 97/28273, WO 96/29412, WO 
95/03413, US 5,439,808, and US 5,879,686. 

The provision of further sequences could provide an opportunity to identify secreted 
or surface-exposed proteins that are presumed targets for the immune system and which are 
not antigenically variable. For instance, some of the identified proteins could be components 
of efficacious vaccines against meningococcus B, some could be components of vaccines 
against all meningococcal serotypes, and others could be components of vaccines against all 
pathogenic Neisseriae including Neisseria meningitidis or Neisseria gonorrhoeae. Those 
sequences specific to N meningitidis or N. gonorrhoeae that are more highly conserved are 
further preferred sequences. 

It is thus an object of the invention is to provide Neisserial DNA sequences which 
encode proteins that are antigenic or immunogenic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the products of protein expression and purification of the predicted 
ORF 919 as cloned and expressed in E. coli. 

Fig. 2 illustrates the products of protein expression and purification of the predicted 
ORF 279 as cloned and expressed in E. coli. 

Fig. 3 illustrates the products of protein expression and purification of the predicted 
ORF 576-1 as cloned and expressed in E. coli. 

Fig. 4 illustrates the products of protein expression and purification of the predicted 
ORF 519-1 as cloned and expressed in E. coli. 

Fig. 5 illustrates the products of protein expression and purification of the predicted 
ORF 121-1 as cloned and expressed inE. coli. 
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Fig. 6 illustrates the products of protein expression and purification of the predicted 
ORF 128-1 as cloned and expressed in E. coli. 

Fig. 7 illustrates the products of protein expression and purification of the predicted 
ORF 206 as cloned and expressed in E. coli. 

Fig. 8 illustrates the products of protein expression and purification of the predicted 
ORF 287 as cloned and expressed in E. coli. 

Fig. 9 illustrates the products of protein expression and purification of the predicted 
ORF 406 as cloned and expressed in E. coli. 

Fig. 10 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 919 as cloned and expressed in E. coli. 

Fig. 1 1 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 279 as cloned and expressed in E. coli. 

Fig. 12 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 576-1 as cloned and expressed in E. coli. 

Fig. 13 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 519-1 as cloned and expressed in E. coli. 

Fig. 14 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 121-1 as cloned and expressed in E. coli. 

Fig. 15 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 128-1 as cloned and expressed in E. coli. 

Fig. 16 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 206 as cloned and expressed in E. coli. 

Fig. 17 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 287 as cloned and expressed in E. coli. 

Fig. 18 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 406 as cloned and expressed in E. coli. 

Fig. 19 shows an alignment comparison of amino acid sequences for ORF 225 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 
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Fig. 20 shows an alignment comparison of amino acid sequences for ORP 235 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 21 shows an alignment comparison of amino acid sequences for ORF 287 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 22 shows an alignment comparison of amino acid sequences for ORF 519 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 23 shows an alignment comparison of amino acid sequences for ORF 919 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, farther confirming its 
utility as an antigen for both vaccines and diagnostics. 

THE INVENTION 

The invention provides proteins comprising the N meningitidis amino acid sequences 
and N. gonorrhoeae amino acid sequences disclosed in the examples. 

It also provides proteins comprising sequences homologous (i.e., those having 
sequence identity) to the N. meningitidis amino acid sequences disclosed in the examples. 
Depending on the particular sequence, the degree of homology (sequence identity) is 
preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). These proteins 
include mutants and allelic variants of the sequences disclosed in the examples. Typically, 
50% identity or more between two proteins is considered to be an indication of functional 
equivalence. Identity between proteins is preferably determined by the Smith- Waterman 
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homology search algorithm as implemented in MPSRCH program (Oxford Molecular) using 
an affine gap search with parameters: gap penalty 12, gap extension penalty 1. 

The invention further provides proteins comprising fragments of the N. meningitidis 
amino acid sequences and N. gonorrhoeae amino acid sequences disclosed in the examples. 
The fragments should comprise at least n consecutive amino acids from the sequences and, 
depending on the particular sequence, n is 7 or more (eg. 8, 10, 12, 14, 16, 18, 20 or more). 
Preferably the fragments comprise an epitope from the sequence. 

The proteins of the invention can, of course, be prepared by various means (eg. 
recombinant expression, purification from cell culture, chemical synthesis etc.) and in various 
forms (eg. native, fusions etc.). They are preferably prepared in substantially pure or isolated 
form (ie. substantially free from other N. meningitidis or N. gonorrhoeae host cell proteins) 

According to a further aspect, the invention provides antibodies which bind to these 
proteins. These may be polyclonal or monoclonal and may be produced by any suitable 
means. 

According to a further aspect, the invention provides nucleic acid comprising the 
N. meningitidis nucleotide sequences and TV. gonorrhoeae nucleotide sequences disclosed in 
the examples. 

According to a further aspect, the invention comprises nucleic acids having sequence 
identity of greater than 50% (e.g., 60%, 70%, 80%, 90%, 95%, 99% or more) to the nucleic 
acid sequences herein. Sequence identity is determined as above-discussed. 

According to a further aspect, the invention comprises nucleic acid that hybridizes to 
the sequences provided herein. Conditions for hybridization are set forth herein. 

Nucleic acid comprising fragments of these sequences are also provided. These should 
comprise at least n consecutive nucleotides from the W. meningitidis sequences or N. 
gonorrhoeae sequences and depending on the particular sequence, n is 10 or more (eg 12, 14, 
15, 18, 20, 25, 30, 35, 40 or more). 

According to a further aspect, the invention provides nucleic acid encoding the 
proteins and protein fragments of the invention. 

It should also be appreciated that the invention provides nucleic acid comprising 
sequences complementary to those described above (eg. for antisense or probing purposes). 

Nucleic acid according to the invention can, of course, be prepared in many ways (eg. 
by chemical synthesis, in part or in whole, from genomic or cDNA libraries, from the 
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organism itself etc.) and can take various forms (eg. single stranded, double stranded, vectors, 
probes etc.). 

In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, 
such as those containing modified backbones, and also protein nucleic acids (PNA) etc. 

According to a further aspect, the invention provides vectors comprising nucleotide 
sequences of the invention (eg. expression vectors) and host cells transformed with such 
vectors. 

According to a further aspect, the invention provides compositions comprising protein, 
antibody, and/or nucleic acid according to the invention. These compositions may be suitable 
as vaccines, for instance, or as diagnostic reagents or as immunogenic compositions. 

The invention also provides nucleic acid, protein, or antibody according to the 
invention for use as medicaments (eg. as vaccines) or as diagnostic reagents. It also provides 
the use of nucleic acid, protein, or antibody according to the invention in the manufacture of 
(I) a medicament for treating or preventing infection due to Neisserial bacteria (ii) a 
diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised 
against Neisserial bacteria or (iii) for raising antibodies. Said Neisserial bacteria may be any 
species or strain (such as N. gonorrhoeae) but are preferably N. meningitidis, especially strain 
B or strain C. 

The invention also provides a method of treating a patient, comprising administering 
to the patient a therapeutically effective amount of nucleic acid, protein, and/or antibody 
according to the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of 
culturing a host cell according to the invention under conditions which induce protein 
expression. 

A process for detecting polynucleotides of the invention is provided, comprising the 
steps of: (a) contacting a nucleic probe according to the invention with a biological sample 
under hybridizing conditions to form duplexes; and (b) detecting said duplexes. 

A process for detecting proteins of the invention is provided, comprising the steps of: 
(a) contacting an antibody according to the invention with a biological sample under 
conditions suitable for the formation of an antibody-antigen complexes; and (b) detecting 
said complexes. 
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Having now generally described the inventiion, the same will be more readily 
understood through reference to the following examples which are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

Methodology - Summary of standard procedures and techniques. 
General 

This invention provides Neisseria meningitidis menB nucleotide sequences, amino 
acid sequences encoded therein. With these disclosed sequences, nucleic acid probe assays 
and expression cassettes and vectors can be produced. The expression vectors can be 
transformed into host cells to produce proteins. The purified or isolated polypeptides (which 
may also be chemically synthesized) can be used to produce antibodies to detect menB 
proteins. Also, the host cells or extracts can be utilized for biological assays to isolate 
agonists or antagonists. In addition, with these sequences one can search to identify open 
reading frames and identify amino acid sequences. The proteins may also be used in 
immunogenic compositions, antigenic compositions and as vaccine components. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, which are within the skill of the art. Such techniques are explained fully in the 
literature e.g., Sambrook Molecular Cloning; A Laboratory Manual, Second Edition (1989); 
DNA Cloning, Volumes land ii (D.N Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 
ed, 1984); Nucleic Acid Hybridization (B.D. Hames & SJ. Higgins eds. 1984); Transcription 
and Translation (B.D. Hames & S.J. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney 
ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide to 
Molecular Cloning (1984); the Methods in Enzymology series (Academic Press, Inc.), 
especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987), 
Immunochemical Methods in Cell and Molecular Biology (Academic Press, London); Scopes, 
(1987) Protein Purification: Principles and Practice, Second Edition (Springer- Verlag, 
N.Y.), and Handbook of Experimental Immunology, Volumes I-IV (D.M. Weir and C.C. 
Blackwell eds 1986). 

Standard abbreviations for nucleotides and amino acids are used in this specification. 
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All publications, patents, and patent applications cited herein are incorporated in full 
by reference. 

Expression systems 

The Neisseria menB nucleotide sequences can be expressed in a variety of different 
expression systems; for example those used with mammalian cells, plant cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammalian promoter is any 
DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiating region, which is usually placed proximal to the 5" 
end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream 
of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to 
begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. 
An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation (Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian 
Cells." In Molecular Cloning: A Laboratory Manual, 2nded.). 

Mammalian viral genes are often highly expressed and have a broad host range; 
therefore sequences encoding mammalian viral genes provide particularly useful promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In 
addition, sequences derived from non-viral genes, such as the murine metallothionein gene, 
also provide useful promoter sequences. Expression may be either constitutive or regulated 
(inducible). Depending on the promoter selected, many promotes may be inducible using 
known substrates, such as the use of the mouse mammary tumor virus (MMTV) promoter 
with the glucocorticoid responsive element (GRE) that is induced by glucocorticoid in 
hormone-responsive transformed cells (see for example, U.S. Patent 5,783,681). 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will usually increase expression levels. An enhancer is a 
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regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to 
homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of more 
than 1000 nucleotides from the promoter (Maniatis et al. (1987) Science 236:1237- Alberts et 
al. (1989) Molecular Biology of the Cell, 2nd ed.). Enhancer elements derived from viruses 
may be particularly useful, because they usually have a broader host range. Examples include 
the SV40 early gene enhancer (Dijkema et al (1985) EMBO J. 4:161) and the 
enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus 
(Gorman et al. (1982b) Proc. Natl Acad. Set 79:6111) and from human cytomegalovirus 
(Boshart et al. (1985) Cell 41:521). Additionally, some enhancers are regulatable and become 
active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and 
Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) Science 236:1237). 

A DNA molecule may be expressed intracellular^ in mammalian cells. A promoter 
sequence may be directly linked with the DNA molecule, in which case the first amino acid at 
the N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign protein in mammalian cells. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually 
encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of 
the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence 
that provides for secretion of a foreign protein in mammalian cells. 

Usually, transcription termination and polyadenylation sequences recognized by 
mammalian cells are regulatory regions located 3' to the translation stop codon and thus, 
together with the promoter elements, flank the coding sequence. The 3* terminus of the 
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation 
(Birnstiel et al. (1985) Cell 47:349; Proudfoot and Whitelaw (1988) "Termination and 3' end 
processing of eukaryotic RNA. In Transcription and splicing (ed. B.D. Hames and D.M. 
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Glover); Proudfoot (1989) Trends Biochem. ScL 7^: 105). These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. 
Examples of transcription terminator/polyadenylation signals include those derived from 
SV40 (Sambrook et al (1989) "Expression of cloned genes in cultured mammalian cells." In 
Molecular Cloning: A Laboratory Manual). 

Usually, the above described components, comprising a promoter, polyadenylation 
signal, and transcription termination sequence are put together into expression constructs. 
Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may 
also be included in an expression construct, if desired. Expression constructs are often 
maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication 
systems include those derived from animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication systems of papovaviruses, such as 
SV40 (Gluzman (1981) Cell 25:175) or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr 
virus. Additionally, the replicon may have two replication systems, thus allowing it to be 
maintained, for example, in mammalian cells for expression and in a prokaryotic host for 
cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include 
pMT2 (Kaufman et al. (1989) Mol Cell Biol 9:946) and pHEBO (Shimizu et al. (1986) Mol 
Cell Biol 6:1074). 

The transformation procedure used depends upon the host to be transformed. Methods 
for introduction of heterologous polynucleotides into mammalian cells are known in the art 
and include dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell lines available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby 
hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma 
cells (e.g., Hep G2), and a number of other cell lines. 
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ii. Plant Cellular Expression Systems 

There are many plant cell culture and whole plant genetic expression systems known 
in the art. Exemplary plant cellular genetic expression systems include those described in 
patents, such as: U.S. 5,693,506; US 5,659,122; and US 5,608,143. Additional examples of 
genetic expression in plant cell culture has been described by Zenk, Phytochemistry 30:3861- 
3863 (1991). Descriptions of plant protein signal peptides may be found in addition to the 
references described above in Vaulcombe et ah, Mol. Gen. Genet. 209:33-40 (1987); Chandler 
et al., Plant Molecular Biology 3:407-418 (1984); Rogers, J. Biol Chem. 260:3731-3738 
(1985); Rothstein et al., Gene 55:353-356 (1987); Whittier et al., Nucleic Acids Research 
15:2515-2535 (1987); Wirsel et al., Molecular Microbiology 3:3-14 (1989); Yu et al., Gene 
122:247-253 (1992). A description of the regulation of plant gene expression by the 
phytohormone, gibberellic acid and secreted enzymes induced by gibberellic acid can be 
found in R.L. Jones and J. MacMillin, Gibberellins: in: Advanced Plant Physiology,. Malcolm 
B. Wilkins, ed., 1984 Pitman Publishing Limited, London, pp. 21-52. References that 
describe other metabolically-regulated genes: Sheen, Plant Cell, 2:1027-1038(1990); Maas et 
al, EMBOJ. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl Acad. ScL 84:1337-1339 
(1987) 

Typically, using techniques known in the art, a desired polynucleotide sequence is 
inserted into an expression cassette comprising genetic regulatory elements designed for 
operation in plants. The expression cassette is inserted into a desired expression vector with 
companion sequences upstream and downstream from the expression cassette suitable for 
expression in a plant host. The companion sequences will be of plasmid or viral origin and 
provide necessary characteristics to the vector to permit the vectors to move DNA from an 
original cloning host, such as bacteria, to the desired plant host. The basic bacterial/plant 
vector construct will preferably provide a broad host range prokaryote replication origin; a 
prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not 
readily amenable to detection, the construct will preferably also have a selectable marker gene 
suitable for determining if a plant cell has been transformed. A general review of suitable 
markers, for example for the members of the grass family, is found in Wilmink and Dons, 
1993, Plant Mol Biol Reptr, 1 1(2):165-185. 
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Sequences suitable for permitting integration of the heterologous sequence into the 
plant genome are also recommended. These might include transposon sequences and the like 
for homologous recombination as well as Ti sequences which permit random insertion of a 
heterologous expression cassette into a plant genome. Suitable prokaryote selectable markers 
include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known in the art. 

The nucleic acid molecules of the subject invention may be included into an 
expression cassette for expression of the protein(s) of interest. Usually, there will be only one 
expression cassette, although two or more are feasible. The recombinant expression cassette 
will contain in addition to the heterologous protein encoding sequence the following 
elements, a promoter region, plant 5' untranslated sequences, initiation codon depending upon 
whether or not the structural gene comes equipped with one, and a transcription and 
translation termination sequence. Unique restriction enzyme sites at the 5' and 3' ends of the 
cassette allow for easy insertion into a pre-existing vector. 

A heterologous coding sequence may be for any protein relating to the present 
invention. The sequence encoding the protein of interest will encode a signal peptide which 
allows processing and translocation of the protein, as appropriate, and will usually lack any 
sequence which might result in the binding of the desired protein of the invention to a 
membrane. Since, for the most part, the transcriptional initiation region will be for a gene 
which is expressed and translocated during germination, by employing the signal peptide 
which provides for translocation, one may also provide for translocation of the protein of 
interest. In this way, the protein(s) of interest will be translocated from the cells in which they 
are expressed and may be efficiently harvested. Typically secretion in seeds are across the 
aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required 
that the protein be secreted from the cells in which the protein is produced, this facilitates the 
isolation and purification of the recombinant protein. 

Since the ultimate expression of the desired gene product will be in a eucaryotic cell it 
is desirable to determine whether any portion of the cloned gene contains sequences which 
will be processed out as introns by the host's splicosome machinery. If so, site-directed 
mutagenesis of the "intron" region may be conducted to prevent losing a portion of the genetic 
message as a false intron code, Reed and Maniatis, Cell 41:95-105, 1985. 
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The vector can be microinjected directly into plant cells by use of micropipettes to 
mechanically transfer the recombinant DNA. Crossway, Mol Gen. Genet, 202: 179-1 85, 
1985. The genetic material may also be transferred into the plant cell by using polyethylene 
glycol, Krens, et al., Nature, 296, 12-1 A, 1982. Another method of introduction of nucleic 
acid segments is high velocity ballistic penetration by small particles with the nucleic acid 
either within the matrix of small beads or particles, or on the surface, Klein, et al, Nature, 
327, 70-73, 1987 and Knudsen and Muller, 1991, Planta, 185:330-336 teaching particle 
bombardment of barley endosperm to create transgenic barley. Yet another method of 
introduction would be fusion of protoplasts with other entities, either minicells, cells, 
lysosomes or other fusible lipid- surfaced bodies, Fraley, et al., Proc. Natl. Acad. Sci. USA, 19, 
1859-1863, 1982. 

The vector may also be introduced into the plant cells by electroporation. (Fromm et 
al, Proc. Natl Acad. Sci. USA 82:5824, 1985). In this technique, plant protoplasts are 
electroporated in the presence of plasmids containing the gene construct. Electrical impulses 
of high field strength reversibly permeabilize biomembranes allowing the introduction of the 
plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus. 

All plants from which protoplasts can be isolated and cultured to give whole 
regenerated plants can be transformed by the present invention so that whole plants are 
recovered which contain the transferred gene. It is known that practically all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of 
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable 
plants include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, 
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersion, 
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Hererocallis, Nemesia, Pelargonium, Panicum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea, Triticum, 
Sorghum, and Datura. 

Means for regeneration vary from species to species of plants, but generally a 
suspension of transformed protoplasts containing copies of the heterologous gene is first 
provided. Callus tissue is formed and shoots may be induced from callus and subsequently 
rooted. Alternatively, embryo formation can be induced from the protoplast suspension. 
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These embryos germinate as natural embryos to form plants. The culture media will generally 
contain various amino acids and hormones, such as auxin and cytokinins. It is also 
advantageous to add glutamic acid and proline to the medium, especially for such species as 
corn and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If these three 
variables are controlled, then regeneration is fully reproducible and repeatable. 

In some plant cell culture systems, the desired protein of the invention may be 
excreted or alternatively, the protein may be extracted from the whole plant. Where the 
desired protein of the invention is secreted into the medium, it may be collected. 
Alternatively, the embryos and embryoless-half seeds or other plant tissue may be 
mechanically disrupted to release any secreted protein between cells and tissues. The mixture 
may be suspended in a buffer solution to retrieve soluble proteins. Conventional protein 
isolation and purification methods will be then used to purify the recombinant protein. 
Parameters of time, temperature pH, oxygen, and volumes will be adjusted through routine 
methods to optimize expression and recovery of heterologous protein. 

iii. Baculovirus Systems 

The polynucleotide encoding the protein can also be inserted into a suitable insect 
expression vector, and is operably linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. Generally, the components of 
the expression system include a transfer vector, usually a bacterial plasmid, which contains 
both a fragment of the baculovirus genome, and a convenient restriction site for insertion of 
the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specific fragment in the transfer vector (this allows for the 
homologous recombination of the heterologous gene in to the baculovirus genome); and 
appropriate insect host cells and growth media. 

After inserting the DNA sequence encoding the protein into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the vector 
and viral genome are allowed to recombine. The packaged recombinant virus is expressed 
and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, inter 
alia, Invitrogen, San Diego CA ( M MaxBac M kit). These techniques are generally known to 
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those skilled in the art and fully described in Summers and Smith, Texas Agricultural 
Experiment Station Bulletin No. 1555 (1987) (hereinafter "Summers and Smith"). 

Prior to inserting the DNA sequence encoding the protein into the baculovirus 
genome, the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are usually assembled into an 
intermediate transplacement construct (transfer vector). This construct may contain a single 
gene and operably linked regulatory elements; multiple genes, each with its owned set of 
operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance 
in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to 
be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes into 
AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters the polyhedrin start codon from 
ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream from the 
ATT; see Luckow and Summers, Virology (1989) 77:31. 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. 
(1988) Ann. Rev. Microbiol, 42:111) and a prokaryotic ampicillin-resistance (amp) gene and 
origin of replication for selection and propagation in E. coli. 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus 
promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5' to 3') transcription of a coding sequence (e.g., structural gene) 
into mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription initiation site. A baculovirus 
transfer vector may also have a second domain called an enhancer, which, if present, is 
usually distal to the structural gene. Expression may be either regulated or constitutive. 

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide 
particularly useful promoter sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et al., (1986) "The Regulation of Baculovirus 
Gene Expression," in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler); EPO 
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Publ. Nos. 127 839 and 155 476; and the gene encoding the plO protein, Vlak et al., (1988), J. 
Gen. Virol 69:765. 

DNA encoding suitable signal sequences can be derived from genes for secreted insect 
or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) 
Gene, 73:409). Alternatively, since the signals for mammalian cell posttranslational 
modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) 
appear to be recognized by insect cells, and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, 
leaders of non-insect origin, such as those derived from genes encoding human (alpha) a- 
interferon, Maeda et al., (1985), Nature 315:592; human gastrin-releasing peptide, Lebacq- 
Verheyden et al., (1988), Molec. Cell Biol. <S:3129; human IL-2, Smith et al, (1985) Proc. 
Nat'l Acad. Set USA, 52:8404; mouse IL-3, (Miyajima et al., (1987) Gene 58:273; and human 
glucocerebrosidase, Martin et ah (1988) DNA, 7:99, can also be used to provide for secretion 
in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellular^ or, if it is 
expressed with the proper regulatory sequences, it can be secreted. Good intracellular 
expression of nonfused foreign proteins usually requires heterologous genes that ideally have 
a short leader sequence containing suitable translation initiation signals preceding an ATG 
start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein 
by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally secreted 
can be secreted from the insect cell by creating chimeric DNA molecules that encode a fusion 
protein comprised of a leader sequence fragment that provides for secretion of the foreign 
protein in insects. The leader sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic 
reticulum. 

After insertion of the DNA sequence and/or the gene encoding the expression product 
precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of 
the transfer vector and the genomic DNA of wild type baculovirus - usually by co- 
transfection. The promoter and transcription termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus virus are known in the art. (See 
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Summers and Smith supra; Ju et al. (1987); Smith et al., Mol Cell Biol (1983) 3:2156; and 
Luckow and Summers (1989)). For example, the insertion can be into a gene such as the 
polyhedrin gene, by homologous double crossover recombination; insertion can also be into a 
restriction enzyme site engineered into the desired baculovirus gene. Miller et al., (1989), 
Bioessays 4:91 . The DNA sequence, when cloned in place of the polyhedrin gene in the 
expression vector, is flanked both 5' and 3' by polyhedrin-specific sequences and is positioned 
downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the 
native virus, is produced at very high levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain 
embedded particles. These occlusion bodies, up to 15 \xm in size, are highly refractile, giving 
them a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus 
from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, the plaques are screened under the 
light microscope for the presence (indicative of wild-type virus) or absence (indicative of 
recombinant virus) of occlusion bodies. Current Protocols in Microbiology Vol. 2 (Ausubel 
et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, supra; Miller et al. (1989). 

Recombinant baculovirus expression vectors have been developed for infection into 
several insect cells. For example, recombinant baculoviruses have been developed for, inter 
alia: Aedes aegypti , Autographa californica, Bombyx mori, Drosophila melanogaster, 
Spodoptera frugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al., 
(1985) Virol 56:153; Wright (1986) Nature 527:718; Smith et al., (1983) Mol Cell Biol 
3:2156; and see generally, Fraser, et al (1989) In Vitro Cell Dev. Biol 25:225). 

Cells and cell culture media are commercially available for both direct and fusion 
expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See, e.g., Summers and Smith supra. 
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The modified insect cells may then be grown in an appropriate nutrient medium, 
which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to high 
density, and expression induced. Alternatively, where expression is constitutive, the product 
will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
nutrients. The product may be purified by such techniques as chromatography, e.g., HPLC, 
affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient 
centrifugation; solvent extraction, or the like. As appropriate, the product may be further 
purified, as required, so as to remove substantially any insect proteins which are also secreted 
in the medium or result from lysis of insect cells, so as to provide a product which is at least 
substantially free of host debris, e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host cells derived from the 
transfoimants are incubated under conditions which allow expression of the recombinant 
protein encoding sequence. These conditions will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to those of ordinary skill in the art, based 
upon what is known in the art. 

iv. Bacterial Systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA 
sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed proximal to the 5' end of the coding 
sequence. This transcription initiation region usually includes an RNA polymerase binding 
site and a transcription initiation site. A bacterial promoter may also have a second domain 
called an operator, that may overlap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negative regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby inhibit transcription of a specific gene. 
Constitutive expression may occur in the absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved by a gene activator protein binding 
sequence, which, if present is usually proximal (5') to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite activator protein (CAP), which helps 



WO 99/57280 



20 



PCT/US99/09346 



initiate transcription of the lac operon in Escherichia coli (E. coli) (Raibaud et al. (1984) 
Annu. Rev. Genet. 18:173). Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, 
such as galactose, lactose (lac) (Chang et al (1977) Nature 198:1056), and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp) (Goeddel et al (1980) Nuc. Acids Res. 5:4057; Yelverton et al. (1981) Nucl 
Acids Res. P:731; U.S. Patent 4,738,921; EPO Publ. Nos. 036 776 and 121 775). The beta- 
lactamase (bid) promoter system (Weissmann (1981) "The cloning of interferon and other 
mistakes/' In Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et al. (1981) 
Nature 292:12%) and T5 (U.S. Patent 4,689,406) promoter systems also provide useful 
promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as bacterial 
promoters. For example, transcription activation sequences of one bacterial or bacteriophage 
promoter may be joined with the operon sequences of another bacterial or bacteriophage 
promoter, creating a synthetic hybrid promoter (U.S. Patent 4,551,433). For example, the tac 
promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon 
sequences that is regulated by the lac repressor (Amann et al. (1983) Gene 25:167; de Boer et 
al. (1983) Proc. Natl Acad. Sci. 50:21). Furthermore, a bacterial promoter can include 
naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial 
RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial 
origin can also be coupled with a compatible RNA polymerase to produce high levels of 
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter 
system is an example of a coupled promoter system (Studier et al (1986) J. Mol Biol 
189:\U\ Tabor et al (1985) Proc Natl. Acad Sci. 52:1074). In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. 
No. 267 851). 

In addition to a functioning promoter sequence, an efficient ribosome binding site is 
also useful for the expression of foreign genes in prokaryotes. In E. coli, the ribosome binding 
site is called the Shine-Dalgarno (SD) sequence and includes an initiation codon (ATG) and a 
sequence 3-9 nucleotides in length located 3-1 1 nucleotides upstream of the initiation codon 
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(Shine et al. (1975) Nature 254:34). The SD sequence is thought to promote binding of 
mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of 
E. coli 16S rRNA (Steitz et al. (1979) "Genetic signals and nucleotide sequences in 
messenger RNA." In Biological Regulation and Development: Gene Expression (ed. R.F. 
Goldberger)). To express eukaryotic genes and prokaryotic genes with weak ribosome- 
binding site, it is often necessary to optimize the distance between the SD sequence and the 
ATG of the eukaryotic gene (Sambrook et al. (1989) "Expression of cloned genes in 
Escherichia coli." In Molecular Cloning: A Laboratory Manual). 

A DNA molecule may be expressed intracellularly. A promoter sequence may be 
directly linked with the DNA molecule, in which case the first amino acid at the N-terminus 
will always be a methionine, which is encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen 
bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal 
peptidase (EPO Publ. No. 219 237). 

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the bacteriophage lambda cell 
gene can be linked at the 5* terminus of a foreign gene and expressed in bacteria. The resulting 
fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the 
bacteriophage protein from the foreign gene (Nagai et al. (1984) Nature 509:810). Fusion 
proteins can also be made with sequences from the lacZ (Jia et al. (1 987] ) Gene 60:191), trpE 
(Allen et al. (1987) J. Biotechnol. 5:93; Makoff et al. (1989) J. Gen. Microbiol. 135:11), and 
Chey (EPO Publ. No. 324 647) genes. The DNA sequence at the junction of the two amino 
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion 
protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site 
for a processing enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin 
from the foreign protein. Through this method, native foreign protein can be isolated (Miller 
et al. (1989) Bio/Technology 7:698). 

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric 
DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment 
that provides for secretion of the foreign protein in bacteria (U.S. Patent 4,336,336). The 
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signal sequence fragment usually encodes a signal peptide comprised of hydrophobic amino 
acids which direct the secretion of the protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing 
sites, which can be cleaved either in vivo or in vitro encoded between the signal peptide 
fragment and the foreign gene. 

DNA encoding suitable signal sequences can be derived from genes for secreted 
bacterial proteins, such as the E. coli outer membrane protein gene (ompA) (Masui et al. 
(1983), in: Experimental Manipulation of Gene Expression; Ghrayeb et al. (1984) EMBO J. 
3:2431) and the E. coli alkaline phosphatase signal sequence (phoA) (Oka et al. (1985) Proc. 
Natl. Acad. Sci. 82:1212). As an additional example, the signal sequence of the alpha-amylase 
gene from various Bacillus strains can be used to secrete heterologous proteins from B. 
subtilis (Palva et al. (1982) Proc. Natl. Acad. Sci. USA 7P:5582; EPO Publ. No. 244 042). 

Usually, transcription termination sequences recognized by bacteria are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Transcription termination sequences 
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop 
structures that aid in terminating transcription. Examples include transcription termination 
sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Usually, the above described components, comprising a promoter, signal sequence (if 
desired), coding sequence of interest, and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often maintained in a replicon, such as 
an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
bacteria. The replicon will have a replication system, thus allowing it to be maintained in a 
prokaryotic host either for expression or for cloning and amplification. In addition, a replicon 
may be either a high or low copy number plasmid. A high copy number plasmid will 
generally have a copy number ranging from about 5 to about 200, and usually about 10 to 
about 150. A host containing a high copy number plasmid will preferably contain at least 
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number 
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vector may be selected, depending upon the effect of the vector and the foreign protein on the 
host. 

Alternatively, the expression constructs can be integrated into the bacterial genome 
with an integrating vector. Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the vector to integrate. Integrations 
appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various 
Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating 
vectors may also be comprised of bacteriophage or transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin 
(neomycin), and tetracycline (Davies et al. (1978) Annu. Rev. Microbiol. 32:469). Selectable 
markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and 
leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are usually comprised of a selectable market 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For example, 
expression vectors have been developed for, inter alia, the following bacteria: Bacillus 
subtilis (Palva et al. (1982) Proc. Natl. Acad. Sci. USA 7P:5582; EPO Publ. Nos. 036 259 and 
063 953; PCT Publ. No. WO 84/04541), Escherichia coli (Shimatake et al. (1981) Nature 
292:128; Amann et al. (1985) Gene 40:183; Studier et al. (1986) J. Mol. Biol. 189:113; EPO 
Publ. Nos. 036 776, 136 829 and 136 907), Streptococcus cremoris (Powell et al. (1988) Appl. 
Environ. Microbiol. 54:655); Streptococcus lividans (Powell et al. (1988) Appl. Environ. 
Microbiol. 54:655), Streptomyces lividans (U.S. Patent 4,745,056). 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, 
and usually include either the transformation of bacteria treated with CaCl 2 or other agents, 
such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by 
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electroporation. Transformation procedures usually vary with the bacterial species to be 
transformed. (See e.g., use of Bacillus: Masson et al. (1989) FEMS Microbiol. Lett. 60:213; 
Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. Nos. 036 259 and 063 953; 
PCT Publ. No. WO 84/04541; use of Campylobacter: Miller et al. (1988) Proc. Natl. Acad. 
Sci. 85:856; and Wang et al. (1990) J. Bacteriol. 1 72:949; use of Escherichia coli: Cohen et 
al. (1973) Proc. Natl. Acad. Sci. 69:2110; Dower etal. (1988) Nucleic Acids Res. 76:6127; 
Kushner (1978) "An improved method for transformation of Escherichia coli with ColEl- 
derived plasmids. In Genetic Engineering: Proceedings of the International Symposium on 
Genetic Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et al. (1970) J. Mol. Biol. 
55:159; Taketo (1988) Biochim. Biophys. Acta 949:318; use of Lactobacillus: Chassy et al. 
(1987) FEMS Microbiol. Lett. 44:113; use of Pseudomonas: Fiedler et al. (1988) Anal. 
Biochem 170:38; use of Staphylococcus: Augustin et al. (1990) FEMS Microbiol. Lett. 
66:203; use of Streptococcus: Barany et al. (1980) J. Bacteriol. 144:698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation, in: Streptococcal Genetics (ed. J. 
Ferretti and R. Curtiss III); Perry et al. (1981) Infect. Immun. 52:1295; Powell et al. (1988) 
Appl. Environ. Microbiol. 54:655; Somkuti et al. (1987) Proc. 4th Evr. Cong. Biotechnology 
7:412. 

v. Yeast Expression 

Yeast expression systems are also known to one of ordinary skill in the art. A yeast 
promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 5' 
end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast 
promoter may also have a second domain called an upstream activator sequence (UAS), 
which, if present, is usually distal to the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby either enhancing or reducing 
transcription. 

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences 
encoding enzymes in the metabolic pathway provide particularly useful promoter sequences. 
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Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, 
glucokinase, glucose- 6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase 
(GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and 
pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PH05 gene, encoding acid 
phosphatase, also provides useful promoter sequences (Myanohara et al. (1983) Proc. Natl 
Acad.ScL USA 80:1). 

In addition, synthetic promoters which do not occur in nature also function as yeast 
promoters. For example, UAS sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to 
the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other 
examples of hybrid promoters include promoters which consist of the regulatory sequences of 
either the ADH2, GAL4, GAL JO, OR PH05 genes, combined with the transcriptional 
activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). 
Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin 
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of 
such promoters include, inter alia, (Cohen et al. (1980) Proc. Natl Acad. Sci. USA 77:1078; 
Henikoff et al. (1981) Nature 255:835; Hollenberg et al. (1981) Curr. Topics Microbiol. 
Immunol. 96:119; Hollenberg et al. (1979) "The Expression of Bacterial Antibiotic Resistance 
Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical, Environmental and 
Commercial Importance (eds. K.N. Timmis and A. Puhler); Mercerau-Puigalon et al. (1980) 
Gene 77:163; Panthier et al. (1980) Curr. Genet. 2:109;). 

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may 
be directly linked with the DNA molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, plant, baculovirus, and bacterial expression systems. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the yeast or human 
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superoxide dismutase (SOD) gene, can be linked at the 5' terminus of a foreign gene and 
expressed in yeast. The DNA sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. See e.g., EPO Publ. No. 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that 
preferably retains a site for a processing enzyme (e.g. ubiquitin-specific processing protease) 
to cleave the ubiquitin from the foreign protein. Through this method, therefore, native 
foreign protein can be isolated (e.g., WO88/024066). 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provide for secretion in yeast of the foreign protein. Preferably, there 
are processing sites encoded between the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted yeast 
proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62:096,086) and the A-factor gene (U.S. Patent 4,588,684). Alternatively, leaders of non- 
yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Publ. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast 
alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types 
of alpha-factor fragments that can be employed include the full-length pre-pro alpha factor 
leader (about 83 amino acid residues) as well as truncated alpha-factor leaders (usually about 
25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO Publ. 
No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for 
secretion include hybrid alpha-factor leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alphafactor. (See e.g., PCT Publ. No. WO 89/02463.) 

Usually, transcription termination sequences recognized by yeast are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
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sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 

Usually, the above described components, comprising a promoter, leader (if desired), 
coding sequence of interest, and transcription termination sequence, are put together into 
expression constructs. Expression constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a prokaryotic host for cloning and 
amplification. Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et al 
(1979) Gene 5:17-24), pCI/1 (Brake et al (1984) Proc. Natl. Acad. Sci USA 57:4642-4646), 
and YRpl7 (Stinchcomb et al. (1982) J. Mol Biol 158:151). In addition, a replicon may be 
either a high or low copy number plasmid. A high copy number plasmid will generally have a 
copy number ranging from about 5 to about 200, and usually about 10 to about 150. A host 
containing a high copy number plasmid will preferably have at least about 10, and more 
preferably at least about 20. Enter a high or low copy number vector may be selected, 
depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake et 
al, supra. 

Alternatively, the expression constructs can be integrated into the yeast genome with 
an integrating vector. Integrating vectors usually contain at least one sequence homologous to 
a yeast chromosome that allows the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. Integrations appear to result from 
recombinations between homologous DNA in the vector and the yeast chromosome (Orr- 
Weaver et al. (1983) Methods in Enzymol. 707:228-245). An integrating vector may be 
directed to a specific locus in yeast by selecting the appropriate homologous sequence for 
inclusion in the vector. See Orr- Weaver et al, supra. One or more expression construct may 
integrate, possibly affecting levels of recombinant protein produced (Rine et al. (1983) Proc. 
Natl Acad. Sci. USA 80:6150). The chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results in the integration of the entire vector, or 
two segments homologous to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the stable integration of only the 
expression construct. 
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Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as ADE2, HIS4, LEXJ2, TRP1, and ALG7, and the G418 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G418, respectively. In addition, a suitable 
selectable marker may also provide yeast with the ability to grow in the presence of toxic 
compounds, such as metal. For example, the presence of CUP1 allows yeast to grow in the 
presence of copper ions (Butt et al (1987) Microbiol Rev. 57:351). 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For example, 
expression vectors and methods of introducing exogenous DNA into yeast hosts have been 
developed for, inter alia, the following yeasts: Candida albicans (Kurtz, et al. (1986) Mol 
Cell Biol 6:142); Candida maltosa (Kunze, etal. (1985)7. Basic Microbiol 2.5:141); 
Hansenula polymorpha (Gleeson, et al (1986)7. Gen. Microbiol 1 32:3459; Roggenkamp et 
al (1986) Mol Gen. Genet. 202:302); Kluyveromyces fragilis (Das, et al (1984) 7. Bacterid 
158:1 165); Kluyveromyces lactis (De Louvencourt et al (1983) 7 Bacteriol 154:737; Van 
den Berg et al (1990) Bio/Technology 5:135); Pichia guillerimondii (Kunze et al (1985) 7 
Basic Microbiol 25:141); Pichia pastoris (Cregg, et al (1985) Mol. Cell Biol 5:3376; U.S. 
Patent Nos. 4,837,148 and 4,929,555); Saccharomyces cerevisiae (Hinnen et al (1978) Proc. 
Natl Acad. Sci. USA 75:1929; Ito etal (1983)7. Bacteriol 755:163); Schizosaccharomyces 
pombe (Beach and Nurse (1981) Nature 300:706); and Yarrowia lipolytica (Davidow, et al. 
(1985) Curr. Genet. 70:380471 Gaillardin, etal (1985) Curr. Genet. 70:49). 

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, 
and usually include either the transformation of spheroplasts or of intact yeast cells treated 
with alkali cations. Transformation procedures usually vary with the yeast species to be 
transformed. See e.g., [Kurtz et al (1986) Mol Cell Biol. 6:142; Kunze et al (1985) 7 Basic 
Microbiol 25:141; Candida]; [Gleeson et al (1986)7 Gen. Microbiol 7J2:3459; 
Roggenkamp etal (1986) Mol Gen. Genet. 202:302; Hansenula]; [Das etal. (1984)7 
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Bacteriol. 158:1 165; De Louvencourt et al. (1983) /. Bacteriol. 154:1165; Van den Berg et al. 
(1990) Bio/Technology 8:135; Kluyveromyces]; [Cregge/o/. (1985) Mol. Cell. Biol. 5:3376; 
Kunze et al. (1985) J. Basic Microbiol. 25:141; U.S. Patent Nos. 4,837,148 and 4,929,555; 
Pichia]; [Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 75;1929; Ito et al. (1983) J. 
Bacteriol. 755:163 Saccharomyces]; [Beach and Nurse (1981) Nature 300:706; 
Schizosaccharomyces]; [Davidow et al. (1985) Curr. Genet. 10:39; Gaillardin et al. (1985) 
Curr. Genet. 10:49; Yarrowia]. 

Definitions 

A composition containing X is "substantially free of Y when at least 85% by weight 
of the total X+Y in the composition is X. Preferably, X comprises at least about 90% by 
weight of the total of X+Y in the composition, more preferably at least about 95% or even 
99% by weight. 

A "conserved" Neisseria amino acid fragment or protein is one that is present in a 
particular Neisserial protein in at least x% of Neisseria. The value of x may be 50% or more, 
e.g., 66%, 75%, 80%, 90%, 95% or even 100% (i.e. the amino acid is found in the protein in 
question in all Neisseria). In order to determine whether an animo acid is "conserved" in a 
particular Neisserial protein, it is necessary to compare that amino acid residue in the 
sequences of the protein in question from a plurality of different Neisseria (a reference 
population). The reference population may include a number of different Neisseria species 
or may include a single species. The reference population may include a number of different 
serogroups of a particular species or a single serogroup. A preferred reference population 
consists of the 5 most common Neisseria strains. 

The term "heterologous" refers to two biological components that are not found 
together in nature. The components may be host cells, genes, or regulatory regions, such as 
promoters. Although the heterologous components are not found together in nature, they can 
function together, as when a promoter heterologous to a gene is operably linked to the gene. 
Another example is where a Neisserial sequence is heterologous to a mouse host cell. 

"Epitope" means antigenic determinant, and may elicit a cellular and/or humoral 
response. 

Conditions for "high stringency" are 65 degrees C in 0.1 xSSC 0.5% SDS solution. 
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An "origin of replication" is a polynucleotide sequence that initiates and regulates 
replication of polynucleotides, such as an expression vector. The origin of replication behaves 
as an autonomous unit of polynucleotide replication within a cell, capable of replication under 
its own control. An origin of replication may be needed for a vector to replicate in a particular 
host cell. With certain origins of replication, an expression vector can be reproduced at a high 
copy number in the presence of the appropriate proteins within the cell. Examples of origins 
are the autonomously replicating sequences, which are effective in yeast; and the viral 
T-antigen, effective in COS-7 cells. 

A "mutant" sequence is defined as a DNA, RNA or amino acid sequence differing 
from but having homology with the native or disclosed sequence. Depending on the 
particular sequence, the degree of homology (sequence identity) between the native or 
disclosed sequence and the mutant sequence is preferably greater than 50% (e.g., 60%, 70%, 
80%, 90%, 95%, 99% or more) which is calculated as described above. As used herein, an 
"allelic variant" of a nucleic acid molecule, or region, for which nucleic acid sequence is 
provided herein is a nucleic acid molecule, or region, that occurs at essentially the same locus 
in the genome of another or second isolate, and that, due to natural variation caused by, for 
example, mutation or recombination, has a similar but not identical nucleic acid sequence. A 
coding region allelic variant typically encodes a protein having similar activity to that of the 
protein encoded by the gene to which it is being compared. An allelic variant can also 
comprise an alteration in the 5* or 3' untranslated regions of the gene, such as in regulatory 
control regions, (see, for example, U.S. Patent 5,753,235). 

Antibodies 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
composed of at least one antibody combining site. An "antibody combining site" is the 
three-dimensional binding space with an internal surface shape and charge distribution 
complementary to the features of an epitope of an antigen, which allows a binding of the 
antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid 
antibodies, chimeric antibodies, humanized antibodies, altered antibodies, univalent 
antibodies, Fab proteins, and single domain antibodies. 

Antibodies against the proteins of the invention are useful for affinity 
chromatography, immunoassays, and distinguishing/identifying Neisseria menB proteins. 
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Antibodies elicited against the proteins of the present invention bind to antigenic polypeptides 
or proteins or protein fragments that are present and specifically associated with strains of 
Neisseria meningitidis menB. In some instances, these antigens may be associated with 
specific strains, such as those antigens specific for the menB strains. The antibodies of the 
invention may be immobilized to a matrix and utilized in an immunoassay or on an affinity 
chromatography column, to enable the detection and/or separation of polypeptides, proteins or 
protein fragments or cells comprising such polypeptides, proteins or protein fragments. 
Alternatively, such polypeptides, proteins or protein fragments may be immobilized so as to 
detect antibodies bindably specific thereto. 

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the protein is first used to immunize a suitable 
animal, preferably a mouse, rat, rabbit or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of 
labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing 
or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 |ig/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may alternatively generate antibodies by 
in vitro immunization using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. Polyclonal antisera is obtained by 
bleeding the immunized animal into a glass or plastic container, incubating the blood at 25°C 
for one hour, followed by incubating at 4°C for 2-18 hours. The serum is recovered by 
centrifugation (e.g., l,000g for 10 minutes). About 20-50 ml per bleed may be obtained from 
rabbits. 

Monoclonal antibodies are prepared using the standard method of Kohler & Milstein 
{Nature (1975) 256:495-96), or a modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract serum, 
the spleen (and optionally several large lymph nodes) is removed and dissociated into single 
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent 
cells) by applying a cell suspension to a plate or well coated with the protein antigen. B -cells 
that express membrane-bound immunoglobulin specific for the antigen bind to the plate, and 
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are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are assayed for the production of 
antibodies which bind specifically to the immunizing antigen (and which do not bind to 
unrelated antigens). The selected MAb-secreting hybridomas are then cultured either in vitro 
(e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive 
atoms (particularly 32 P and 125 I), electron-dense reagents, enzymes, and ligands having 
specific binding partners. Enzymes are typically detected by their activity. For example, 
horseradish peroxidase is usually detected by its ability to convert 
3,3\5, S'-tetramethylbenzidine (TMB) to a blue pigment, quantifiable with a 
spectrophotometer. "Specific binding partner" refers to a protein capable of binding a ligand 
molecule with high specificity, as for example in the case of an antigen and a monoclonal 
antibody specific therefor. Other specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the numerous receptor-ligand couples known in the art. It 
should be understood that the above description is not meant to categorize the various labels 
into distinct classes, as the same label may serve in several different modes. For example, 125 I 
may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or 
as antigen for a MAb. Further, one may combine various labels for desired effect. For 
example, MAbs and avidin also require labels in the practice of this invention: thus, one 
might label a MAb with biotin, and detect its presence with avidin labeled with l25 I, or with 
an anti-biotin MAb labeled with HRP. Other permutations and possibilities will be readily 
apparent to those of ordinary skill in the art, and are considered as equivalents within the 
scope of the instant invention. 

Antigens, immunogens, polypeptides, proteins or protein fragments of the present 
invention elicit formation of specific binding partner antibodies. These antigens, 
immunogens, polypeptides, proteins or protein fragments of the present invention comprise 
immunogenic compositions of the present invention. Such immunogenic compositions may 
further comprise or include adjuvants, carriers, or other compositions that promote or enhance 
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or stabilize the antigens, polypeptides, proteins or protein fragments of the present invention. 
Such adjuvants and carriers will be readily apparent to those of ordinary skill in the art. 

Pharmaceutical Compositions 

Pharmaceutical compositions can comprise (include) either polypeptides, antibodies, 
or nucleic acid of the invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature, when given to a patient that is febrile. The 
precise effective amount for a subject will depend upon the subject's size and health, the 
nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be determined by routine 
experimentation and is within the judgment of the clinician. 

For purposes of the present invention, an effective dose will be from about 0.01 mg/ 
kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to 
which it is administered. 

A pharmaceutical composition can also contain a pharmaceutical^ acceptable carrier. 
The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a 
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
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organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutical^ acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutical^ acceptable carriers in therapeutic compositions may contain liquids 
such as water, saline, glycerol and ethanol. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
Typically, the therapeutic compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Liposomes are included within the definition of a 
pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be 
treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Other 
modes of administration include oral and pulmonary administration, suppositories, and 
transdermal and transcutaneous applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Vaccines 

Vaccines according to the invention may either be prophylactic (i.e., to prevent 
infection) or therapeutic (i.e., to treat disease after infection). 

Such vaccines comprise immunizing antigen(s) or immunogen(s), immunogenic 
polypeptide, protein(s) or protein fragments, or nucleic acids (e.g., ribonucleic acid or 
deoxyribonucleic acid), usually in combination with "pharmaceutically acceptable carriers," 
which include any carrier that does not itself induce the production of antibodies harmful to 
the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 



WO 99/57280 



35 



PCT/US99/09346 



liposomes), and inactive virus particles. Such carriers are well known to those of ordinary 
skill in the art. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the immunogen or antigen may be conjugated to a bacterial 
toxoid, such as a toxoid from diphtheria, tetanus, cholera, K pylori, etc. pathogens. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, 
aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific 
immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall 
components), such as for example (a) MF59 (PCT Publ. No. WO 90/14837), containing 5% 
Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of 
MTP-PE (see below), although not required) formulated into submicron particles using a 
rnicrofluidizer such as Model HOY micro fluidizer (Microfluidics, Newton, MA), (b) SAF, 
containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP 
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, 
MT) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall 
components from the group consisting of monophosphorylipid A (MPL), trehalose 
dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™); 

(3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be 
used or particles generated therefrom such as ISCOMs (immunostimulating complexes); 

(4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IF A); 

(5) cytokines, such as interleukins (e.g., IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), 
interferons (e.g., gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 
necrosis factor (TNF), etc; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such 
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), particularly 
LT-K63, LT-R72, CT-S109, PT-K9/G129; see, e.g., WO 93/13302 and WO 92/19265; and 
(7) other substances that act as immunostimulating agents to enhance the effectiveness of the 
composition. Alum and MF59 are preferred. 

As mentioned above, muramyl peptides include, but are not limited to, N-aeetyl- 
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D- 
isoglutamine (nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2~(r-2 , - 
dipalmitoyl^n-glycero-3-huydroxyphosphoryloxy)-ethylamine (MTP-PE), eta 
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The vaccine compositions comprising immunogenic compositions (e.g., which may 
include the antigen, pharmaceutical^ acceptable carrier, and adjuvant) typically will contain 
diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such 
as wetting or emulsifying agents, pH buffering substances, and the like, may be present in 
such vehicles. Alternatively, vaccine compositions comprising immunogenic compositions 
may comprise an antigen, polypeptide, protein, protein fragment or nucleic acid in a 
pharmaceutical^ acceptable carrier. 

More specifically, vaccines comprising immunogenic compositions comprise an 
immunologically effective amount of the immunogenic polypeptides, as well as any other of 
the above-mentioned components, as needed. By "immunologically effective amount", it is 
meant that the administration of that amount to an individual, either in a single dose or as part 
of a series, is effective for treatment or prevention. This amount varies depending upon the 
health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g., nonhuman primate, primate, etc.), the capacity of the 
individual's immune system to synthesize antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 
relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the vaccine compositions or immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or 
suspension in, liquid vehicles prior to injection may also be prepared. The preparation also 
may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed 
above under pharmaceutically acceptable carriers. 

The immunogenic compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary formulations, suppositories, and 
transdermal and transcutaneous applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other 
immunoregulatory agents. 

As an alternative to protein-based vaccines, DNA vaccination may be employed (e.g., 
Robinson & Torres (1997) Seminars in Immunology 9:271-283; Donnelly et al (1997) Annu 
Rev Immunol 15:617-648). 
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Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs, including a coding sequence of a 
therapeutic of the invention, to be delivered to the mammal for expression in the mammal, can 
be administered either locally or systemically. These constructs can utilize viral or non-viral 
vector approaches in in vivo or ex vivo modality. Expression of such coding sequence can be 
induced using endogenous mammalian or heterologous promoters. Expression of the coding 
sequence in vivo can be either constitutive or regulated. 

The invention includes gene delivery vehicles capable of expressing the contemplated 
nucleic acid sequences. The gene delivery vehicle is preferably a viral vector and, more 
preferably, a retroviral, adenoviral, adeno-associated viral (AAV), herpes viral, or alphavirus 
vector. The viral vector can also be an astrovirus, coronavirus, orthomyxovirus, papovavirus, 
paramyxovirus, parvovirus, picornavirus, poxvirus, or togavirus viral vector. See generally, 
Jolly (1994) Cancer Gene Therapy 1:51 -64; Kimura (1994) Human Gene Therapy 5:845-852; 
Connelly (1995) Human Gene Therapy 6:185-193; and Kaplitt (1994) Nature Genetics 
6:148-153. 

Retroviral vectors are well known in the art, including B, C and D type retroviruses, 
xenotropic retroviruses (for example, NZB-X1, NZB-X2 and NZB9-1 (see O'Neill (1985) 1 
Virol 53:160) polytropic retroviruses e.g., MCF and MCF-MLV (see Kelly (1983) J. Virol 
45:291), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold 
Spring Harbor Laboratory, 1985. 

Portions of the retroviral gene therapy vector may be derived from different 
retroviruses. For example, retrovector LTRs may be derived from a Murine Sarcoma Virus, a 
tRNA binding site from a Rous Sarcoma Virus, a packaging signal from a Murine Leukemia 
Virus, and an origin of second strand synthesis from an Avian Leukosis Virus. 

These recombinant retroviral vectors may be used to generate transduction competent 
retroviral vector particles by introducing them into appropriate packaging cell lines (see US 
patent 5,591,624). Retrovirus vectors can be constructed for site-specific integration into host 
cell DNA by incorporation of a chimeric integrase enzyme into the retroviral particle (see 
W096/37626). It is preferable that the recombinant viral vector is a replication defective 
recombinant virus. 
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Packaging cell lines suitable for use with the above-described retrovirus vectors are 
well known in the art, are readily prepared (see WO95/30763 and WO92/05266), and can be 
used to create producer cell lines (also termed vector cell lines or "VCLs") for the production 
of recombinant vector particles. Preferably, the packaging cell lines are made from human 
parent cells (e.g., HT1080 cells) or mink parent cell lines, which eliminates inactivation in 
human serum. 

Preferred retroviruses for the construction of retroviral gene therapy vectors include 
Avian Leukosis Virus, Bovine Leukemia, Virus, Murine Leukemia Virus, Mink-Cell 
Focus-Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis Virus and Rous 
Sarcoma Virus. Particularly preferred Murine Leukemia Viruses include 4070A and 1504A 
(Hartley and Rowe (1976) J Virol 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC 
No. VR-245), Graffi, Gross (ATCC Nol VR-590), Kirsten, Harvey Sarcoma Virus and 
Rauscher (ATCC No. VR-998) and Moloney Murine Leukemia Virus (ATCC No. VR-190). 
Such retroviruses may be obtained from depositories or collections such as the American 
Type Culture Collection ("ATCC") in Rockville, Maryland or isolated from known sources 
using commonly available techniques. 

Exemplary known retroviral gene therapy vectors employable in this invention include 
those described in patent applications GB2200651, EP04 15731, EP0345242, EP0334301, 
WO89/02468; WO89/05349, WO89/09271, WO90/02806, WO90/07936, WO94/03622, 
W093/25698, W093/25234, WO93/11230, WO93/10218, WO91/02805, WO91/02825, 
WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US 
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 
53:962-967; Ram (1993) Cancer Res 53 (1993) 83-88; Takamiya (1992) J Neurosci Res 
33:493-503; Baba (1993) JNeurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) 
Proc Natl Acad Sci 81:6349; and Miller (1990) Human Gene Therapy 1. 

Human adenoviral gene therapy vectors are also known in the art and employable in 
this invention. See, for example, Berkner (1988) Biotechniques 6:616 and Rosenfeld (1991) 
Science 252:431, and WO93/07283, WO93/06223, and WO93/07282. Exemplary known 
adenoviral gene therapy vectors employable in this invention include those described in the 
above referenced documents and in W094/ 12649, WO93/03769, W093/19191, 
W094/28938, W095/11984, WO95/00655, WO95/27071, W095/29993, W095/34671, 
WO96/05320, WO94/08026, WO94/11506, WO93/06223, W094/24299, WO95/14102, 
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W095/24297, WO95/02697, W094/28152, W094/24299, WO95/09241, WO95/25807, 
WO95/05835, W094/18922 and WO95/09654. Alternatively, administration of DNA linked 
to killed adenovirus as described in Curiel (1992) Hum. Gene Ther. 3:147-154 may be 
employed. The gene delivery vehicles of the invention also include adenovirus associated 
virus (AAV) vectors. Leading and preferred examples of such vectors for use in this invention 
are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred AAV 
vectors comprise the two AAV inverted terminal repeats in which the native D-sequences are 
modified by substitution of nucleotides, such that at least 5 native nucleotides and up to 18 
native nucleotides, preferably at least 10 native nucleotides up to 18 native nucleotides, most 
preferably 10 native nucleotides are retained and the remaining nucleotides of the D-sequence 
are deleted or replaced with non-native nucleotides. The native D-sequences of the AAV 
inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted 
terminal repeat (i.e., there is one sequence at each end) which are not involved in HP 
formation. The non-native replacement nucleotide may be any nucleotide other than the 
nucleotide found in the native D-sequence in the same position. Other employable exemplary 
AAV vectors are pWP-19, pWN-1, both of which are disclosed in Nahreini (1993) Gene 
124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987) J. 
Virol 61:3096). Another exemplary AAV vector is the Double-D ITR vector. Construction of 
the Double-D ITR vector is disclosed in US Patent 5,478,745. Still other vectors are those 
disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent 5,139,941, Chartejee US 
Patent 5,474,935, and Kotin W094/288157. Yet a further example of an AAV vector 
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and 
albumin promoter and directs expression predominantly in the liver. Its structure and 
construction are disclosed in Su (1996) Human Gene Therapy 7:463-470. Additional AAV 
gene therapy vectors are described in US 5,354,678, US 5,173,414, US 5,139,941, and US 
5,252,479. 

The gene therapy vectors comprising sequences of the invention also include herpes 
vectors. Leading and preferred examples are herpes simplex virus vectors containing a 
sequence encoding a thymidine kinase polypeptide such as those disclosed in US 5,288,641 
and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include 
HFEM/ICP6-LacZ disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller 
(1988) Science 241:1667-1669 and in WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ 
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described in Fink (1992) Human Gene Therapy 3:1 1-19 and HSV 7134, 2 RH 105 and GAL4 
described in EP 0453242 (Breakefield), and those deposited with the ATCC as accession 
numbers ATCC VR-977 and ATCC VR-260. 

Also contemplated are alpha virus gene therapy vectors that can be employed in this 
invention. Preferred alpha virus vectors are Sindbis viruses vectors. Togaviruses, Semliki 
Forest virus (ATCC VR-67; ATCC VR-1247), Middleberg virus (ATCC VR-370), Ross 
River virus (ATCC VR-373; ATCC VR-1246), Venezuelan equine encephalitis virus (ATCC 
VR923; ATCC VR-1250; ATCC VR-1249; ATCC VR-532), and those described in US 
patents 5,091,309, 5,217,879, and WO92/10578. More particularly, those alpha virus vectors 
described in U.S. Serial No. 08/405,627, filed March 15, 1995,W094/21792, WO92/10578, 
WO95/07994, US 5,091,309 and US 5,217,879 are employable. Such alpha viruses may be 
obtained from depositories or collections such as the ATCC in Rockville, Maryland or 
isolated from known sources using commonly available techniques. Preferably, alphavirus 
vectors with reduced cytotoxicity are used (see USSN 08/679640). 

DNA vector systems such as eukarytic layered expression systems are also useful for 
expressing the nucleic acids of the invention. SeeWO95/07994 for a detailed description of 
eukaryotic layered expression systems. Preferably, the eukaryotic layered expression systems 
of the invention are derived from alphavirus vectors and most preferably from Sindbis viral 
vectors. 

Other viral vectors suitable for use in the present invention include those derived from 
poliovirus, for example ATCC VR-58 and those described in Evans, Nature 339 (1989) 385 
and Sabin (1973) J. Biol. Standardization 1:115; rhino virus, for example ATCC VR-1 1 10 and 
those described in Arnold (1990) / Cell Biochem L401; pox viruses such as canary pox virus 
or vaccinia virus, for example ATCC VR-1 1 1 and ATCC VR-2010 and those described in 
Fisher-Hoch (1989) Proc Natl Acad Sci 86:317; Flexner (1989) Ann NY Acad Sci 569:86, 
Flexner (1990) Vaccine 8:17; in US 4,603,1 12 and US 4,769,330 and WO89/01973; SV40 
virus, for example ATCC VR-305 and those described in Mulligan (1979) Nature 277:108 
and Madzak (1992) J Gen Virol 73:1533; influenza virus, for example ATCC VR-797 and 
recombinant influenza viruses made employing reverse genetics techniques as described in 
US 5,166,057 and in Enami (1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) 
J Virol 65:271 1-2713 and Luytjes (1989) Cell 59:1 10, (see also McMichael (1983) NEJMed 
309:13, and Yap (1978) Nature 273:238 and Nature (1979) 277:108); human 
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immunodeficiency virus as described in EP-0386882 and in Buchschacher (1992) J. Virol. 
66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP- 
0440219; Aura virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 
and ATCC VR-1240; Cabassou virus, for example ATCC VR-922; Chikungunya virus, for 
example ATCC VR-64 and ATCC VR-1241; Fort Morgan Virus, for example ATCC 
VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus, 
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for 
example ATCC VR-580 and ATCC VR-1244; Ndumu virus, for example ATCC VR-371; 
Pixuna virus, for example ATCC VR-372 and ATCC VR-1245; Tonate virus, for example 
ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for example ATCC 
VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC 
VR-375; O'Nyong virus, Eastern encephalitis virus, for example ATCC VR-65 and ATCC 
VR-1242; Western encephalitis virus, for example ATCC VR-70, ATCC VR-1251, ATCC 
VR-622 and ATCC VR-1252; and coronavirus, for example ATCC VR-740 and those 
described in Hamre ( 1 966) Proc Soc Exp Biol Med 1 2 1 : 1 90. 

Delivery of the compositions of this invention into cells is not limited to the above 
mentioned viral vectors. Other delivery methods and media may be employed such as, for 
example, nucleic acid expression vectors, polycationic condensed DNA linked or unlinked to 
killed adenovirus alone, for example see US Serial No. 08/366,787, filed December 30, 1994 
and Curiel (1992) Hum Gene Ther 3:147-154 ligand linked DNA, for example see Wu (1989) 
J Biol Chem 264:16985-16987, eucaryotic cell delivery vehicles cells, for example see US 
Serial No.08/240,030, filed May 9, 1994, and US Serial No. 08/404,796, deposition of 
photopolymerized hydrogel materials, hand-held gene transfer particle gun, as described in 
US Patent 5,149,655, ionizing radiation as described in US5,206,152 and in W092/1 1033, 
nucleic charge neutralization or fusion with cell membranes. Additional approaches are 
described in Philip (1994) Mol Cell Biol 14:241 1-2418 and in Woffendin (1994) Proc Natl 
AcadSci 91:1581-1585. 

Particle mediated gene transfer may be employed, for example see US Serial No. 
60/023,867. Briefly, the sequence can be inserted into conventional vectors that contain 
conventional control sequences for high level expression, and then incubated with synthetic 
gene transfer molecules such as polymeric DNA-binding cations like polylysine, protamine, 
and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & 
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Wu (1987) J. Biol Chem. 262:4429-4432, insulin as described in Hucked (1990) Biochem 
Pharmacol 40:253-263, galactose as described in Plank (1992) Bioconjugate Chem 
3:533-539, lactose or transferrin. 

Naked DNA may also be employed to transform a host cell. Exemplary naked DNA 
introduction methods are described in WO 90/1 1092 and US 5,580,859. Uptake efficiency 
may be improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method may be improved 
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. 

Liposomes that can act as gene delivery vehicles are described in U.S. 5,422,120, 
W095/13796, W094/23697, W091/14445 and EP-524,968. As described in USSN. 
60/023,867, on non-viral delivery, the nucleic acid sequences encoding a polypeptide can be 
inserted into conventional vectors that contain conventional control sequences for high level 
expression, and then be incubated with synthetic gene transfer molecules such as polymeric 
DNA-binding cations like polylysine, protamine, and albumin, linked to cell targeting ligands 
such as asialoorosomucoid, insulin, galactose, lactose, or transferrin. Other delivery systems 
include the use of liposomes to encapsulate DNA comprising the gene under the control of a 
variety of tissue-specific or ubiquitously-active promoters. Further non-viral delivery suitable 
for use includes mechanical delivery systems such as the approach described in Woffendin et 
al (1994) Proa Natl. Acad. ScL USA 91(24):1 1581 -1 1585. Moreover, the coding sequence 
and the product of expression of such can be delivered through deposition of 
photopolymerized hydrogel materials. Other conventional methods for gene delivery that can 
be used for delivery of the coding sequence include, for example, use of hand-held gene 
transfer particle gun, as described in U.S. 5,149,655; use of ionizing radiation for activating 
transferred gene, as described in U.S. 5,206,152 and W092/1 1033. 

Exemplary liposome and polycationic gene delivery vehicles are those described in 
US 5,422,120 and 4,762,915; inWO 95/13796; W094/23697; and W091/14445; in EP- 
0524968; and in Stryer, Biochemistry, pages 236-240 (1975) W.H. Freeman, San Francisco; 
Szoka (1980) Biochem Biophys Acta 600:1; Bayer (1979) Biochem Biophys Acta 550:464; 
Rivnay (1987) Meth Enzymol 149:119; Wang (1987) Proc Natl Acad Sci 84:7851; Plant 
(1989) Anal Biochem 176:420. 
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A polynucleotide composition can comprises therapeutically effective amount of a 
gene therapy vehicle, as the term is defined above. For purposes of the present invention, an 
effective dose will be from about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg 
of the DNA constructs in the individual to which it is administered. 

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be 
administered (1) directly to the subject; (2) delivered ex vivo, to cells derived from the 
subject; or (3) in vitro for expression of recombinant proteins. The subjects to be treated can 
be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or lesion. 
Other modes of administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject 
are known in the art and described in eg. W093/14778. Examples of cells useful in ex vivo 
applications include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by the following procedures, for example, dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection 
of the DNA into nuclei, all well known in the art. 

Polynucleotide and polypeptide pharmaceutical compositions 

In addition to the pharmaceutically acceptable carriers and salts described above, the 
following additional agents can be used with polynucleotide and/or polypeptide compositions. 
A.Polypeptides 

One example are polypeptides which include, without limitation: asioloorosomucoid 
(ASOR); transferrin; asialoglycoproteins; antibodies; antibody fragments; ferritin; 
interleukins; interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 
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granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating factor 
(M-CSF), stem cell factor and erythropoietin. Viral antigens, such as envelope proteins, can 
also be used. Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 

B. Hormones, Vitamins, Etc. 

Other groups that can be included are, for example: hormones, steroids, androgens, 
estrogens, thyroid hormone, or vitamins, folic acid. 

C. Polyalkylenes, Polysaccharides, etc. 

Also, polyalkylene glycol can be included with the desired polynucleotides or 
polypeptides. In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di-, or polysaccarides can be included. In a preferred embodiment of this 
aspect, the polysaccharide is dextran or DEAE-dextran. Also, chitosan and 
poly(lactide-co-glycolide) 

D. Lipids, and Liposomes 

The desired polynucleotide or polypeptide can also be encapsulated in lipids or 
packaged in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to 
stably bind or entrap and retain nucleic acid. The ratio of condensed polynucleotide or 
polypeptide to lipid preparation can vary but will generally be around 1:1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight (1991) Biochim. Biophys. Acta. 1097:1-17; 
Straubinger(1983)Metf2. Enzymol. 101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively 
charged), anionic (negatively charged) and neutral preparations. Cationic liposomes have 
been shown to mediate intracellular delivery of plasmid DNA (Feigner (1987) Proc. Natl. 
Acad. Sci. USA 84:7413-7416); mRNA(Malone (1989) Proc. Natl. Acad. Sci. USA 
86:6077-6081); and purified transcription factors (Debs (1990) J. Biol. Chem. 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, 
N[l-2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, Feigner 
supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and 
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DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from readily available 
materials using techniques well known in the art. See, eg. Szoka (1978) Proc. Natl Acad. Sci. 
USA 75:4194-4198; WO90/1 1092 for a description of the synthesis of DOTAP 
(1 ,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, AL), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See eg. Straubinger (1983) Meth. 
Immunol 101:512-527; Szoka (1978) Proc. Natl Acad. Set USA 75:4194-4198; 
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell 17:77); Deamer 
& Bangham (1976) Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. 
Commun. 76:836; Fraley (1979) Proc. Natl Acad. Sci. USA 76:3348); Enoch & Strittmatter 
(1979) Proc. Natl. Acad. Sci. USA 76:145; Fraley (1980) J. Biol. Chem. (1980) 255:10431; 
Szoka & Papahadjopoulos (1978) Proc. Natl Acad. Sci. USA 75:145; and Schaefer-Ridder 
(1982) Science 215:166. 
E.Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide or polypeptide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, IDL, LDL, 
and VLDL. Mutants, fragments, or fusions of these proteins can also be used. Also, 
modifications of naturally occurring lipoproteins can be used, such as acetylated LDL. These 
lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein 
receptors. Preferably, if lipoproteins are including with the polynucleotide to be delivered, no 
other targeting ligand is included in the composition. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. The protein 
portion are known as apoproteins. At the present, apoproteins A, B, C, D, and E have been 



WO 99/57280 



46 



PCT/US99/09346 



isolated and identified. At least two of these contain several proteins, designated by Roman 
numerals, AI, All, AIV; CI, CII, CIIL 

A lipoprotein can comprise more than one apoprotein. For example, naturally 
occurring chylomicrons comprises of A, B, C, and E, over time these lipoproteins lose A and 
acquire C and E apoproteins. VLDL comprises A, B, C, and E apoproteins, LDL comprises 
apoprotein B; and HDL comprises apoproteins A, C, and E. 

The amino acid of these apoproteins are known and are described in, for example, 
Breslow (1985) Annu Rev. Biochem 54:699; Law (1986) Adv. Exp Med. Biol. 151:162; Chen 
(\9S6)JBiolChem 261:12918; Kane (1980) Proc Natl Acad Sci USA 77:2465; and Utermann 
(1984) Hum Genet 65:232. 

Lipoproteins contain a variety of lipids including, triglycerides, cholesterol (free and 
esters), and phopholipids. The composition of the lipids varies in naturally occurring 
lipoproteins. For example, chylomicrons comprise mainly triglycerides. A more detailed 
description of the lipid content of naturally occurring lipoproteins can be found, for example, 
in Meth. Enzymoi 128 (1986). The composition of the lipids are chosen to aid in 
conformation of the apoprotein for receptor binding activity. The composition of lipids can 
also be chosen to facilitate hydrophobic interaction and association with the polynucleotide 
binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by ultracentrifugation, for 
instance. Such methods are described in Meth. Enzymoi. (supra); Pitas (1980) Biochem. 
255:5454-5460 and Mahey (1979) J Clin. Invest 64:743-750. 

Lipoproteins can also be produced by in vitro or recombinant methods by expression 
of the apoprotein genes in a desired host cell. See, for example, Atkinson (1986) Annu Rev 
Biophys Chem 15:403 and Radding (1958) Biochim Biophys Acta 30: 443. 

Lipoproteins can also be purchased from commercial suppliers, such as Biomedical 
Technologies, Inc., Stoughton, Massachusetts, USA. 

Further description of lipoproteins can be found in Zuckermann et al, PCT. Appln. 
NO.US97/14465. 
F.Polycationic Agents 

Polycationic agents can be included, with or without lipoprotein, in a composition 
with the desired polynucleotide or polypeptide to be delivered. 
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Polycationic agents, typically, exhibit a net positive charge at physiological relevant 
pH and are capable of neutralizing the electrical charge of nucleic acids to facilitate delivery 
to a desired location. These agents have both in vitro, ex vivo, and in vivo applications. 
Polycationic agents can be used to deliver nucleic acids to a living subject either 
intramuscularly, subcutaneously, etc. 

The following are examples of useful polypeptides as polycationic agents: poly lysine, 
polyarginine, polyornithine, and protamine. Other examples include histones, protamines, 
human serum albumin, DNA binding proteins, non-histone chromosomal proteins, coat 
proteins from DNA viruses, such as (XI 74, transcriptional factors also contain domains that 
bind DNA and therefore may be useful as nucleic aid condensing agents. Briefly, 
transcriptional factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, 
Oct-1, Oct-2, CREP, and TFIID contain basic domains that bind DNA sequences. 

Organic polycationic agents include: spermine, spermidine, and purtrescine. 

The dimensions and of the physical properties of a polycationic agent can be 
extrapolated from the list above, to construct other polypeptide polycationic agents or to 
produce synthetic polycationic agents. 

Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, DEAE-dextran, 
polybrene. LipofectinD, and lipofectAMINED are monomers that form polycationic 
complexes when combined with polynucleotides or polypeptides. 

Immunodiagnostic Assays 

Neisserial antigens of the invention can be used in immunoassays to detect antibody 
levels (or, conversely, anti-Neisserial antibodies can be used to detect antigen levels). 
Immunoassays based on well defined, recombinant antigens can be developed to replace 
invasive diagnostics methods. Antibodies to Neisserial proteins within biological samples, 
including for example, blood or serum samples, can be detected. Design of the immunoassays 
is subject to a great deal of variation, and a variety of these are known in the art. Protocols for 
the immunoassay may be based, for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, use solid supports, or may be by 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. 
Assays which amplify the signals from the probe are also known; examples of which are 
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assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such 
as ELISA assays. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are 
constructed by packaging the appropriate materials, including the compositions of the 
invention, in suitable containers, along with the remaining reagents and materials (for 
example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well as 
suitable set of assay instructions. 

Nucleic Acid Hybridisation 

"Hybridization" refers to the association of two nucleic acid sequences to one another 
by hydrogen bonding. Typically, one sequence will be fixed to a solid support and the other 
will be free in solution. Then, the two sequences will be placed in contact with one another 
under conditions that favor hydrogen bonding. Factors that affect this bonding include: the 
type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to 
block the non-specific attachment of the liquid phase sequence to the solid support 
(Denhardt's reagent or BLOTTO); concentration of the sequences; use of compounds to 
increase the rate of association of sequences (dextran sulfate or polyethylene glycol); and the 
stringency of the washing conditions following hybridization. See Sambrook et al [supra] 
Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of 
very similar sequences over sequences that differ. For example, the combination of 
temperature and salt concentration should be chosen that is approximately 120 to 200 DC 
below the calculated Tm of the hybrid under study. The temperature and salt conditions can 
often be determined empirically in preliminary experiments in which samples of genomic 
DNA immobilized on filters are hybridized to the sequence of interest and then washed under 
conditions of different stringencies. See Sambrook et al at page 9.50. 

Variables to consider when performing, for example, a Southern blot are (1) the 
complexity of the DNA being blotted' and (2) the homology between the probe and the 
sequences being detected. The total amount of the fragment(s) to be studied can vary a 
magnitude of 10, from 0.1 to l^ig for a plasmid or phage digest to 10" 9 to 10" 8 g for a single 
copy gene in a highly complex eukaryotic genome. For lower complexity polynucleotides, 
substantially shorter blotting, hybridization, and exposure times, a smaller amount of starting 
polynucleotides, and lower specific activity of probes can be used. For example, a single-copy 
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yeast gene can be detected with an exposure time of only 1 hour starting with 1 jig of yeast 
DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 10 8 cpm/ng. For a 
single-copy mammalian gene a conservative approach would start with 10 ^g of DNA, blot 
overnight, and hybridize overnight in the presence of 10% dextran sulfate using a probe of 
greater than 10 8 cpm/fig, resulting in an exposure time of -24 hours. 

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid 
between the probe and the fragment of interest, and consequently, the appropriate conditions 
for hybridization and washing. In many cases the probe is not 100% homologous to the 
fragment. Other commonly encountered variables include the length and total G+C content of 
the hybridizing sequences and the ionic strength and formamide content of the hybridization 
buffer. The effects of all of these factors can be approximated by a single equation: 

Tm= 81 + 16.6(logi 0 Ci) + 0.4[%(G + C)]-0.6(%formamide) - 600/n-1.5(%mismatch). 

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid 
in base pairs (slightly modified from Meinkoth & Wahl (1984) Anal Biochem. 138: 267-284). 

In designing a hybridization experiment, some factors affecting nucleic acid 
hybridization can be conveniently altered. The temperature of the hybridization and washes 
and the salt concentration during the washes are the simplest to adjust. As the temperature of 
the hybridization increases (ie. stringency), it becomes less likely for hybridization to occur 
between strands that are nonhomologous, and as a result, background decreases. If the 
radiolabeled probe is not completely homologous with the immobilized fragment (as is 
frequently the case in gene family and interspecies hybridization experiments), the 
hybridization temperature must be reduced, and background will increase. The temperature of 
the washes affects the intensity of the hybridizing band and the degree of background in a 
similar manner. The stringency of the washes is also increased with decreasing salt 
concentrations. 

In general, convenient hybridization temperatures in the presence of 50% formamide 
are 42DC for a probe with is 95% to 100% homologous to the target fragment, 37DC for 90% 
to 95% homology, and 32 DC for 85% to 90% homology. For lower homologies, formamide 
content should be lowered and temperature adjusted accordingly, using the equation above. If 
the homology between the probe and the target fragment are not known, the simplest 
approach is to start with both hybridization and wash conditions which are nonstringent. If 
non-specific bands or high background are observed after autoradiography, the filter can be 
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washed at high stringency and reexposed. If the time required for exposure makes this 
approach impractical, several hybridization and/or washing stringencies should be tested in 
parallel. 

Nucleic Acid Probe Assays 

Methods such as PCR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes according to the invention can determine the presence of cDNA or 
mRNA. A probe is said to "hybridize" with a sequence of the invention if it can form a duplex 
or double stranded complex, which is stable enough to be detected. 

The nucleic acid probes will hybridize to the Neisserial nucleotide sequences of the 
invention (including both sense and antisense strands). Though many different nucleotide 
sequences will encode the amino acid sequence, the native Neisserial sequence is preferred 
because it is the actual sequence present in cells. mRNA represents a coding sequence and so 
a probe should be complementary to the coding sequence; single-stranded cDNA is 
complementary to mRNA, and so a cDNA probe should be complementary to the non-coding 
sequence. 

The probe sequence need not be identical to the Neisserial sequence (or its 
complement) — some variation in the sequence and length can lead to increased assay 
sensitivity if the nucleic acid probe can form a duplex with target nucleotides, which can be 
detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the 
formed duplex. Additional Neisserial sequence may also be helpful as a label to detect the 
formed duplex. For example, a non-complementary nucleotide sequence may be attached to 
the 5 f end of the probe, with the remainder of the probe sequence being complementary to a 
Neisserial sequence. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the probe, provided that the probe sequence has sufficient complementarity 
with the a Neisserial sequence in order to hybridize therewith and thereby form a duplex 
which can be detected. 

The exact length and sequence of the probe will depend on the hybridization 
conditions, such as temperature, salt condition and the like. For example, for diagnostic 
applications, depending on the complexity of the analyte sequence, the nucleic acid probe 
typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30 
nucleotides, although it may be shorter than this. Short primers generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. 
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Probes may be produced by synthetic procedures, such as the triester method of 
Matteucci et al [J. Am. Chem. Soc. (1981) 103:3185], or according to Urdea et al [Proc. 
Natl. Acad. Set USA (1983) 80: 7461], or using commercially available automated 
oligonucleotide synthesizers. 

The chemical nature of the probe can be selected according to preference. For certain 
applications, DNA or RNA are appropriate. For other applications, modifications may be 
incorporated eg. backbone modifications, such as phosphorothioates or methylphosphonates, 
can be used to increase in vivo half-life, alter RNA affinity, increase nuclease resistance etc. 
[eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH 
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997) 
TIBTECH 15:224-229; Buchardt et al. (1993) TIBTECH 11:384-386]. 

One example of a nucleotide hybridization assay is described by Urdea et al. in 
international patent application WO92/02526 [see also US patent 5,124,246]. 

Alternatively, the polymerase chain reaction (PCR) is another well-known means for 
detecting small amounts of target nucleic acids. The assay is described in: Mullis et al [Meth. 
Enzymol (1987) 155: 335-350]; US patent 4,683,195; and US patent 4,683,202. Two 
"primer" nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can comprise sequence that does not hybridize to the sequence of the 
amplification target (or its complement) to aid with duplex stability or, for example, to 
incorporate a convenient restriction site. Typically, such sequence will flank the desired 
Neisserial sequence. 

A thermostable polymerase creates copies of target nucleic acids from the primers 
using the original target nucleic acids as a template. After a threshold amount of target nucleic 
acids are generated by the polymerase, they can be detected by more traditional methods, such 
as Southern blots. When using the Southern blot method, the labelled probe will hybridize to 
the Neisserial sequence (or its complement). 

Also, mRNA or cDNA can be detected by traditional blotting techniques described in 
Sambrook et al [supra], mRNA, or cDNA generated from mRNA using a polymerase 
enzyme, can be purified and separated using gel electrophoresis. The nucleic acids on the gel 
are then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a 
labelled probe and then washed to remove any unhybridized probe. Next, the duplexes 
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containing the labeled probe are detected. Typically, the probe is labelled with a radioactive 
moiety. 

EXAMPLES 

The examples describe nucleic acid sequences which have been identified in 
N. meningitidis, and N. gonorrhoeae along with their respective and putative translation 
products. Not all of the nucleic acid sequences are complete ie. they encode less than the full- 
length wild-type protein. 

The examples are generally in the following format: 

• a nucleotide sequence which has been identified in N. meningitidis 

• the putative translation product of said N. meningitidis sequence 

• a computer analysis of said translation product based on database comparisons 

• a corresponding nucleotide sequence identified from N. gonorrhoeae 

• the putative translation product of said N. gonorrhoeae sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a corresponding nucleotide sequence identified from strain A of N. meningitidis 

• the putative translation product of said N. meningitidis strain A sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a description of the characteristics of the protein which indicates that it might be 
suitably antigenic or immunogenic. 

Sequence comparisons were performed at NCBI (http://www.ncbi.nlm.nih.gov) using 
the algorithms BLAST, BLAST2, BLASTn, BLASTp, tBLASTn, BLASTx, & tBLASTx [eg 
see also Altschul et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Research 25:2289-3402], Searches were performed 
against the following databases: non-redundant GenBank+EMBL+DDBJ+PDB sequences 
and non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR sequences. 

Dots within nucleotide sequences represent nucleotides which have been arbitrarily 
introduced in order to maintain a reading frame. In the same way, double-underlined 
nucleotides were removed . Lower case letters represent ambiguities which arose during 
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alignment of independent sequencing reactions (some of the nucleotide sequences in the 
examples are derived from combining the results of two or more experiments). 

Nucleotide sequences were scanned in all six reading frames to predict the presence of 
hydrophobic domains using an algorithm based on the statistical studies of Esposti et al 
[Critical evaluation of the hydropathy of membrane proteins (1990) Eur J Biochem 190:207- 
219]. These domains represent potential transmembrane regions or hydrophobic leader 
sequences. 

Open reading frames were predicted from fragmented nucleotide sequences using the 
program ORFFINDER (NCBI). 

Underlined amino acid sequences indicate possible transmembrane domains or leader 
sequences in the ORFs, as predicted by the PSORT algorithm (http://www.psort.nibb.ac.jp). 
Functional domains were also predicted using the MOTIFS program (GCG Wisconsin & 
PROSITE). 

For each of the following examples: based on the presence of a putative leader 
sequence and/or several putative transmembrane domains (single-underlined) in the 
gonococcal protein, it is predicted that the proteins from N. meningitidis and N. gonorrhoeae, 
and their respective epitopes, could be useful antigens or immunogenic compositions for 
vaccines or diagnostics. 

The standard techniques and procedures which may be employed in order to perform 
the invention {e.g. to utilize the disclosed sequences for vaccination or diagnostic purposes) 
were summarized above. This summary is not a limitation on the invention but, rather, gives 
examples that may be used, but are not required. 

In particular, the following methods were used to express, purify and biochemically 
characterize the proteins of the invention. 
Chromosomal DNA Preparation 

N. meningitidis strain 2996 was grown to exponential phase in 100ml of GC medium, 
harvested by centrifugation, and resuspended in 5ml buffer (20%(w/v) Sucrose, 50mM Tris- 
HC1, 50mM EDTA, pH8). After 10 minutes incubation on ice, the bacteria were lysed by 
adding 10ml of lysis solution (50mM NaCl, 1% Na-Sarkosyl, 50|ag/ml Proteinase K), and the 
suspension incubated at 37°C for 2 hours. Two phenol extractions (equilibrated to pH 8) and 
one CHCl 3 /isoamylalcohol (24:1) extraction were performed. DNA was precipitated by 
addition of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifugation. 
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The pellet was washed once with 70%(v/v) ethanol and redissolved in 4.0ml TE buffer 
(lOmM Tris-HCl, ImM EDTA, pH 8.0). The DNA concentration was measured by reading 
the OD at 260 nm. 

Oligonucleotide design 

Synthetic oligonucleotide primers were designed on the basis of the coding sequence 
of each ORF, using (a) the meningococcus B sequence when available, or (b) the 
gonococcus/meningococcus A sequence, adapted to the codon preference usage of 
meningococcus as necessary. Any predicted signal peptides were omitted, by designing the 5' 
primers to sequence immediately downstream from the predicted leader sequence. 

For most ORFs, the 5' primers included two restriction enzyme recognition sites 
(BamHl-Ndel, BamHl-Nhel, EcoRl-Ndel or EcoRl-Nhel), depending on the restriction pattern 
of the gene of interest. The 3' primers included a Xhol or a Hindlll restriction site (table 1). 
This procedure was established in order to direct the cloning of each amplification product 
(corresponding to each ORF) into two different expression systems: pGEX-KG (using 
BamHl-Xhol, BamHl-Hindlll, EcoRl-Xhol or EcoRl-Hindlll), and pET2 1 b+ (using Ndei- 
Xhol, Nhel-Xhol, Ndel-Hindlll or Nhel-Hindlll). 



5 '-end primer tail: 



CGC GGATCCCATATG 
CGC GGATCCGCTAGC 
CCG GAATTC TA CATATG 
CCGGAATTCTAGCTAGC 



(BamHl-Ndel ) 
(BamHl-Nhel) 
(EcoRl-Ndel) 
(EcoRl-Nhel) 



3'-end primer tail: 



CCCG CTCGAG 
CCCGCTCGAG 



(Xhol) 
(Hindi!!) 



For cloning ORFs into the pGEX-His vector, the 5' and 3' primers contained only one 
restriction enzyme site (EcoR\, Kpnl or Sail for the 5' primers and Pstl, Xbal, Sphl or Sail for 
the 3' primers). Again restriction sites were chosen according to the particular restriction 
pattern of the gene (table 1). 



5 '-end primer tail: 



3 '-end primer tail: 



(AAA) AAA GAATTC 
(AAA) AAA GGTACC 
(AAA) AAA CTGCAG 
(AAA) AAATCTAGA 



(EcoRl ) 
(Kpnl) 

(Pstl) 
(Xbal) 
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AAA GCATGC (SphT) 
5' or 3'-end primer tail: AAAAAA GTCGAC (Sail) 

As well as containing the restriction enzyme recognition sequences, the primers 
included nucleotides which hybridized to the sequence to be amplified. The melting 
temperature depended on the number and type of hybridising nucleotides in the whole primer, 
and was determined for each primer using the formulae: 

Tm = 4 (G+C)+ 2 (A+T) (tail excluded) 

T m = 64.9 + 0.41 (% GC) - 600/N (whole primer) 

The melting temperatures of the selected oligonucleotides were usually 65-70°C for 
the whole oligo and 50-55°C for the hybridising region alone. 

Table 1 shows the forward and reverse primers used for each amplification. In certain 
cases, the sequence of the primer does not exactly match the sequence of the predicted ORF. 
This is because when initial amplifications were performed, the complete 5' and/or 3' 
sequences for some meningococcal B ORFs were not known. However the corresponding 
sequences had been identified in Gonococcus or in Meningoccus A. Hence, when the 
Meningoccus B sequence was incomplete or uncertain, Gonococcal or Meningococcal A 
sequences were used as the basis for primer design. These sequences were altered to take 
account of codon preference. It can be appreciated that, once the complete sequence is 
identified, this approach will no longer be necessary. 

Oligonucleotides were synthesized using a Perkin Elmer 394 DNA/RNA Synthesizer, 
eluted from the columns in 2.0ml NH 4 OH, and deprotected by 5 hours incubation at 56°C 
The oligos were precipitated by addition of 0.3M Na- Acetate and 2 volumes ethanol. The 
samples were centrifuged and the pellets resuspended in either 100|il or 1.0ml of water. The 
OD26O was determined using a Perkin Elmer Lambda Bio spectophotometer and the 
concentration adjusted to 2-10pmol/pl 

Amplification 

The standard PCR protocol was as follows: 50-200ng of genomic DNA was used as a 
template in the presence of 20-40^iM of each oligonucletide primer, 400-800(iM dNTPs 
solution, lx PCR buffer (including 1.5mM MgCb), 2.5 units Tag/ DNA polymerase (using 
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Perkin-Elmer AmpliTaQ, GIBCO Platinum, Pwo DNA polymerase, or Tahara Shuzo Taq 
polymerase). In some cases, PCR was optimsed by the addition of 10}xl DMSO or 50|il 2M 
Betaine. 

After a hot start (adding the polymerase during a preliminary 3 minute incubation of 
the whole mix at 95 °C), each sample underwent a two-step amplification. The first 5 cycles 
were performed using the hybridization temperature that excluded the restriction enzyme tail 
of the primer (see above). This was followed by 30 cycles using the hybridization temperature 
calculated for the whole length oligos. The cycles were completed with a 10 minute extension 
step at 72°C. The standard cycles were as follows: 





Denaturation 


Hybridisation 


Elongation 


First 5 cycles 


30 seconds 
95°C 


30 seconds 
50-55°C 


30-60 seconds 
72°C 


Last 30 cycles 


30 seconds 
95°C 


30 seconds 
65-70°C 


30-60 seconds 
72°C 



Elongation times varied according to the length of the ORE to be amplified. 
Amplifications were performed using either a 9600 or a 2400 Perkin Elmer GeneAmp PCR 
System. To check the results, 1/10 of the amplification volume was loaded onto a 1-1.5% 
(w/v) agarose gel and the size of each amplified fragment compared with a DNA molecular 
weight marker. 

The amplified DNA was either loaded directly on a 1% agarose gel or first 
precipitated with ethanol and resuspended in a volume suitable to be loaded on a 1.0% 
agarose gel. The DNA fragment corresponding to the band of correct size was purified using 
the Qiagen Gel Extraction Kit, following the manufacturer's protocol. DNA fragments were 
eluted in a volume of 30|il or 50jli1 with either H20 or lOmM Tris, pH 8.5. 

Digestion of PCR fragments 

The purified DNA corresponding to the amplified fragment was doubly-digested with 
the appropriate restriction enzymes for; cloning into pET-21b+ and expressing the protein as a 
C-terminus His-tagged fusion, for cloning into pGEX-KG and expressing the protein as a N- 
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terminus GST-fusion, and for cloning into pGEX-His and expressing the protein as a 
N-terminus GST-His tagged fusion. 

Each purified DNA fragment was incubated at 37°C for 3 hours to overnight with 20 
units of appropriate restriction enzyme (New England Biolabs) in a volume of either 30 or 
40pl in the presence of suitable digestion buffer. Digested fragments were purified using the 
QIAquick PCR purification kit (following the manufacturer's instructions) and eluted in a 
volume of 30ul or 50pl with either H20 or lOmM Tris, pH 8.5. The DNA concentration was 
determined by quantitative agarose gel electrophoresis (1.0% gel) in the presence of a titrated 
molecular weight marker. 

Digestion of the cloning vectors (pET22B, pGEX-KG, pTRC-His A, pET21b+, pGEX- 
KG, and pGEX-His) 

The vector pGEX-His is a modified pGEX-2T vector carrying a region encoding six 
histidine residues upstream of the thrombin cleavage site and containing the multiple cloning 
site of the vector pTRC99 (Pharmacia). 10 ug plasmid was double-digested with 50 units of 
each restriction enzyme in 200 pi reaction volume in the presence of appropriate buffer by 
overnight incubation at 37°C. After loading the whole digestion on a 1% agarose gel, the 
band corresponding to the digested vector was purified from the gel using the Qiagen 
QIAquick Gel Extraction Kit and the DNA was eluted in 50 pi of 10 mM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD 26 o of the sample, and adjusted to 50 
pg/pl. 1 pi of plasmid was used for each cloning procedure. 

lOpg of plasmid vector was doubly-digested with 50 units of each restriction enzyme 
in a volume of 200pl with the appropriate buffer overnight at 37°C. The digest was loaded 
onto a 1.0% agarose gel and the band corresponding to the digested vector purified using the 
Qiagen QIAquick Gel Extraction Kit. DNA was eluted in 50pl of lOmM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD260n m and the concentration adjusted 
to 50pg/pl. lpl of plasmid was used for each cloning procedure. 

Cloning 

For some ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated in both pET22b and pGEX-KG. In a final volume of 20 pi, a molar 



WO 99/57280 



58 



PCT/US99/09346 



ratio of 3:1 fragment/vector was ligated using 0.5 ^il of NEB T4 DNA ligase (400 units/jil), in 
the presence of the buffer supplied by the manufacturer. The reaction was incubated at room 
temperature for 3 hours. In some experiments, ligation was performed using the Boheringer 
"Rapid Ligation Kit", following the manufacturer's instructions. 

In order to introduce the recombinant plasmid in a suitable strain, 100 \xl E. coli DH5 
competent cells were incubated with the ligase reaction solution for 40 minutes on ice, then at 
37°C for 3 minutes, then, after adding 800 |il LB broth, again at 37°C for 20 minutes. The 
cells were then centrifuged at maximum speed in an Eppendorf microfuge and resuspended in 
approximately 200 (il of the supernatant. The suspension was then plated on LB ampicillin 
(100 mg/ml ). 

The screening of the recombinant clones was performed by growing 5 
randomly-chosen colonies overnight at 37 °C in either 2 ml (pGEX or pTC clones) or 5ml 
(pET clones) LB broth + 100 ng/ml ampicillin. The cells were then pelletted and the DNA 
extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions, to a final volume of 30 |il. 5 \i\ of each individual miniprep (approximately lg ) 
were digested with either NdellXhol or BamHl/Xhol and the whole digestion loaded onto a 1- 
1.5% agarose gel (depending on the expected insert size), in parallel with the molecular 
weight marker (1Kb DNA Ladder, GEBCO). The screening of the positive clones was made 
on the base of the correct insert size. 

For other ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated into both pET21b+ and pGEX-KG. A molar ratio of of 3:1 
fragment/vector was used in a final volume of 20^1, that included 0.5^il T4 DNA ligase (400 
units/|il, NEB) and ligation buffer supplied by the manufacturer. The reaction was performed 
at room temperature for 3 hours. In some experiments, ligation was performed using the 
Boheringer "Rapid Ligation Kit" and the manufacturer's protocol. 

Recombinant plasmid was transformed into 100(il of competent E. coli DH5 or 
HB101 by incubating the ligase reaction solution and bacteria for 40 minutes on ice then at 
37°C for 3 minutes. This was followed by the addition of 800jal LB broth and incubation at 
37°C for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf 
microfuge, resuspended in approximately 200|il of the supernatant and plated onto LB 
ampicillin (lOOmg/ml ) agar. 
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Screening for recombinant clones was performed by growing 5 randomly selected 
colonies overnight at 37°C in either 2.0ml (pGEX-KG clones) or 5.0ml (pET clones) LB 
broth + 100)ag/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the 
Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions. 
Approximately lp.g of each individual miniprep was digested with the appropriate restriction 
enzymes and the digest loaded onto a 1-1.5% agarose gel (depending on the expected insert 
size), in parallel with the molecular weight marker (lkb DNA Ladder, GIBCO). Positive 
clones were selected on the basis of the size of insert. 

ORFs were cloned into PGEX-His, by doubly-digesting the PCR product and ligating 
into similarly digested vector. After cloning, recombinant plasmids were transformed into the 
E.coli host W31 10. Individual clones were grown overnight at 37°C in LB broth with 50p,g/ml 
ampicillin. 

Certain ORFs may be cloned into the pGEX-HIS vector using EcoRl-Pstl cloning 
sites, or EcoRI-Sall, or SaK-Pstl. After cloning, the recombinant plasmids may be introduced 
inthe£.colihost W3110. 

Expression 

Each ORF cloned into the expression vector may then be transformed into the strain 
suitable for expression of the recombinant protein product. 1 of each construct was used to 
transform 30 ^1 of E.coli BL21 (pGEX vector), E.coli TOP 10 (pTRC vector) or E.coli BL21- 
DE3 (pET vector), as described above. In the case of the pGEX-His vector, the same E.coli 
strain (W31 10) was used for initial cloning and expression. Single recombinant colonies were 
inoculated into 2ml LB+Amp (100 ng/rnl), incubated at 37°C overnight, then diluted 1:30 in 
20 ml of LB+Amp (100 |ig/ml) in 100 ml flasks, making sure that the OD 6 oo ranged between 
0.1 and 0.15. The flasks were incubated at 30°C into gyratory water bath shakers until OD 
indicated exponential growth suitable for induction of expression (0.4-0.8 OD for pET and 
pTRC vectors; 0.8-1 OD for pGEX and pGEX-His vectors). For the pET, pTRC and pGEX- 
His vectors, the protein expression was induced by addiction of ImM IPTG, whereas in the 
case of pGEX system the final concentration of IPTG was 0.2 mM. After 3 hours incubation 
at 30°C, the final concentration of the sample was checked by OD. In order to check 
expression, 1ml of each sample was removed, centrifuged in a micro fuge, the pellet 
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resuspended in PBS, and analysed by 12% SDS-PAGE with Coomassie Blue staining. The 
whole sample was centrifuged at 6000g and the pellet resuspended in PBS for further use. 

GST-fusion proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on LB+Amp agar plate. 
The bacteria were inoculated into 20 ml of LB+Amp liquid colture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600 ml of fresh medium and allowed to 
grow at the optimal temperature (20-37°C) to OD 550 0.8-1. Protein expression was induced 
with 0.2mM IPTG followed by three hours incubation. The culture was centrifuged at 8000 
rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 7.5 ml 
cold PBS. The cells were disrupted by sonication on ice for 30 sec at 40W using a Branson 
sonifier B-15, frozen and thawed two times and centrifuged again. The supernatant was 
collected and mixed with 150ul Glutatione-Sepharose 4B resin (Pharmacia) (previously 
washed with PBS) and incubated at room temperature for 30 minutes. The sample was 
centrifuged at 700# for 5 minutes at 4C. The resin was washed twice with 10 ml cold PBS for 
10 minutes, resuspended in 1ml cold PBS* and loaded on a disposable column. The resin was 
washed twice with 2ml cold PBS until the flow-through reached OD 28 o of 0.02-0.06. The 
GST-fusion protein was eluted by addition of 700^1 cold Glutathione elution buffer lOmM 
reduced glutathione, 50mM Tris-HCl) and fractions collected until the OD 280 was 0. 1 . 21 ul of 
each fraction were loaded on a 12% SDS gel using either Biorad SDS-PAGE Molecular 
weight standard broad range (Ml) (200, 1 16.25, 97.4, 66.2, 45, 31, 21.5, 14.4, 6.5 kDa) or 
Amersham Rainbow Marker (M") (220, 66, 46, 30, 21.5, 14.3 kDa) as standards. As the MW 
of GST is 26kDa, this value must be added to the MW of each GST-fusion protein. 

For other ORFs, for each clone to be purified as a GST-fusion, a single colony was 
streaked out and grown overnight at 37°C on a LB/Amp. (100p.g/ml) agar plate. An isolated 
colony from this plate was inoculated into 20ml of LB/Amp (100 ng/ml) liquid medium and 
grown overnight at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml 
LB/Amp (100|ag/ml) liquid medium and allowed to grow at the optimal temperature (20- 
37°C) until the OD 5 5o n m reached 0.6-0.8. Recombinant protein expression was induced by 
addition of IPTG (final concentration 0.2mM) and the culture incubated for a further 3 hours. 
Bacteria were harvested by centrifugation at 8000xg for 15 min at 4°C. 
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The bacterial pellet was resuspended in 7.5ml cold PBS. Cells were disrupted by 
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged at 
13 OOOxg for 30 min at 4°C. The supernatant was collected and mixed with 150^1 Glutatione- 
Sepharose 4B resin (Pharmacia), previously equilibrated with PBS, and incubated at room 
temperature with gentle agitation for 30 min. The batch- wise preparation was centrifuged at 
700xg for 5 min at 4°C and the supernatant discarded. The resin was washed twice 
(batchwise) with 10ml cold PBS for 10 min, resuspended in 1ml cold PBS, and loaded onto a 
disposable column. The resin continued to be washed with cold PBS, until the OD 28 onm of the 
flow-through reached 0.02-0.01 . The GST-fusion protein was eluted by addition of 700ul cold 
glutathione elution buffer (lOmM reduced glutathione, 50mM Tris-HCl pH 8.0) and fractions 
collected, until the OD 28 onm of the eluate indicated all the recombinant protein was obtained. 
20ul aliquots of each elution fraction were analyzed by SDS-PAGE using a 12% gel. The 
molecular mass of the purified proteins was determined using either the Bio-Rad broad range 
molecular weight standard (Ml) (200, 116, 97.4, 66.2,45.0, 31.0, 21.5, 14.4, 6.5 kDa) or the 
Amersham Rainbow Marker (M2) (220, 66.2, 46.0, 30.0, 21.5, 14.3 kDa). The molecular 
weights of GST-fusion proteins are a combination of the 26 kDa GST protein and its fusion 
partner. Protein concentrations were estimated using the Bradford assay. 

His-fusion soluble proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on a LB + Amp agar 
plate. The bacteria were inoculated into 20ml of LB+Amp liquid culture and incubated 
overnight in a water bath shaker. Bacteria were diluted 1:30 into 600ml fresh medium and 
allowed to grow at the optimal temperature (20-37°C) to OD 550 0.6-0.8. Protein expression 
was induced by addition of 1 mM IPTG and the culture further incubated for three hours. The 
culture was centrifuged at 8000 rpm at 4°C, the supernatant was discarded and the bacterial 
pellet was resuspended in 7.5ml cold lOmM imidazole buffer (300 mM NaCl, 50 mM 
phosphate buffer, 10 mM imidazole, pH 8). The cells were disrupted by sonication on ice for 
30 sec at 40W using a Branson sonifier B-15, frozen and thawed two times and centrifuged 
again. The supernatant was collected and mixed with 150pl Ni 2+ -resin (Pharmacia) 
(previously washed with lOmM imidazole buffer) and incubated at room temperature with 
gentle agitation for 30 minutes. The sample was centrifuged at lOOg for 5 minutes at 4°C. 
The resin was washed twice with 10 ml cold lOmM imidazole buffer for 10 minutes, 
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resuspended in 1ml cold lOmM imidazole buffer and loaded on a disposable column. The 
resin was washed at 4°C with 2ml cold lOmM imidazole buffer until the flow-through 
reached the O.D 280 of 0.02-0.06. The resin was washed with 2ml cold 20mM imidazole 
buffer (300 mM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8) until the flow- 
through reached the O.D 280 of 0.02-0.06. The His-fusion protein was eluted by addition of 
700ul cold 250mM imidazole buffer (300 mM NaCl, 50 mM phosphate buffer, 250 mM 
imidazole, pH 8) and fractions collected until the O.D 280 was 0.1. 21ul of each fraction were 
loaded on a 12% SDS gel. 

His-fusion insoluble proteins large-scale purification. 

A single colony was grown overnight at 37 °C on a LB + Amp agar plate. The 
bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600ml fresh medium and let to grow at the 
optimal temperature (37°C) to O.D 550 0.6-0.8. Protein expression was induced by addition 
of 1 mM IPTG and the culture further incubated for three hours. The culture was centrifuged 
at 8000rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 
7.5 ml buffer B (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8). The cells 
were disrupted by sonication on ice for 30 sec at 40W using a Branson sonifier B-15, frozen 
and thawed twice and centrifuged again. The supernatant was stored at -20°C, while the 
pellets were resuspended in 2 ml guanidine buffer (6M guanidine hydrochloride, lOOmM 
phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated in a homogenizer for 10 cycles. The 
product was centrifuged at 13000 rpm for 40 minutes. The supernatant was mixed with 1 50ul 
Ni 2+ -resin (Pharmacia) (previously washed with buffer B) and incubated at room temperature 
with gentle agitation for 30 minutes. The sample was centrifuged at 700 g for 5 minutes at 
4°C. The resin was washed twice with 10 ml buffer B for 10 minutes, resuspended in 1ml 
buffer B, and loaded on a disposable column. The resin was washed at room temperature 
with 2ml buffer B until the flow-through reached the OD 280 of 0.02-0.06. The resin was 
washed with 2ml buffer C (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 6.3) until 
the flow-through reached the O.D 280 of 0.02-0.06. The His-fusion protein was eluted by 
addition of 700ul elution buffer (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 
4.5) and fractions collected until the OD 280 was 0.1 . 21ul of each fraction were loaded on a 
12% SDS gel. 
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Purification of His-fusion proteins. 

For each clone to be purified as a His-fusion, a single colony was streaked out and 
grown overnight at 37°C on a LB/Amp (100 ng/ml) agar plate. An isolated colony from this 
plate was inoculated into 20ml of LB/Amp (100 (ig/ml) liquid medium and grown overnight 
at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml LB/Amp (100 
(ig/ml) liquid medium and allowed to grow at the optimal temperature (20-37°C) until the 
ODssonm reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG 
(final concentration 1 .OmM) and the culture incubated for a further 3 hours. Bacteria were 
harvested by centrifugation at 8000xg for 15 min at 4°C 

The bacterial pellet was resuspended in 7.5ml of either (i) cold buffer A (300mM 
NaCl, 50mM phosphate buffer, lOmM imidazole, pH 8.0) for soluble proteins or (ii) buffer B 
(8M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8) for insoluble proteins. Cells 
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. For insoluble proteins, pellets were 
resuspended in 2.0 ml buffer C (6M guanidine hydrochloride, lOOmM phosphate buffer, 
lOmM Tris-HCl, pH 7.5) and treated with a Dounce homogenizer for 10 cycles. The 
homogenate was centrifuged at 13 OOOxg for 40 min and the supernatant retained. 

Supernatants for both soluble and insoluble preparations were mixed with 150jli1 Ni 2+ - 
resin (previously equilibrated with either buffer A or buffer B, as appropriate) and incubated 
at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast 
Flow (Pharmacia), prepared according to manufacturers protocol. The batch-wise preparation 
was centrifuged at 700xg for 5 min at 4°C and the supernatant discarded. The resin was 
washed twice (batch-wise) with 10ml buffer A or B for 10 min, resuspended in 1.0 ml buffer 
A or B and loaded onto a disposable column. The resin continued to be washed with either (i) 
buffer A at 4°C or (ii) buffer B at room temperature, until the OD 2 80nm of the flow-through 
reached 0.02-0.01. The resin was further washed with either (i) cold buffer C (300mM NaCl, 
50mM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) buffer D (8M urea, lOmM Tris- 
HCl, lOOmM phosphate buffer, pH 6.3) until the the OD 2 80nm of the flow-through reached 
0.02-0.01. The His-fusion protein was eluted by addition of 700jal of either (i) cold elution 
buffer A (300mM NaCl, 50mM phosphate buffer, 250mM imidazole, pH 8.0) or (ii) elution 
buffer B (8 M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 4.5) and fractions 
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collected until the O.D 2 80nm indicated all the recombinant protein was obtained. 20ul aliquots 
of each elution fraction were analyzed by SDS-PAGE using a 12% gel. Protein concentrations 
were estimated using the Bradford assay. 

His-fusion proteins renaturation 

In the cases where denaturation was required to solubilize proteins, a renaturation step 
was employed prior to immunization. Glycerol was added to the denatured fractions obtained 
above to give a final concentration of 10%(v/v). The proteins were diluted to 200ug/ml using 
dialysis buffer I (10% (v/v) glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced 
glutathione, 0.5mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same 
buffer for 12-14 hours at 4°C. Further dialysis was performed with buffer II (10% (v/v) 
glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced glutathione, 0.5mM 
oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Alternatively, 10% glycerol was added to the denatured proteins. The proteins were 
then diluted to 20pg/ml using dialysis buffer I (10% glycerol, 0.5M arginine, 50mM 
phosphate buffer, 5mM reduced glutathione, 0.5mM oxidised glutathione, 2M urea, pH 8.8) 
and dialysed against the same buffer at 4°C for 12-14 hours. The protein was further dialysed 
against dialysis buffer II (10% glycerol, 0.5M arginine, 50mM phosphate buffer, 5mM 
reduced glutathione, 0.5mM oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Protein concentration was evaluated using the formula: 

Protein (mg/ml) = (1.55 x OD 2 go) - (0.76 x OD 260 ) 

Purification of proteins 

To analyse the solubility, pellets obtained from 3.0ml cultures were resuspended in 
500ul buffer Ml. (PBS pH 7.2). 25ul of lysozyme (lOmg/ml) was added and the bacteria 
incubated for 15 min at 4°C. Cells were disrupted by sonication on ice four times for 30 sec at 
40W using a Branson sonifier 450 and centrifuged at 13 OOOxg for 30 min at 4°C. The 
supernatant was collected and the pellet resuspended in buffer M2 [8M urea, 0.5M NaCl, 
20mM imidazole and 0.1M NaH 2 P0 4 ] and incubated for 3 to 4 hours at 4°C. After 
centrifugation, the supernatant was collected and the pellet resuspended in buffer M3 [6M 
guanidinium-HCl, 0.5M NaCl, 20mM imidazole and 0.1M NaH 2 P0 4 ] overnight at 4°C. The 
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supernatants from all steps were analysed by SDS-PAGE. Some proteins were found to be 
soluble in PBS, others need urea or guanidium-HCl for solubilization. 

For preparative scale purifications, 500ml cultures were induced and fusion proteins 
solubilized in either buffer Ml, M2 or M3 using the procedure described above. Crude 
extracts were loaded onto a Ni-NTA superflow column (Quiagen) equilibrated with buffer 
Ml, M2 or M3 depending on the solubilization buffer employed. Unbound material was 
eluted by washing the column with the same buffer. The recombinant fusion protein was 
eluted with the corresponding buffer containing 500mM imidazole then dialysed against the 
same buffer in the absence of imidazole. 

Mice immunisations 

20p.g of each purified protein are used to immunise mice intraperitoneally. In the case 
of some ORFs, Balb-C mice were immunised with Al(OH) 3 as adjuvant on days 1,21 and 42, 
and immune response was monitored in samples taken on day 56. For other ORFs, CD1 mice 
could be immunised using the same protocol. For ORFs 25 and 40, CD1 mice were 
immunised using Freund's adjuvant, and the same immunisation protocol was used, except 
that the immune response was measured on day 42, rather than 56. Similarly, for still other 
ORFs, CD1 mice were immunised with Freund's adjuvant, but the immune response was 
measured on day 49. Alternatively, 20^g of each purified protein was mixed with Freund's 
adjuvant and used to immunise CD1 mice intraperitoneally. For many of the proteins, the 
immunization was performed on days 1, 21 and 35, and immune response was monitored in 
samples taken on days 34 and 49. For some proteins, the third immunization was performed 
on day 28, rather than 35, and the immune response was measured on days 20 and 42, rather 
than 34 and 49. 

ELISA assay (sera analysis) 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 7ml of Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD 6 2o- The 
bacteria were let to grow until the OD reached the value of 0.3-0.4. The culture was 
centrifuged for 10 minutes at 10000 rpm. The supernatant was discarded and bacteria were 
washed once with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated 
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for 2 hours at room temperature and then overnight at 4°C with stirring. lOOpl bacterial cells 
were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells 
were then washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200 pi of 
saturation buffer (2.7% Polyvinylpyrrolidone 10 in water) was added to each well and the 
plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 200 pi of 
diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20, 0.1% NaN 3 in PBS) were added to 
each well and the plates incubated for 90 minutes at 37°C. Wells were washed three times 
with PBT. 100 pi of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1 :2000 in 
dilution buffer were added to each well and the plates were incubated for 90 minutes at 37°C. 
Wells were washed three times with PBT buffer. 100 pi of substrate buffer for HRP (25 ml of 
citrate buffer pH5, 10 mg of O-phenildiamine and 10 pi of H 2 0) were added to each well and 
the plates were left at room temperature for 20 minutes. 100 pi H 2 S0 4 was added to each 
well and OD 49 o was followed. The ELISA was considered positive when OD490 was 2.5 
times the respective pre-immune sera. 

Alternatively, The acapsulated MenB M7 strain was plated on chocolate agar plates 
and incubated overnight at 37°C. Bacterial colonies were collected from the agar plates using 
a sterile dracon swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD 620 . The bacteria 
were let to grow until the OD reached the value of 0.3-0.4. The culture was centrifuged for 10 
minutes at 10 OOOrpm. The supernatant was discarded and bacteria were washed once with 
PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37°C 
and then overnight at 4°C with stirring. lOOpl bacterial cells were added to each well of a 96 
well Greiner plate and incubated overnight at 4°C. The wells were then washed three times 
with PBT washing buffer (0.1% Tween-20 in PBS). 200pl of saturation buffer (2.7% 
Polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours 
at 37°C. Wells were washed three times with PBT. 200pl of diluted sera (Dilution buffer: 1% 
BSA, 0. 1% Tween-20, 0.1 % NaN 3 in PBS) were added to each well and the plates incubated 
for 2 hours at 37°C. Wells were washed three times with PBT. lOOpl of HRP-conjugated 
rabbit anti-mouse (Dako) serum diluted 1:2000 in dilution buffer were added to each well and 
the plates were incubated for 90 minutes at 37°C. Wells were washed three times with PBT 
buffer. lOOpl of substrate buffer for HRP (25ml of citrate buffer pH5, lOmg of O- 
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phenildiamine and 10p.l of H 2 0 2 ) were added to each well and the plates were left at room 
temperature for 20 minutes. lOOul of 12.5% H 2 S0 4 was added to each well and OD 490 was 
followed. The ELISA titers were calculated abitrarely as the dilution of sera which gave an 
OD 490 value of 0.4 above the level of preimmune sera. The ELISA was considered positive 
when the dilution of sera with OD 490 of 0.4 was higher than 1 :400. 

FACScan bacteria Binding Assay procedure. 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth (Difco) 
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following 
OD 620 . The bacteria were let to grow until the OD reached the value of 0.35-0.5. The culture 
was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was 
resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN 3 ) and centrifuged for 5 minutes 
at 4000rpm. Cells were resuspended in blocking buffer to reach OD 620 of 0.07. 100u.l bacterial 
cells were added to each well of a Costar 96 well plate. lOOul of diluted (1:100, 1:200, 1:400) 
sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4°C. 
Cells were centrifuged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed 
by addition of 200p,l/well of blocking buffer in each well. lOOpl of R-Phicoerytrin conjugated 
F(ab) 2 goat anti-mouse, diluted 1:100, was added to each well and plates incubated for 1 hour 
at 4°C. Cells were spun down by centrifugation at 4000rpm for 5 minutes and washed by 
addition of 200uVwell of blocking buffer. The supernatant was aspirated and cells 
resuspended in 200ul/well of PBS, 0.25% formaldehyde. Samples were transferred to 
FACScan tubes and read. The condition for FACScan (Laser Power 15mW) setting were: FL2 
on; FSC-H threshold:92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2 
PMT: 586; compensation values: 0. 

OMV preparations 

Bacteria were grown overnight on 5 GC plates, harvested with a loop and resuspended 
in 10 ml 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30 minutes and the 
bacteria disrupted by sonication for 10' on ice ( 50% duty cycle, 50% output ). Unbroken 
cells were removed by centrifugation at 5000g for 10 minutes and the total cell envelope 
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fraction recovered by centrifugation at 500(% at 4°C for 75 minutes. To extract cytoplasmic 
membrane proteins from the crude outer membranes, the whole fraction was resuspended in 
2% sarkosyl (Sigma) and incubated at room temperature for 20 minutes. The suspension was 
centrifuged at lOOOOg- for 10 minutes to remove aggregates, and the supernatant further 
ultracentrifuged at 50000g- for 75 minutes to pellet the outer membranes. The outer 
membranes were resuspended in lOmM Tris-HCl, pH8 and the protein concentration 
measured by the Bio-Rad Protein assay, using BSA as a standard. 

Whole Extracts preparation 

Bacteria were grown overnight on a GC plate, harvested with a loop and resuspended 
in 1ml of 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30' minutes. 

Western blotting 

Purified proteins (500ng/lane), outer membrane vesicles (5jxg) and total cell extracts 
(25ug) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and 
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150mA at 
4°C, using transfer buffer (0.3 % Tris base, 1 .44 % glycine, 20% (v/v) methanol). The 
membrane was saturated by overnight incubation at 4°C in saturation buffer (10% skimmed 
milk, 0.1% Triton XI 00 in PBS). The membrane was washed twice with washing buffer (3% 
skimmed milk, 0.1% Triton XI 00 in PBS) and incubated for 2 hours at 37°C with mice sera 
diluted 1 :200 in washing buffer. The membrane was washed twice and incubated for 90 
minutes with a 1:2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The 
membrane was washed twice with 0.1% Triton X100 in PBS and developed with the Opti- 
4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water. 

Bactericidal assay 

MC58 and 2996 strains were grown overnight at 37°C on chocolate agar plates. 5-7 
colonies were collected and used to inoculate 7ml Mueller-Hinton broth. The suspension was 
incubated at 37°C on a nutator and let to grow until OD 620 was in between 0.5-0.8. The 
culture was aliquoted into sterile 1.5ml Eppendorf tubes and centrifuged for 20 minutes at 
maximum speed in a microfuge. The pellet was washed once in Gey's buffer (Gibco) and 
resuspended in the same buffer to an OD 620 of 0.5, diluted 1 :20000 in Gey's buffer and stored 
at 25°C. 
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50fj.l of Gey's buffer/1% BSA was added to each well of a 96-well tissue culture plate. 
25|xl of diluted (1:100) mice sera (dilution buffer: Gey's buffer/0.2% BSA) were added to 
each well and the plate incubated at 4°C. 25ul of the previously described bacterial 
suspension were added to each well. 25 ul of either heat-inactivated (56°C waterbath for 30 
minutes) or normal baby rabbit complement were added to each well. Immediately after the 
addition of the baby rabbit complement, 22ul of each sample/well were plated on Mueller- 
Hinton agar plates (time 0). The 96-well plate was incubated for 1 hour at 37°C with rotation 
and then 22ul of each sample/well were plated on Mueller-Hinton agar plates (time 1). After 
overnight incubation the colonies corresponding to time 0 and time lh were counted. 

Gene Variability 

The ORF4 and 919 genes were amplified by PCR on chromosomal DNA extracted 
from various Neisseria strains (see list of strains). The following oligonucleotides used as 
PCR primers were designed in the upstream and downstream regions of the genes: 

orf4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 

919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 
919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

The PCR cycling conditions were: 

1 cycle 2 min. at 94° 

30 cycles 30 sec. at 94° 

30 sec. at ~ 54° or ~ 60° ( in according to Tm of the primers) 

40 sec. at 72° 
1 cycle 7 min. at 72° 

The PCR products were purified from 1 % agarose gel and sequenced using the 
following primers: 

orf4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.2 (forward) CGACCGCGCCTTTGGGACTG 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 
orf 4.4 (reverse) TCTTTGAGTTTGATCCAACC 
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919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 

919.2 (forward) ATCCTTCCGCCTCGGCTGCG 

919.3 (forward) AAAACAGCGGCACAATCGAC 

919.4 (forward) ATAAGGGCTACCTCAAACTC 

919.5 (forward) GCGCGTGGATTATTTTTGGG 

919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

919.7 (reverse) CCCAAGGTAATGTAGTGCCG 

919.8 (reverse) TAAAAAAAAGTTCGACAGGG 

919.9 (reverse) CCGTCCGCCTGTCGTCGCCC 

919.10 (reverse) TCGTTCCGGCGGGGTCGGGG 

All documents cited herein are incorporated by reference in their entireties. 
The following Examples are presented to illustrate, not limit, the invention 

EXAMPLE 1 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 1: Oligonucleotides used for PCR for Examples 2-10 



ORF 


Primer 


Sequence 


Restriction sites 


279 


Forward 
Reverse 


CGCGGATCCCATATG-TTGCCTGCAATCACGATT 
<SEQ ID3021> 

CCCGCTCGAG-TTTAGAAGCGGGCGGCAA <SEQ 
ID 3022> 


BamHI-Ndel 
Xhol 


519 


Forward 
Reverse 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 
<SEQ ID 3023> 

CCCGCTCGAG-TTTGGCGG I I I I GCTGC <SEQ ID 
3024> 


BamHI-Ndel 
Xhol 


576 


Forward 
Reverse 


CGCGGATCCCATATG-GCCGCCCCCGCATCT 
<SEQ ID 3025> 

CCCGCTCGAG-ATTTAC I I I I I I GATGTCGAC 
<SEQ ID 3026> 


BamHI-Ndel 
Xhol 


919 


Forward 
Reverse 


CGCGGATCCCATATG-TGCCAAAGCAAGAGCATC 
<SEQ ID 3027> 

CCCGCTCGAG-CGGGCGGTATTCGGG <SEQ ID 
3028> 


BamHI-Ndel 
Xhol 


121 


Forward 


CGCGGATCCCATATG-GAAACACAGCTTTACAT 
<SEQ ID 3029> 


BamHI-Ndel 
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Reverse 


CCCGCTCGAG-ATAATAATATCCCGCGCCC <SEO 
ID 3030> 


Xhol 


128 


1 VJI W CM U 


bouuuM I OOUM 1 M I u-MU I bMUAAUoUAU 1 <SEO 

ID 3031 > 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-GACCGCGTTGTCGAAA <SEO in 
3032> 


Xhol 


206 


Forward 


<SEQ ID 3033> 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC 
<SEQ ID 3034> 


Xhol 


287 


Forward 


CCGGAATTCTARflTAft^.rTTTPArtr^T^^^r^^ 
<SEQ ID 3035> 


EcoRI-Nhel 




Reverse 


CCCGCTCGAG-ATCCTGCTC 1 1 1 1 1 1 GCC <SEO in 
3036> 


Xhol 


406 


Forward 


CGCGGATCCCATATG-TGCGGGACACTGACAG 
<SEQ ID 3037> 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-AGGTTGTCCTTGTCTATG <SEO 
ID 3038> 


Xhol 



Localization of the ORFs 

The following DNA and amino acid sequences are identified by titles of the following 
form: [g, m, or a] [#].[seq or pep], where "g" means a sequence from N. gonorrhoeae, "m" 
means a sequence from N. meningitidis B, and "a" means a sequence from N. meningitidis A; 
"#" means the number of the sequence; "seq" means a DNA sequence, and "pep" means an 
amino acid sequence. For example, "gOOl.seq" refers to an N. gonorrohoeae DNA sequence, 
number 1. The presence of the suffix "-1" to these sequences indicates an additional sequence 
found for the same ORF, thus, data for an ORF having both an unsuffixed and a suffixed 
sequence designation applies to both such designated sequences. Further, open reading 
frames are identified as ORF #, where "#" means the number of the ORF, corresponding to 
the number of the sequence which encodes the ORF, and the ORF designations may be 
suffixed with ".ng" or ".a", indicating that the ORF corresponds to a N. gonorrhoeae sequence 
or a N. meningitidis A sequence, respectively. The word "partial" before a sequence indicates 
that the sequence may be a partial or a complete ORF. Computer analysis was performed for 
the comparisons that follow between "g", "m", and "a" peptide sequences; and therein the 
"pep" suffix is implied where not expressly stated. Further, in the event of a conflict between 
the text immediately preceding and describing which sequences are being compared, and the 
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designated sequences being compared, the designated sequence controls and is the actual 
sequence being compared. 



ORF: contig: 
279 gnm4.seq 

The following partial DNA sequence was identified inN. meningitidis <SEQ ID 3039>: 
m279.seq 

1 ATAACGCGGA TTTGCGGCTG CTTGATTTCA ACGGTTTTCA GGGCTTCGGC 
51 AAGTTTGTCG GCGGCGGGTT TCATCAGGCT GCAATGGGAA GGTACGGACA 
101 CGGGCAGCGG CAGGGCGCGT TTGGCACCGG CTTCTTTGGC GGCAGCCATG 
151 GCGCGTCCGA CGGCGGCGGC GTTGCCTGCA ATCACGATTT GTCCGGGTGA 

2 01 GTTGAAGTTG ACGGCTTCGA CCACTTCGCT TTGGGCGGCT TCGGCACAAA 
251 TGGCTTTAAC CTGCTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

3 01 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA GGCGCACGAG 
351 TTTGACCGCG TCGGCAAAAT TCAATGCGCC GGCGG CAACG AGTGCGGTGT 

4 01 ATTCGCCGAG GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 
451 TCTAAATAG 

This corresponds to the amino acid sequence <SEQ ID 3040; ORF 279>: 
m279.pep 

1 ITRICGCLIS TVFRASASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 ARPTAAA LPA ITICPGELKL TASTTSLWAA SAQMALTCSS SKPRIAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKFNAPAAT SAVYSPRLCP ATAAGVLPPA 
151 SK* 

The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 3041>* 

g279.seq 

1 atgacgcgga tttgcggctg cttgatttca acggttttga gtgtttcggc 
51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata 
101 ccggcagcgg cagggcgcgt ttggctccgg cttctttggc ggcagccatg 
151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 
201 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 
251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct 
301 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag 
351 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt 
401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 
451 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 3042* ORF 279 ng>* 
g279.pep ^ ' 5 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 VRPTAAA LPA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT 
151 SK* 

ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng) 
from N. gonorrhoeae; 

10 20 30 40 50 60 

m279.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

HMIIMIIIh Mil MMMIII MIMMIMIMM II I Mill Ml || 1 1 Ml 

g279 MTRICGCLISTVLSVSASLSAAGFIRLQWEGTDTGSGRARLAPASLAAA>lVRPTiL 1 ^ 

10 20 30 40 50 60 
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70 80 90 100 HO 120 

m2 79 .pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
'I Ml I II I II MM I |||: I I I I I | j | :: | j | M I I I I I I I I I f [ I | | | | | f | | 
g279 ITTCPGELKLTASTTSPCADSAQICLTCSSSKPKMAAIAPTPCGTADCISSARRRTSLTA 

70 80 90 100 HO 120 

130 140 150 

m279 . pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX 
IN M M M I I ! i I I I M I I I ; | | ! | | : [ | | 
g 2 7 9 S AKSNAS AATS AVYS PRLC P ATAAGVLP PTS KX 

130 140 150 



a279. seq 








1 


ATGACNCNGA 


TTTGCGGCTG 


CTTGATTTCA 


51 


GAGTTTGTCG 


GCGGCGGGTT 


TCATGAGGCT 


101 


CNGGCAGCGG 


CAGGGCGCGT 


TTGGCGCCGG 


151 


GCGCGCTCGA 


CGGCGGCGGC 


ATTGCCTGCA 


201 


GTTGAAGTTG 


ACGGCTTCAA 


CCACTTCATC 


251 


TTTGTTTTAC 


CTGTTCATCT 


TCCAAGCCGA 


301 


ACGCCTTGCG 


GTACGGCGGA 


CTGCATCAGT 


351 


TTTGACCGCG 


TCGGCAAAAT 


CCAATGCGCC 


401 


ATTCGCCGAN 


GCTGTGTCCG 


GCAACGGCGG 


451 


TCCGAATAG 







This corresponds to the amino acid sequence <SEQ ID 3044; ORF 279.a>: 

a27 9.pep 

1 MTXICGCL1S TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAASI 

5i ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIAP 

101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 



151 SE* 



m279/a279 ORFs 279 and 279.a showed a 88.2% identity in 1 52 aa overlap 

10 20 30 40 50 60 

m279.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 
:| MINIMI I I Ml II II I |:M I M II I I II M | | | I M I I I I:: | | 

MTX I CGCL I ST VXRAS AS LSAAGFMRLQWEGTDTGSGRAR LAPASLAASI ARSTAAALPA 
10 20 30 40 50 60 



a279 



70 80 90 100 110 120 
m279.pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
11 IIIMIIMIIII I Ml: MIIIIIMMI II Ml || MM || | | | , 

ITTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADC IS SARXRTSLTA 
70 80 90 100 HO 120 



a279 



130 140 150 

m27 9 .pep S AK FN A PAAT S AV Y S PRLC P ATAAGVLP PA SKX 

III MINIMUM I I li M I I ! I I I I I : | 
a 2 7 9 SAKSNAPAAT S AV YS PXLC PATAAGVL P PASEX 

130 140 150 



519 and 519-1 gnm7.seq 

The following partial DNA sequence was identified in N. meningitidis <SEO ID 3045>- 

m519.seq (partial) 1 x 

1 . . TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA 

51 AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC GGGgCTTgGG 

101 GTGTGAAGGT TTTGCGTTAT GAGATTAAAG ACTTGGTTCC GCCGCAAGAA 

151 ATCCTTCGCT CAATGCAGGC GCAAATTACT GCCGAACGCG AAAAACGCGC 

201 CCGTATCGCC GAATCCGAAG GTCGTAAAAT CGAACAAATC AACCTTGCCA 
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GTGGTCAGCG CGAAGCCGAA ATCCAACAAT CCGAAGGCGA GGCTCAGGCT 
GCGGTCAATG CGTCAAATGC CGAGAAAATC GCCCGCATCA ACCGCGCCAA 
AGGTGAAGCG GAATCCTTGC GCCTTGTTGC CGAAGCCAAT GCCGAAGCCA 
TCCGTCAAAT TGCCGCCGCC CTTCAAACCC AAGGCGGTGC GGATGCGGTC 
AATCTGAAGA TTGCGGAACA ATACGTCGCT GCGTTCAACA ATCTTGCCAA 
AG AAAG CAAT ACGCTGATTA TGCCCGCCAA TGTTGCCGAC ATCGGCAGCC 
TGATTTCTGC CGGTATGAAA ATTATCGACA GCAGCAAAAC CGCCAAaTAA 

This corresponds to the amino acid sequence <SEQ ID 3046; ORF 519>: 

m519.pep (partial) 

1 . . SVIGRMELDK TFEERDEINS TWAALDEAA GAWGVKVLRY EIKDLVPPQE 

51 ILRSMQAQIT AEREKRARIA ESEGRKIEQI NLASGQREAE IQQSEGEAQA 

101 AVNASNAEKI ARINRAKGEA ESLRLVAEAN AEAIRQIAAA LQTQGGADAV 

151 NLKIAEQYVA AFNNLAKESN TLIMPANVAD IGSLISAGMK IIDSSKTAK* 



251 
301 
351 
401 
451 
501 
551 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3047>- 

g519.seq 

1 atggaatttt tcattatctt gttggcagcc gtcgccgttt tcggcttcaa 

51 atcctttgtc gtcatccccc agcaggaagt ccacgttgtc gaaaggctcg 

101 ggcgtttcca tcgcgccctg acggccggtt tgaatatttt gattcccttt 

151 atcgaccgcg tcgcctaccg ccattcgctg aaagaaatcc ctttagacgt 

201 acccagccag gtctgcatca cgcgcgataa tacgcaattg actgttgacg 

251 gcatcatcta tttccaagta accgatccca aactcgcctc atacggttcg 

301 agcaactaca ttatggcaat tacccagctt gcccaaacga cgctgcgttc 

351 cgttatcggg cgtatggagt tggacaaaac gtttgaagaa cgcgacgaaa 

4 01 tcaacagtac cgtcgtctcc gccctcgatg aagccgccgg ggcttggggt 

451 gtgaaagtcc tccgttacga aatcaaggat ttggttccgc cgcaagaaat 

501 ccttcgcgca atgcaggcac aaattaccgc cgaacgcgaa aaacgcgccc 

551 gtattgccga atccgaaggc cgtaaaatcg aacaaatcaa ccttgccagt 

601 ggtcagcgtg aagccgaaat ccaacaatcc gaaggcgagg ctcaggctgc 

651 ggtcaatgcg tccaatgccg agaaaatcgc ccgcatcaac cgcgccaaag 

701 gcgaagcgga atccctgcgc cttgttgccg aagccaatgc cgaagccaac 

751 cgtcaaattg ccgccgccct tcaaacccaa agcggggcgg atgcggtcaa 

801 tctgaagatt gcgggacaat acgttaccgc gttcaaaaat cttgccaaag 

851 aagacaatac gcggattaag cccgccaagg ttgccgaaat cgggaaccct 

901 aattttcggc ggcatgaaaa attttcgcca gaagcaaaaa cgqccaaata 

951 a 

This corresponds to the amino acid sequence <SEQ ID 3048; ORF 519.ng>: 

g519 .pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 1DRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAN 

251 RQIAAALQTQ SGADAVNLKI AGQYVTAFKN LAKEDNTRIK PAKVAEIGNP 

301 NFRRHEKFSP EAKTAK* 



ORF 5 1 9 shows 87.5% identity over a 200 aa overlap with a predicted ORF (ORF 5 1 9.ng) 
from N. gonorrhoeae: 

m519/g519 

10 20 30 

m519 - pep SVIGRMELDKTFEERDEINSTWAALDEAA 

g519 YFQVTDPKXAS YGSSNYI^I TQLAQTTL^ 

90 1Q 0 HO 120 130 140 

40 50 60 70 80 90 

m519 . pep GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
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1 1 1 1 M 1 1 1 i ! 1 1 1 1 II 1 1 1 1 h I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 M I i 1 1 1 1 : 

g 519 GAWGVKVLRYEIKDLVPPQEILRAMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
150 160 170 180 190 200 

100 110 120 130 140 150 

m5 1 9 . pep I QQS EGE AQAAVNASNAEKI ARINRAKGE AES LRLVAE ANAE AI RQ I AAALQTQGGADAV 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I : I I I I I 
g 519 I QQS EGE AQAAVNASNAE KI AR I NRAKGEAE S LRLVAE ANAE ANRQ I AAALQTQS GADAV 

210 220 230 240 250 260 



160 170 180 190 200 

m519 .pep NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL- I SAGMKI IDSSKTAK 

I I I I I I I I ' I h I I I I h I I I I I : I I I I : : h =1111 

g 5 1 9 NLKI AGQ YVTAFKNLAKEDNTRI KPAKVAE I GNPNFRRHE KFS PEAKTAK 

270 280 290 300 310 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3049>: 

a519.seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

401 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3050; ORF 519.a>: 

a519.pep 

1 MEFFIILLAA VVVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 I SAGMKI IDS SKTAK* 

m519/a519 ORFs 519 and 519. a showed a 99.5% identity in 199 aa overlap 



10 20 30 

m519.pep SVIGRMELDKTFEERDEINSTWAALDEAA 

I I I I I I I I I I I I I I I ! t I I I I I I : I I I I I I 
a 519 YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKTFEERDEINSTVVSALDEAA 
90 100 110 120 130 140 

40 50 60 70 80 ' 90 

m519.pep GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 

l li 1 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

a519 GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
150 160 170 180 190 200 



100 110 120 130 140 150 
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m519 .pep 



a519 



IQQSEGEAQMVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
M I I I II I I I I I I I I I I I I I I I I I | | | | | | ! | | M | | | | | | | | | | | [ | | | | | | || | | | | | 

IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
210 220 230 240 250 260 



m519 . pep 



a519 



160 170 180 190 200 

NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 
I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I 1 I I I I I | | | | 
NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 
270 280 290 300 310 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 305 1>: 

m519-l. seq 



1 ATGGAATTTT TCATTATCTT GTTGGTAGCC GTCGCCGTTT TCGGTTTCAA 

51 ATCCTTTGTT GTCATCCCAC AACAGGAAGT CCACGTTGTC GAAAGGCTGG 

101 GGCGTTTCCA TCGCGCCCTG ACGGcCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC TGTTGTTGCG GCTTTGGACG AGGCGGCCGG GGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC CGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCTGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGGAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 3052; ORF 519-1>: 



1 MEFFIILLVA VAVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTVVA ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3053>: 

g519-l . seq 



1 ATGGAATTTT TCATTATCTT GTTGGCAGCC GTCGCCGTTT TCGGCTTCAA 

51 ATCCTTTGTC GTCATCCCCC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGATAA TACGCAATTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGATCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCAAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG GGCTTGGGGT 

4 51 GTGAAAGTCC TCCGTTACGA AATCAAGGAT TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCGCA ATGCAGGCAC AAATTACCGC CGAACGCGAA AAACGCGCCC 

551 GTATTGCCGA ATCCGAAGGC CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGTG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCCAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GCGAAGCGGA ATCCCTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGGGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTAGCCGC GTTCAACAAT CTTGCCAAAG 



m519-l. 
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851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GCATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3054; ORF 519-1. ng>: 

g519~l.pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTVVS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



ra519-l/g519-l ORFs 519-1 and ' 51 9-1 . ng showed a 99.0% identity in 315 aa 
overlap 

10 20 30 40 50 60 

g519-l.pep MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
I I I M I I I : I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | | | | I i | | | | | | | | | 
m519-l MEFFIILLVAVAVFGFKSFWIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

g519-l.pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1. 1 I | | | | | | | 
m519-l KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

"70 80 90 100 110 120 

130 140 150 160 170 180 

g519-l.pep RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
I I I I I I I I I I I I I M I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I | | | | : | | | | j | | | | | 
m519-l RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 ~ 180 

190 200 210 220 230 240 

g519-l.pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
M I I II I I I I I I II I I I I I I I I I I I I I I I I I | | | | | j | | M | || | | | | | | | | | | | | | | | | 
m519-l KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

190 200 210 220 230 240 

250 260 270 280 290 300 

g519-l.pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
M I I I I I I I I I I I I I I II I I I I I I I I M I II I I I I I I I I I I I I | | I | | | | | | | | || | | | | 
m519-l LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 

250 260 270 280 290 300 

310 

g5 1 9- 1 . pep I SAGMKI I DSSKTAKX 
I I I I I I I I I I I I I I I I 
m5 1 9 - 1 I SAGMK I I DSSKTAKX 

310 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3055>: 

a519-l . seq 

1 ATGGAATTTT 

51 ATCCTTTGTT 

101 GGCGTTTCCA 

151 ATCGACCGCG 

201 ACCCAGCCAG 

251 GTATCATCTA 

301 AGCAACTACA 

351 CGTTATCGGG 



TCATTATCTT GCTGGCAGCC 
GTCATCCCAC AGCAGGAAGT 
TCGCGCCCTG ACGGCCGGTT 
TCGCCTACCG CCATTCGCTG 
GTCTGCATCA CGCGCGACAA 
TTTCCAAGTA ACCGACCCCA 
TTATGGCGAT TACCCAGCTT 
CGTATGGAAT TGGACAAAAC 



GTCGTTGTTT TCGGCTTCAA 
CCACGTTGTC GAAAGGCTCG 
TGAATATTTT GATTCCCTTT 
AAAGAAATCC CTTTAGACGT 
TACGCAGCTG ACTGTTGACG 
AACTCGCCTC ATACGGTTCG 
GCCCAAACGA CGCTGCGTTC 
GTTTGAAGAA CGCGACGAAA 
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4 01 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3056; ORF 519-La>: 

a519-l .pep. 

1 MEFFIILLAA WVFG FKSFV , VIPOOEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KETPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKT FEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

m519-l/a519-l ORFs 519-1 and 519-1. a showed a 99.0% identity in 315 

overlap 

10 20 30 40 50 60 

a519-l.pep MEFFI ILLAAVVVFGFKS FVVI PQQEVHWERLGRFHRALTAGLNILI PFI DRVAYRHSL 
I I N I I I I : I I : I I I I I I I I I I M | | | | | | | | | | | | | ( | | | | ! | | , | | ,,,,,, | ,,,, , 
ME FFI I LLVAVAVFGFKS FVVI PQQE VH VVERLGRFHRALTAGLN I LI PFI DRVA YRHS L 
10 20 30 40 50 60 



m519-l 



70 80 90 100 110 120 

a519-l. pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I | | | M I I I I I I I I I I I 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
70 80 90 100 110 120 



m519-l 



13 0 140 150 160 170 180 

a519-l . pep RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
N I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I 
m519-l RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

13 0 140 150 160 170 180 

19 0 200 210 220 230 240 

351 9-1. pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

IIINIMIIIIIIIIIMIIIIIIIIIMIIIIII Illlllllllll | 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
190 200 210 220 230 240 



m519-l 



250 260 270 280 290 300 

3519-1 . pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
250 260 270 280 290 300 



m519-l 



310 

3519-1. pep I SAGMK I I DSSKTAKX 
I I I I I I I I I I I I I I I I 
m519-l I SAGMK I I DSSKTAKX 

310 
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576 and 576-1 gnm22.seq 



The following partial DNA sequence was identified in N. meningitidis <SEO ID 3057>: 



. seq . . 


(partial) 


1 


. . ATGCAGCAGG 


51 


GCAAATGAAG 


101 


CCATGCAGGC 


151 


GCTCAGGAAG 


201 


AGAAAAACAC 


251 


TTCTGAAAGA 


301 


CTGCAATACA 


351 


CGACATCGTT 


401 


TCGACAGCAG 


451 


GTGATTCCGG 


501 


AGCCACGTTC 


551 


GCGACAAAAT 


601 


AAAATCGGCG 


651 


CATCAAAAAA 



This corresponds to the amino acid sequence <SEQ ID 3058; ORF 576>: 

m576.pep. . (partial) 

1 . .MQQASYAMGV DIGRSLKQMK EQGAEIDLKV . FTEAMQAVYD GKEIKMTEEQ 
51 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 
101 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

151 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 
201 KIGAPENAPA KQPAQVDIKK VN* 



. seq. 


. (partial) 


1 


. . atgggcgtgg 


51 


ggaaatcgat 


101 


gcaaagaaat 


151 


ttcctgcagg 


201 


gaaggccaac 


251 


aagacggcgt 


301 


cagggtgaag 


351 


cgaaggccgc 


401 


gcggcccggc 


451 


ggcgtacggc 


501 


caaccttgcc 


551 


ccactttggt 


601 


gcgcccgcca 



This corresponds to the amino acid sequence <SEQ ID 3060; ORF 576.ng>: 

g57 6.pep. . (partial) 

1 . . MGVDIGRSLK QMKEQGAEID LKVFTDAMQA VYDGKEIKMT EEQAQEVMMK 

51 FLQEQQAKAV EKHKADAKAN KEKGEAFLKE NAAEDGVKTT ASGLQYKITK 

101 QGEGKQPTKD DIVTVEYEGR LIDGTVFDSS KANGGPATFP LSQVIPGWTE 

151 GVRLLKEGGE ATFYIPSNLA YREQGAGEKI GPNATLVFDV KLVKIGAPEN 
201 APAKQPDQVD IKKVN* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m576/g576 ORFs 576 and 576. ng showed a 97.2% identity in 215 aa overlap 

10 20 30 40 50 60 

m57 6 . pep MQQAS YAMGVDI GRSLKQMKEQGAE I DLKV FTEAMQA V YDGKE I KMTEEQAQE VMMKFLQ 

' I I M M M I I I I I I I I I i I I I I I I : I I I M | I | | | | | | | | | , | | | | | , j | j | 
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9 576 MGVDIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQ 

10 20 30 40 50 

70 8 ° 90 100 110 120 

m57 6.pep EQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIV 
"IIIIIMIIilllllllMllMiMlhllllllllliiiiiiii, iiiiinn, - 

g576 EQQAKAVEKHKADAKANKEKGEAFLKENAAEDGVKTTASGLQYKITKQGEGKQPTKDDIV 
60 70 80 90 100 HQ 

130 140 150 160 170 180 

m57 6.pep TVEYEGRLIDGTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYRE 
MINN I UNI ill I 11 II 11:1 Ml I M ! I M I I I I: M ! I ||| IIIIIMMMM 
g576 TVEYEGRLIDGTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEAT FYIPSNLAYRE 

120 130 140 150 160 170 

190 200 210 220 

m57 6 . pep QGAGDKIGPNATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
I I I I : I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
957 6 QGAGEKIGPNATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
180 190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 306 1>: 

a57 6.seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3062; ORF 576. a>: 

a576. pep 

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

m576/a576 ORFs 576 and 576. a showed a 99.5% identity in 222 aa overlap 

10 20 30 

mb 1 b ' P e P MQQASYAMGVDIGRSLKQMKEQGAEIDLKV 

I I M I I I I I I I I I I I I i I | | | M f I I I I I I 
CGKKEAAPASASE PAAASSAQGDTSS IGSTMQQAS YAMGVDIGRSLKQMKEQGAE I DLKV 
30 40 50 60 70 80 



a576 



40 50 60 70 80 90 

m57 6 . pep FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 

I I I HI M I I I I I I I I | | | j | | | | I I I I I I I I I I I I I I I I I I I I I I I I I ! I I | | | 

FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 
90 3-00 HO 120 130 140 



a576 
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100 110 120 130 140 150 

m57 6.pep KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ 

I I I I I M I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I | I I I I 
a57 6 KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ 

150 160 170 180 190 200 

160 170 180 190 200 210 

m57 6.pep VIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

II I I I II I I I I I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I 
a57 6 VILGWTEGVQLLKEGGEAT FYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

210 220 230 240 250 260 

220 

m5 7 6 . pep KQPAQVD I KKVNX 

I I I I I I I I I I I I I 
a 57 6 KQPAQVD I KKVNX 

270 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3063>: 

m576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

401 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

7 51 AAAATCGGCG CACGCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3064; ORF 576-l>: 

m57 6-l.pep 

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVD IKK VN* 

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3065>: 

g576-l. seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTGCCGCG CAGGGCGACA CCTCTTCAAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC AATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 ACAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGATG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCCCAGGAAG TGATGATGAA ATTCCTGCAG GAGCAGCAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGATG CGAAGGCCAA CAAAGAAAAA GGCGAAGCCT 

401 TCCTGAAGGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGT 

4 51 CTGCAGTACA AAATCACCAA ACAGGGTGAA GGCAAACAGC CGACAAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACCGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG CCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACGG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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7 01 GCGAAAAAAT CGGTCCGAAC GCCACTTTGG TATTTGACGT GAAACTGGTC 
7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG ATCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3066; ORF 576-l.ng>: 

g576-l.pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASAA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTDAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPATFPLSQ 

201 VIPGWTEGVR LLKEGGEATF YIPSNLAYRE QGAGEKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPDQVDIKK VN + 



g576-l/m576-l ORFs 576-1 and 576-1. ng showed a 97.8% identity in 272 aa 

overlap 

10 20 30 40 50 60 

g57 6-l .pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASAAQGDTSSIGSTMQQASYAMGV 
I I M I I I I I I I I I I I I I I I I t I I I I I ! I I I I I II I I I I : I I | | | | | | | | { | | | | | | | | | | 
m57 6-l MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

g57 6-1 . pep DIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
I M II I II I I I fli I Ml 111 1:1 I I I i I Ml I I I M Ml M I I I I I I II I Ml I I I I I I 
m57 6-l DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

g57 6-l.pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
N I I I I I I I I I I I I I I I II I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | I I I I I I I | I I 
m57 6-l KADAPCANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 

130 140 150 160 170 180 

190 200 210 220 230 240 

g57 6-l . pep GTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYREQGAGEKIGPN 
M M M M I II II M II II I II I M M I I M M M I I M II I I I M M M M I I M M M 
m576-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

g57 6-1 . pep ATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
I I I I I M II II I I I M M I I I I I I II I I M I I 
m57 6-1 ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3067>: 

a576-l . seq 

1 ATGAACACCA 

51 ACTTTCCGCC 

101 CTGCCGCCGC 

151 ATGCAGCAGG 

201 GCAAATGAAG 

251 CCATGCAGGC 

301 GCTCAGGAAG 

351 AGAAAAACAC 

4 01 TTCTGAAAGA 

4 51 CTGCAATACA 

501 CGACATCGTT 

551 TCGACAGCAG 

601 GTGATTCTGG 

651 AGCCACGTTC 



TTTTCAAAAT CAGCGCACTG 
TGCGGCAAAA AAGAAGCCGC 
TTCTTCCGCG CAGGGCGACA 
CAAGCTATGC GATGGGCGTG 
GAACAGGGCG CGGAAATCGA 
AGTGTATGAC GGCAAAGAAA 
TCATGATGAA ATTCCTTCAG 
AAGGCGGACG CGAAGGCCAA 
AAATGCCGCC AAAGACGGCG 
AAATCACCAA ACAGGGCGAA 
ACCGTGGAAT ACGAAGGCCG 
CAAAGCCAAC GGCGGCCCGG 
GTTGGACCGA AGGCGTACAG 
TACATCCCGT CCAACCTTGC 



ACCCTTTCCG CCGCTTTGGC 
CCCCGCATCT GCATCCGAAC 
CCTCTTCGAT CGGCAGCACG 
GACATCGGAC GCTCCCTGAA 
TTTGAAAGTC TTTACCGAAG 
TCAAAATGAC CGAAGAGCAG 
GAACAACAGG CTAAAGCCGT 
TAAAGAAAAA GGCGAAGCCT 
TGAAGACCAC TGCTTCCGGC 
GGCAAACAGC CGACCAAAGA 
CCTGATTGAC GGTACGGTAT 
TCACCTTCCC TTTGAGCCAA 
CTTCTGAAAG AAGGCGGCGA 
CTACCGCGAA CAGGGTGCGG 
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701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3068; ORP 576-1. a>: 

a576-l.pep 

1 MNT I FKI SAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

a576-l/m576-l ORFs 576-1 and 576-1. a showed a 99.6% identity in 272 aa 

overlap 

10 20 30 40 50 60 

a57 6-l.pep MNT I FKI S ALTLS AALALS ACGKKEAAPAS AS E PAAAS S AQG DTS S I G S TMQQAS Y AMG V 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ] II I I I I I I I I I I I I | I | I | | | 
m5 7 6- 1 MNT I FKI SALT LS AALALS ACGKKEAAPAS AS E PAAAS SAQGDTS S IGSTMQQAS YAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 ■ 

a57 6-1 . pep DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I 
m57 6-l D I GRS LKQMKEQGAE I DLKVFTEAMQAV YDGKE I KMTEEQAQE VMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

a57 6-l.pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I ! I I I I I I I I I I I I I I | 
m57 6- 1 KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDI VTVEYEGRLI D 

130 140 150 160 170 180 

190 200 210 220 230 240 

a57 6-1 . pep GTVFDSSKANGGPVTFPLSQVILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 
I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M M I I I I 
m57 6-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

a57 6-1 . pep ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
I I I I I I I I I I I I II I I I I I i I I I I I I I I I I I I I 
m57 6-l AT LV FD VKLVK I G A PEN AP AKQ PAQV D I KKVNX 

250 260 270 



919 gnm43.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3069>: 

m919.seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TACGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTC TATACCGTTG TACCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCTTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

401 CCGGTACGGT TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGACAGG 
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451 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCcG ATTCCCCATC ACCGCGCGCA CAACAGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCTG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCyTA CGTTTCCATC GGACGCTATA TGGCGGATAA GGGCTACCTC 

901 AAACTCGGAC AAACCTCCAT GCAGGGCATT AAGT CTT AT A TGCGGCAAAA 

951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCCGCTGA TGGGGGAATA TGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGTGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTC CTACCCAACG 

13 01 GTATGAAGCC CGAATACCGc CCGTAA 



This corresponds to the amino acid sequence <SEQ ID 3070; ORF 919>: 

m919 . pep 

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKQFFER YFTPWQVAGN GS LAG TVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSRFPI TARTTAI KGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

3 01 KLGQTSMQGI KSYMRQNPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG 

3 51 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

The following partial DNA sequence was identified in N [gonorrhoeae <SEQ ID 3071>: 

g919 . seq 

1 ATGAAAAAAC ACCTGCTCCG CTCCGCCCTG TACGGcatCG CCGCCgccAT 

51 CctcgCCGCC TGCCAAAgca gGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG CCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TTGCCGGCGG CGGGGCCGTC TATACCGTTG TGCCGCACCT 

201 GTCCATGCCC CACTGGGCGG CGCaggATTT TGCCAAAAGC CTGCAATCCT 

2 51 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

3 01 TGCGCCCAAG CCTTTCAAAC CCCCGTGCAT TCCTTTCAGG CAAAGcGgTT 
3 51 TTTTGAACGC TATTTCACGC cgtGGCaggt tgcaggcaAC GGAAGcCTTG 
401 Caggtacggt TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGGCAGG 
451 CGGACGGAAC GGGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 
501 CTCCGTCCCG CTGCCTGCCG GTTTGCGGGG CGGAAAAAAC CTTGTCCGCA 
5 51 TCAGGCAGac gg GG AAAAAC AGCGGCACGA TCGACAATGC CGGCGGCACG 
601 CATACCGCCG ACCTCTCCCG ATTCCCCATC ACCGCGCGCA CAACGGcaat 
651 caaaGGCAGG TTTGAaggAA GCCGCTTCCT CCCTTACCAC ACGCGCAACC 
701 AAAtcaacGG CGGCgcgcTT GACGGCAAag cccCCATCCT CggttacgcC 
751 GAagaccCcG tcgaacttTT TTTCATGCAC AtccaaggCT CGGGCCGCCT 
801 GAAAACCCcg tccggcaaat acatCCGCAt cggaTacgcc gacAAAAACG 
851 AACAtccgTa tgtttccatc ggACGctaTA TGGCGGACAA AGGCTACCTC 
901 AAGctcgggc agACCTCGAT GCAGGgcatc aaagcCTATA TGCGGCAAAA 
951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC GGCAATGAGG GCCCCGTCGG CGCACTGGGC 

1051 ACGCCACTGA TGGGGGAATA CGCCGGCGCA ATCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CAGGCAGCGC GATCAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGTTAC GGCGACGAAG CCGGCGAACT 

12 51 TGCCGGCAAA CAGAAAACCA CGGGATACGT CTGGCAGCTC CTGCCCAACG 

13 01 GCATGAAGCC CGAATACCGC CCGTGA 
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This corresponds to the amino acid sequence <SEQ ID 3072; ORF 919.ng>: 

g919 .pep 

1 MKKHLLRSAL YGIAAAILAA CQSRSIQTFP QPDTSVINGP DRPAGIPDPA 

51 GTTVAGGGAV YTWPHLSMP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKRFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDGR 

151 RTERARFPIY GIPDDFISVP LPAGLRGGKN LVRIRQTGKN SGTIDNAGGT 

2 01 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMRQNPQR LAEVLGQNPS YIFFRELAGS GNEGPVGALG 

351 TPLMGEYAGA IDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 



ORF 919 shows 95.9 % identity over a 441 aa overlap with a predicted ORF (ORF 919.ng) 
from N. gonorrhoeae: 

m919/g919 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
MhhhlllllMIIIMIIhlllllMIIIIMMIIIhMlllllllhlllll 
g919 MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

10 20 30 40 50 60 

70 80 90 100 HO 120 

m919 . pep YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

liMMIhllllMIIIIIMIIIIIIIIMMMIMMMIIIIIIIIIIthllll • 

g919 YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 

70 80 90 100 110 ~ 120 



130 140 150 160 170 180 

m919 . pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

IMIIIIIIIIIIIIMMIIIItllll III MMMMIIMMMIIMMMM 

g919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 



130 



140 



150 



160 



170 



180 



190 200 210 220 230 240 

m919 . pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

Ml Mi II IIMMIhllMilllll III II Mill I II Mill II II MM IMMII 

g919 LVRI RQTGKNSGTIDNAGGTHTADLSRFP I TARTTAI KGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919.pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

1 1 1 1 1 1 < I! 1 1 1 1 1 1 i 1 1 1 1 1 M I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ M 1 1 1 M 1 1 1 1 1 1 1 ; 1 M 1 1 

g919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

250 260 270 280 290 300 



310 32 0 330 340 350 360 

m9 1 9 . pep KLGQTSMQGI KSYMRQNPQRLAEVLGQNPSYI FFRELAGSSNDGPVGALGTPLMGEYAGA 

I I I I I I I I I I h I I I I I I I I I I'l I I I I I | | | | | | | | | | | | : | : | | | | | | | | | | | | | | | | | 
9919 KLGQTSMQGI KAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 



310 



320 



330 



340 



350 



360 



37 0 380 390 400 410 420 

m9 19 . pep VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAI KGAVRVDYFWGYGDEAGELAGK 

= 1 i 1 1 1 1 1 1 1 1 ( M I J I i 1 1 1 1 1 1 1 1 1 1 1 ! I ! 1 1 1 1 M 1 1 1 1 ! f M }[ 1 1 1 1 1 1 J I F ! r f 

g 9 1 9 I DRHY I TLGAPL FVATAH PVT RKALNRL I MAQDTGS A I KGAVRVDYFWGYGDEAGELAGK 

37 0 380 390 400 410 420 
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430 440 
m919 .pep QKTTGYVWQLLPNGMKPEYRPX 

I M 1 1 II M 1 1 i 1 1 ; II 1 1 1 M 

g 9 1 9 QKTTGYVWQLLPNGMKPEYRPX 

430 440 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3073>: 

a919.seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TGCGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTT TATACCGTTG TGCCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCGTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

4 01 CCGGTACGGT TACCGGCTAT TACGAGCCGG TGCTGAAGGG CGACGACAGG 

4 51 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCCA ATTCCCCATC ACTGCGCGCA CAACGGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCCG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACC CCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCCTA CGTTTCCATC GGACGCTATA TGGCGGACAA AGGCTACCTC 

901 AAGCTCGGGC AGACCTCGAT GCAGGGCATC AAAGCCTATA TGCAGCAAAA 

951 CCCGCAACGC CTCGCCGAAG TTTTGGGGCA AAACCCCAGC TATATCTTTT 

1001 TCCGAGAGCT TACCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCGGCTGA TGGGCGAGTA CGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTT CTGCCCAACG 

1301 GTATGAAGCC CGAATACCGC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 3074; ORF 919.a>: 

a919.pep 

1 MKKYLFRAAL CGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTVVPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SVQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSQFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMQQNPQR LAEVLGQNPS YIFFRELTGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

m919/a919 ORFs 919 and 919.a showed a 98.6% identity in 441 aa overlap 

10 20 30 " 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

I M I I I I I I I I I II I I | I | | || I I I I I I I I I I I I | | | | | | || | | || j | | | | | | j 

a919 MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

10 • 20 30 40 50 60 

70 80 90 100 110 120 

m919 . pep YTVVPHLS LPHWAAQD FAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVH S FQAKQFFER 

M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I N I I MINIM 
a919 YTVVPHLS LPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m919 . pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
I I I M I II I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I | | M | | | | | | M | | | | | M 
a919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m919.pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I | | | | I | | | || | || | 
a919 LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919.pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I | I I | | || | | | M I 
a919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m919 . pep KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
I M I I I I I M i: I !: I I M M I I I M I I ! II I I I I I I : I I I M ! I I 1 I I I M I I M I I I I 
a919 KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m919.pep VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I M I I II | I | I 
a919 VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

370 380 390 400 410 420 

430 440 
m9 1 9 . pep QKTTGYVWQLLPNGMKPEYRPX 
I I I I I I I I II I I I I I I I I I I I I 
a919 QKTTGYVWQLLPNGMKPEYRPX 
430 440 



121 and 121-1 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 3075>: 

m!21.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

4 01 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

4 51 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA CAAAAACGGT GCAAAGTCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACGCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGACG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGnATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
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1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT AT TAT TAT TG ' 
1101 A 

This corresponds to the amino acid sequence <SEQ ID 3076; ORF 12 1>: 

ml21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

The following partial DNA sequence was identified inN. gonorrhoeae <SEQ ID 3077>: 

gl21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

401 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 

601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 

651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

701 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc 

751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tgcggacgct 

801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3078; ORF 121. ng>: 

gl21 .pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRWLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 
from N. gonorrhoeae: 

m!21/gl21 

10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I M I : I I I I II I I : I 1 I 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I I I I I I I I M I I I I I I I II I I I I I I I I I I ! I I I I I I I I t I II I I I I I M I I I I 
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gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
I : : : 

gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: : I I I I I I I I I I : I I I I I I I I I I | | | | | | | | : | IMM 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I II I I I I i I I I | | | | | | | | 

gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | 1 I | | | | 

gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3079>: 

al21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GT ACT GAT AC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 3080; ORF 121. a>: 

al21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 
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301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 
351 ATGASKPCIL GAGYYY* 

ml21/al21 ORFs 121 and 121. a showed a 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml 21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I M I I I I I ! I I I I I I I I I I I I i I I I I I I I I I I I I | | | | | | | | | | | | | : | | | | | | | | | | | | 
al21 • METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I i I I I I I I I I I I I I I I I I I I i I I I I M I I I I I I I ! I I I I i t I : I I : I I I i I I I I 
al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 21 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
al21 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
s I I I I I I I I I I : I I II I I I I I I I I I I I [ t I I I I I I I i I 

al21 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

m!21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I I I: I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I i I j I I I I I I | | | 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml2l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHPCATGASKPCIL 
I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I : I I I I : I I f I I I I ! I I I I I I M i I I 
al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml 21. pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 308 1>: 

ml21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CAT ATT GCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 
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851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT 

1101 A 



CAATCCTGTT 
CCCTGCACAG 
GCCGNATTTG 
TCCGCACAAA 
ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 3082; ORF 121-1>: 

ml21-l.pep 



1 


METQLYIGIM 


51 


DLQDTGADEL 


101 


TVRHAPEHGY 


151 


HEALFRDNRE 


201 


HWQLPYDKNG 


251 


ETYLDGGENR 


301 


LMADLAECFG 


351 


ATGASKPCIL 



ml21-l/gl21 ORFs 121-1 and 121. ng showed a 95.6% identity in 366 aa overlap 



ml21-l .pep 
gl21 



ml21-l .pep 



gl21 



ml21-l .pep 



gl21 



m!21-l .pep 



gl21 



ml21-l .pep 



gl21 



ml21-l .pep 



gl21 



10 20 30 40 50 60 

METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
i N I I I I I I I I I I I I I I I I | I | : | | | | | | | | | | | | | | | | | | j | I | I : | | ] | | | I j: | | | 

METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
MII:IMMillllHI!llllllllll I Ml Mill III IN INI II MINIM I 
HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
li I I I I M M II M M M I M I M I I M M M i M M I M M M I I I I I II I II III | 
AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

I ' M I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M M M I I : M M 1 I I 

PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I M M I I M I M M M M M M M I M M M 
GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I I M I I I I I I M : M I I I M I I M M M M I M M M M I M I I M M II M 
LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 " 360 



m!21-l .pep XAGYYYX 
I I I I M 

g!21 GAGYYYX 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3083>: 

al21-l.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 
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51 


GGCGGATGCC 


GTACTGATAC 


101 


AAGGGCACGC 


CTTTACCCCC 


151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTGT 


251 


GTCAAAACCT 


CGCGCCGTCC 


301 


ACCGTCAGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGGAACGGA 


401 


GCCGCGACCT 


TGCGGCCGGC 


451 


CACGAAGCCC 


TGTTCCGCGA 


501 


CGGCGGGATT 


GCCAACATCA 


551 


GCTTCGACAC 


AGGACCGGGC 


601 


CACTGGCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACTGCTCG 


701 


AACCCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGATTC 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTGGCAGA 


951 


CACCGCCGAA 


CTGAACCTCG 


1001 


CATGGATGGC 


GGCGTGTTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 





GGATGGACGG CGGCAAATGG CTGGGCGCGG 
TACCCCGGCA GGTTACGCCG CAAATTGCTG 
GGACGAACTG CACCGCAGCA GGATGTTGTC 
ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACAGTTAC AGCGTACAGC TTGCCGATTT 
CTCAGATTTT TACCGTCGGC GACTTCCGCA 
GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 
CGACAGGGAA ACACGCGCGG TACTGAACAT 
GCGTACTCCC CCCCGACGCA CCCGCCTTCG 
AATATGCTGA TGGACGCGTG GATGCAGGCA 
CAAAAACGGT GCAAAGGCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 
GTCAACCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG GGCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 3084; ORF 121-l.a>: 

al21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/al21-l ORFs 121-1 and 121-1. a showed a 96.4% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
1 1 1 1 1 1 1 1 i ■ I I ■ ■ ■ I I I I I I ■ I I I I I I I I I I t I I I I I I I I I f = I | | | | | | | I | f | 

METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLL DLQDTGADEL 
10 20 30 40 50 60 



al21-l 



70 80 90 100 110 120 

ml21-l.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

llll = IIMM UNI I Ml M I i I II M I M I I I I I I II I I 1:1 I: I I I | | | M 

HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 
70 80 90 100 110 120 



al21-l 



130 140 150 160 170 180 

ml21-l.pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLN I GGI ANISVLPPDA 

IIIIHIIIIillllllllMI I I i I I I I I I |:| | | | I I I || I! MMIIMIM 

AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGI ANISVLPPDA 
130 140 150 160 170 180 



a!21-l 



190 200 210 220 230 240 

ral21-l .pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

0 , IIIIIMMMIIII I N MM Ml ilillllMIII 

al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

!50 200 210 220 230 " 240 

250 260 270 280 290 300 

H1121-1 . pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 

I I I I I I I I I I I I I I I I | | | | | M I I I I I I II I I II II || | | | I I I I 

al21-l GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 
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310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I II I I I I I M I I II : I I I I I I I I I I I I I I : II I I : I 1 I i I II I I I I I I I I I I I I 
al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 



al21 GAGYYYX 



128 and 128-1 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3085>: 

ml28 . seq (partial) 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

2 01 GGGCGTGGTG TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG 
251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 
301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 

1 TACGCCAGCG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 

51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 

101 AAmTCAAAAA ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 

151 TGGCACAAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG G CG AAmCCAT 

201 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG 

251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 

301 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 

3 51 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 

4 01 CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 
451 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
501 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 
551 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 
601 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 
701 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
751 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
951 nGCAGAATCC TTCAAAGCCT TCCGCGGCCG CGAACCGAGC ATAGACGCAC 

1001 TCTTGCGCCA CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 3086; ORF 128>: 

ml28 .pep (partial > 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA I KAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



// 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRFSDGTL 

101 QLPTAYLVCN FAP PVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 

301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3087>: 

gl28 . seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 
401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
501 CCAAAACGTC CTAGACGCGA .CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGccaaa CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

13 01 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
14 51 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 
1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGG CAC AAATGTCCGC 
1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
16 01 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 
1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 
1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 
1851 cAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 
1901 CAGGCAAACG CTTCTGGCAA GAAAtCCttg ccgtcggcgg ctCCCGCAGC 
1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 3088; ORF 128.ng>: 

gl28 .pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA . 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

4 01 I GGVYMDL YA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 
4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
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651 AAESFKAFRG REPSIDALLR QSGFDNAA* 



ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/gl28 

10 20 30 40 50 60 

gl28 . pep M I DNALLHLGEE PRFNQ I QTEDI KPAVQT AI AE ARGQ I AAVKAQTHTGWANTVERLTGI T 

I IIIIIMIMIIhlhllllllhlMIIIII MMMMMMMM 1 1 1 i I 

ml2 8 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

gl28 . pep ERVGRI WGVVSHLNSVVDTPELRAVYNELMPE I TVFFTE IGQDI ELYNRFKTI KNS PEFA 

MIIIIMIIIIII I H I II 1 1 ! 1 1 1 1 1 1 M 1 1 1 M M I II 1 1 1 1 1 1 II 1 1 1 M I M ! 

ml28 ERVGRIWGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 

g!28 .pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

MMMMMM 

ml 2 8 TLS PAQKTKLNH 

130 

// 

340 350 360 

gl28 . pep YAGEKLREAKYAFSETEVKKYFPVGKVLAG 

MMMMMMMM MMMM 1 1 I 

ml2 8 Y AS E KLRE AKYAFS E TX VKKY F P VGX VLNG 

10 20 30 

370 380 390 400 410 420 

gl28 . pep LFAQI KKLYG I GFAEKTVPVWHKDVRYFELQQNGKT I GGVYMDLYAREGKRGG AWMNDYK 

1 1 1 1 1 1 1 M 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 IIMIhMllilMIIIIIMIIIIMMII 

ml2 8 LFAQXKKLYG I GFTE KTVP VWHKDVRYXELQQNGEX I GGV YMDLYAREGKRGGAWMWD YK 

40 50 60 70 80 90 

430 440 450 460 470 480 

gl28 - pep GRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVDELGV 

I II I h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 lllllllllllllllllllll 

ml28 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 

490 500 510 520 530 540 

gl28 . pep SGINGVEWDAVELPSQFMENFWEYNVLAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

MINI 1 1 1 1 1 M 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 MIMM MINI 1 1 lllllll Ml 

ml 2 8 SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQXGMF 

160 170 180 190 200 210 

550 560 570 580 590 600 

gl28 . pep LVRQMEFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

IN MMIMIMIhM lllllllllllllhlllllllllllll MMMIMI 

ml 2 8 XVRQXE FALFDMMI YS EDDEGRLKNWQQVLDS VRKKVAVI QPPE YNRFALS FGH I FAGGY 

220 230 240 250 260 270 

610 620 630 640 650 660 

gl28 . pep SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRGREPS 

1 1 : 1 1 1 1 1 1 1 1 M H 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMMMM! 

ml28 SAAXYS YAWAEVLS ADAYAAFEESDDVAATGKRFWQE I LAVGXSRSGAES FKAFRGRE PS 

280 290 300 310 320 330 
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670 679 
gl28 . pep IDALLRQSGFDNAAX 

lllllhlllllh 
ml2 8 IDALLRHSGFDNAVX 

340 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3089>: 

al28.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 
51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 
101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 
151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 
201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 
251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 
301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 
351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 
4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 
4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 
601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3090; ORF 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

2 51 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 I GGV YMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 ORFs 128 and 128.a showed a 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

1 " 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1, 1 1 1 1 1 M | ,, , 

al28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 HO 120 

ml28 . pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

I I I I • I I I I I I I I I I : I I I I | | | : | | | | | M | | | | | | | | | | | | | | | | | | | | I I 

al28 ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 HO. 120 

130 

ml28.pep TLS PAQKTKLNH 

III II I I I I I I 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 14 0 150 160 170 180 

ml28.pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 

ml28.pep 

al28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 2 60 270 280 290 " 300 

i9B 140 150 

mi^B .pep YASEKLREAKYAFSETXVKKYFPVGX 

al28 ARPAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKY^ 

310 3 20 330 340 350 360 

160 170 180 190 200 210 

ml28 . pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 

11111111 1 I I I I I I I II I II I I I I I I I I | | | : | | | | | | | | | | | | | | | | | | | | 

al28 VLN GL FAQ IKKL YG I G FTEKTVPVWHKDVR YFE LQQNGET I GGV YMDL YAREGKRGGAWM 

370 3 80 390 400 410 420 

22 0 230 240 250 260 270 

ml28 .pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

I I I I I I I I I I I I I I M:]IM!:IIIII| III I I I I I I I I I I I I I I I I I 

al28 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

280 290 300 310 320 330 

ml28 . pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 

11111111 M I I MINI III I III II II III MM Mm 

a128 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

340 3 50 360 370 380 390 

ml28.pep xgmfxvr Q x efalfdmmiyseddegrlknwqqvldsvrkkvaviqppeynrfalsfghif 
1,1 111 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I:: I I I I I I I I I I I I I I 
al28 rgmflvr Q mef alfdmmiyseddegrlknwqqvldsvrkevavvrppeynrfansfghif 

550 560 570 580 590 600 
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400 410 420 430 440 450 

ml28 . pep AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG 
I N I I I : I I I I I I I | | | | | | | | | || | | | | | | | | , | || , , | | | | , | m I : I I M ! I I I I 
al28 AGGYS AGYYS YAWAE VLSADAYAAFEE S DDVAATGKRFWQE I LAVGGSRS AAES FKAFRG 

610 620 630 640 650 660 



460 470 
ml 2 8 . pep- RE PS I DALLRHSG FDNAVX 

I I I I I I I I I I 1 I I I I I I : 
al28 RE P S I DALLRH S G FDN AAX 

670 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3091>- 

ml28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 

651 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCAG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 C C AAAT C AAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

14 01 AACCGGACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TT GAT AT GAT GATTTACAGG GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 

1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 3092; ORF 128-1>: 

ml28-l.pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 
51 NTVEPLTGIT ERVGRIWGVV SHLNSVADTP ELRAVYNELM PEITVFFTEI 
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101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDAL AM FAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLANALQTAK LLG FKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

4 51 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAV* 

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3093>: 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

4 01 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 C C AAAT C AAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 • TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ID 3094; ORP 128-1. ng>: 

gl28-l.pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDAL AM FAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLG FKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 



ml28-l/gl28-l ORFs 128-1 and 128-1. ng showed a 94.5% identity in 4 91 aa 
overlap 

10 20 30 40 50 60 
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gl28-l .pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 
I I I I I I I I 1 1 I I I ! : II I I I II I I 1 : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I 
ml28-l MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

gl28-l.pep ERVGRIWGVVSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

I I I I I I I I I II I I I I I : I I I I I II I I I I I ! I I I II I I I I I I I I I I I I I I I I I I I I I | I | 
ml 28-1 ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 

gl 28-1 .pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
I I I I I I I I I I : I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml28-l TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 

190 200 210 220 230 240 

gl28-l.pep FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYAGNRELREQIYRAYV 
I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I M I I I I I I I I I I I I I I 1! I I I I I I I I I I I 
ml28-l FDDAAPLAGIPEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQIYRAYV 

190 200 210 220 230 240 

250 260 270 280 290 300 

gl28-l.pep TRASELSNDGKFDNTANIDRTLENALKTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
I I I I I I I : I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I 
ml28-l TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

310 320 330 340 350 360 

gl28-l . pep ARRAKPYAEKDLAEVKAFAREHLGLADPQPWDLSYAGEKLREAKYAFSETEVKKYFPVGK 

I I I I I I I I I I I I I I I I I I I II 1:111 I I I I I : I I : I I I I I I I I II I I I I I I I I I I I I I 
ml 2 8 - 1 ARRAKPYAEKDLAEVKAFARESLNliADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

370 380 390 400 410 420 

gl28-l.pep VLAGLFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWM 

II I I I I I I M I I I I I I : I I I I I I I I I I I I I I II I I I I : I I II I I I I I I I II I I I I I I I I 
ml28-l VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

430 440 450 460 470 480 

gl28-l.pep NDYKGRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
I I I I I I I I I : I I I I I I I I I I I I I I I I II I I I : I I I I I I I 11 I I I I I I I I I I I I I I I I I I 
ml28-l NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

490 

gl28-l.pep ELGVSGINGVK 
I I I I I I I I I 1 : 

ml28-l ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3095>: 

al28-l. seq 

1 ATGACTGACA 

51 AATCAAAACC 

101 CGCGCGAACA 

151 AACACTGTCG 

201 GGGCGTGGTG 

251 CCGCCTACAA 

301 GGACAAGACA 

351 CGAGTTCGAC 

4 01 TGCGCGATTT 

4 51 GAATTGGCAA 



ACGCACTGCT CCATTTGGGC 
. GAAGACATCA AACCCGCCCT 
AATCGCCGCC ATCAAAGCCC 
AACCCCTGAC CGGCATCACC 
TCGCACCTCA ACTCCGTCAC 
TGAATTAATG CCCGAAATTA 
TCGAGCTGTA CAACCGCTTC 
ACCCTCTCCC ACGCGCAAAA 
CGTCCTCAGC GGCGCGGAAC 
AACTGCAAAC CGAAGGCGCG 



GAAGAACCCC GTTTTGATCA 
GCAAACCGCC ATTGCCGAAG 
AAACGCACAC CGGCTGGGCA 
GAACGCGTCG GCAGGATTTG 
CGACACGCCC GAACTGCGCG 
CCGTCTTCTT CACCGAAATC 
AAAACCATCA AAAACTCCCC 
AACCAAACTC AACCACGATC 
TGCCGCCCGA ACAGCAGGCA 
CAACTTTCCG CCAAATTCTC 
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501 CCAAAACGTC CTAGACGCGA 

551 CCGCACCGCT TGCCGGCATT 

601 GCCGCGCAAA GCGAAGGCAA 

651 GCACTACCTC GCCGTCATCC 

7 01 AAATCTACCG CGCCTACGTT 

751 AAATTCGACA ACACCGCCAA 

801 AACCGCCAAA CTGCTCGGCT 

851 CCAAAATGGC GGACACCCCC 

901 GCCCGCCGCG CCAAACCCTA 

951 CTTCGCCCGC GAAAGCCTCG 

1001 GCTACGCCGG CGAAAAACTG 

1051 GAAGTCAAAA AATACTTCCC 

1101 CCAAATCAAA AAACTCTACG 

1151 TCTGGCACAA AGAGGTGCGC 

1201 ATAGGCGGCG TTTATATGGA 

1251 CGCGTGGATG AACGACTACA 

1301 TGCAACTGCC CACCGCCTAC 

1351 GGCAAAGAAG CCCGCTTGAG 

14 01 AACCGGACAC GGCCTGCACC 

14 51 TATCCGGCAT CAACGGCGTA 

1501 TTTATGGAAA ATTTCGTTTG 

1551 CCACGAAGAA ACCGGCGTTC 

1601 TCGCCGCCAA AAACTTCCAA 

1651 TTCGCCCTCT TTGATATGAT 

17 01 GAAAAACTGG CAACAGGTTT 

17 51 TCCGACCGCC CGAATACAAC 

1801 GCAGGCGGCT ATTCCGCAGG 

1851 GAGCGCGGAC GCATACGCCG 

1901 CAGGCAAACG CTTTTGGCAG 

1951 GCGG CAGAAT CCTTCAAAGC 

2001 ACTCTTGCGC CACAGCGGCT 



CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 
AACAGGCTAC AAAATCGGTT TGCAGATTCC 
AATACGCCGA CAACCGCAAA CTGCGCGAAC 
ACCCGCGCCA GCGAGCTTTC AGACGACGGC 
CATCGACCGC ACGCTCGAAA ACGCCCTGCA 
TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
GAACAAGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
GCCTCGCCGA TTTGCAACCG TGGGACTTGG 
CGCGAAGCCA AATACGCATT CAGCGAAACC 
CGTCGGCAAA GTATTAAACG GACTGTTCGC 
GCATCGGATT TACCGAAAAA ACCGTCCCCG 
TATTTTGAAT TGCAACAAAA CGGCGAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGTTTTTCA GACGGCACGC 
CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 
CCATGACGAA ATCCTCACCC TCTTCCACGA 
ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 
GAATGGGACG CAGTCGAACT GCCCAGTCAG 
GGAATACAAT GTCTTGGCGC AAATGTCCGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGAATGT TCCTCGTCCG CCAAATGGAG 
GATTTACAGC GAAGACGACG AAGGCCGTCT 
TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 
CGCTTCGCCA ACAGCTTCGG CCACATCTTC 
CTATTACAGC TACGCGTGGG CGGAAGTATT 
CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
CTTCCGCGGA CGCGAACCGA GCATAGACGC 
TCGACAACGC GGCTTGA 



This corresponds to the amino acid sequence <SEQ ID 3096; ORF 128-l.a>: 

al28-l.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDAL AM FAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET ' 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 



ml28-l/al28-l ORFs 128-1 and 128-1. a showed a 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

al28-l.pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I i I I I I I I I I I I I I I I I I I 1 I I I I I I 
ml28-l MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 • 20 30 40 50 60 

70 80 90 100 110 120 

al2 8-l.pep ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I I I I I I I I I I I : I I I i I I I : I I I I I I ri I 1 II I I I I I I I I I I I I I I I 1 I I I 1 I I I 
ml28-l ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 



130 140 150 160 170 180 
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al28-l - pep TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 11 I I I I I I I I I I I | | | | | 
ml28-l TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 

190 200 210 220 230 240 

al28-l -pep FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 
I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I | I I 
ml28-l FDDAAPLAGIPEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQIYRAYV 

190 200 210 220 230 240 

250 260 270 280 290 300 

al28-l • pep TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I ! I I I I I I I I I I I I I I I I I I I I I I I 
ml28-l TRASELS DDGKFDNTAN I DRTLANALQTAKLLG FKN YAELS LATKMADT PEQVLN FLHDL 

250 260 270 280 290 300 

310 320 330 340 350 360 

al28-l . pep ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
I I I i I I I I I I I I I I I I I I I I M I :l I 1 I I I I I I I I I : I I 1 I I I I I I I I I I I I I ! I II I I I 
ml28-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

370 380 390 400 410 420 

al28-l . pep VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
I I I I I I I I M I I I I I I I I I I I I I I I I I I I | I I I II || | | | | | | | | | | | | | j | 1 | | | | | | | 
ml28-l VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 



430 440 450 460 470 480 

al28-l . pep NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
I II I I H I I I I I II I I II I I I 1111:11 I I 1:1 I II I I I i I I I I I I I I I 1 I I i I I I I II 
ml28-l NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

490 500 510 520 530 540 

al28-l . pep ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
I I I f I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I 1 I I I I I I I I I I | | | I | | | I I I I I I 
m!28-l ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

550 560 570 580 590 600 

al28-l . pep RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF 
I M I I I i I M I I I I IMM I I I I Ml I I I I II INI I i I: II I:: M I I I I | I |||||| 
ml 2 8 - 1 RGMFLVRQME FALFDMMI YSEDDEGRLKNWQQVLDSVRKKVAVIQPPE YNRFALS FGHI F 

550 560 570 580 590 600 

610 620 630 640 650 660 

al28-l .pep AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
I I I I I I I I I I ! I I I I I I M I I I I I I I I I I I I I I I I II M I I I I i I I l.l M I I I I I I I I I I 
ml28-l AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 

610 620 630 640 650 660 



670 679 
al28-l.pep RE PS I DALLRHSG FDNAAX 

I I I I I II I I I I II I I I I : 
ml 28-1 REPSI DALLRHSGFDNAVX 

670 



206 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3097>: 

m206 . seq 
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1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

3 01 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 



This corresponds to the amino acid sequence <SEQ ID 3098; ORF 206>: 

m2 06 .pep . . 



1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3099>: 



1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggcttcgac tgcagcggca 

251 tgatztcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa 1 atccccgaca gccgcctcaa 

3 51 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 

401 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 

451 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 

501 ctaccttgga gcgcatacgt tttttacaga atga 



This corresponds to the amino acid sequence <SEQ ID 3100; ORF 206.ng>: 

g2 06.pep 



1 MFSPDKTLFL CLGALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIGRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 

101 ARDMAAASRK I PDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 



ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from N. gonorrhoeae: 

m206/g206 



g206 . seg 



m206 -pep 




10 20 30 40 50 60 

MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 



g206 



10 20 30 40 50 60 



m206 .pep 



70 80 90 100 110 120 

LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 



g206 




80 90 100 110 120 



m206 .pep 



130 140 150 160 170 

LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 



g206 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3 101>: 

a206.seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

4 51 GGCAAAACCA TCAAAACCGA • AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3102; ORF 206.a>: 

a206.pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 ORFs 206 and 206.a showed a 99.4% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m206.pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

1 M Mill I! IIIIIMMMMMII I I I IMIMIII II III I I MMMII I 

a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

10 20 30 40 50 ^ 60 

70 80 90 100 110 120 

m206.pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 

III M Mi :l I Mill II M MM M I II Ml MIIMM I I III II 1 MM 

a2 0 6 LGLIGTPYKWGGS STATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKI PDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

m206.pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | M I I I I 
a206 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

130 140 150 160 170 



287 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3 103>: 

m287.seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCG.GTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGGCAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

201 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GG AAAATCAA GCAACGGATG 

4 01 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 

4 51 GACGGAATGC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA ATGCCGGCAA 

501 TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 

551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC 
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601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC 

651 GCAAAATATA ACGTTGACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA 

7 01 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT 

7 51 GAT GC AG AC A AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT 

801 TGTCGGTTTG GTTGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA 

851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT 

901 GCACGGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA 

951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC 

1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC 

1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTCAAGGCGA 

1101 ACCGGCAAAA GGCGAAATGC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG 

1151 TACTGCATTT CCATACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG 

1201 TTTGCCGCAA AAGTCGATTT CGGCAGCAAA TCTGTGGACG GCATTATCGA 

1251 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAATTCAAA GCCGCCATCG 

1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT 

1351 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG 

1401 CTATCGCCCG ACAGATGCGG AAAAGGGCGG ATTCGGCGTG TTTGCCGGCA 

14 51 AAAAAGAGCA GGATTGA 

This corresponds to the amino acid sequence <SEQ ID 3104; ORF 287>: 

m287 .pep 

1 MFKRSVIAMA CIFALSA CGG GGGGSPDVKS ADTLSKPAAP WSEKETEAK 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

401 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 

451 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3105>: 

g287 . seq 

1 atgtttaaac gcagtgtgat tgcaatggct tgtatttttc ccctttcagc 

51 ctgtgggggc ggcggtggcg gatcgcccga tgtcaagtcg gcggacacgc 

101 cgtcaaaacc ggccgccccc gttgttgctg aaaatgccgg ggaaggggtg 

151 ctgccgaaag aaaagaaaga tgaggaggca gcgggcggtg cgccgcaagc 

201 cgatacgcag gacgcaaccg ccggagaagg cagccaagat atggcggcag 

251 tttcggcaga aaatacaggc aatggcggtg cggcaacaac ggacaacccc 

301 aaaaatgaag acgcgggggc gcaaaatgat atgccgcaaa atgccgccga 

351 atccgcaaat caaacaggga acaaccaacc cgccggttct tcagattccg 

401 cccccgcgtc aaaccctgcc cctgcgaatg gcggtagcga ttttggaagg 

451 acgaacgtgg gcaattctgt tgtgattgac ggaccgtcgc aaaatataac 

501 gttgacccac tgtaaaggcg attcttgtaa tggtgataat ttattggatg 

551 aagaagcacc gtcaaaatca gaatttgaaa aattaagtga tgaagaaaaa 

601 attaagcgat ataaaaaaga cgagcaacgg gagaattttg tcggtttggt 

651 tgctgacagg gtaaaaaagg atggaactaa caaatatatc atcttctata 

701 cggacaaacc acctactcgt tctgcacggt cgaggaggtc gcttccggcc 

751 gagattccgc tgattcccgt caatcaggcc gatacgctga ttgtggatgg 

801 ggaagcggtc agcctgacgg ggcattccgg caatatcttc gcgcccgaag 

851 ggaattaccg gtatctgact tacggggcgg aaaaattgcc cggcggatcg 

901 tatgccctcc gtgtgcaagg cgaaccggca aaaggcgaaa tgcttgttgg 

951 cacggccgtg tacaacggcg aagtgctgca tttccatatg gaaaacggcc 

1001 gtccgtaccc gtccggaggc aggtttgccg caaaagtcga tttcggcagc 

1051 aaatctgtgg acggcattat cgacagcggc gatgatttgc atatgggtac 

1101 gcaaaaattc aaagccgcca tcgatggaaa cggctttaag gggacttgga 
1151 cggaaaatgg cggcggggat gtttccggaa ggttttacgg cccggccggc 

1201 gaggaagtgg cgggaaaata cagctatcgc ccgacagatg ctgaaaaggg 

1251 cggattcggc gtgtttgccg gcaaaaaaga tcgggattga 

This corresponds to the amino acid sequence <SEQ ID 3106; ORF 287.ng>: 

g287.pep 

1 MFKRSVIAMA CIFPLSACGG GGGGSPDVKS ADTPSKPAAP VVAENAGEGV 
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51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP 

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR 

151 TNVGNSVVID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLSDEEK 

201 IKRYKKDEQR ENFVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA 

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS 

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS 

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG 

401 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD* 



m287/g287 ORFs 287 and 287.ng showed a 70.1% identity in 499 aa overlap 

10 2-0 30 4 0 4 9 

m2 87 . pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSE KETEA 

I I I I j I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I E I : I I : | | 

g287 MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 



50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 

I I I I : I I : : : I I I I I I I I I I I I I I I I I : I : I I I I I I I I I I I I I I I I I 
g287 AGGAPQADTQD — ATAGEGSQDMAAVSAENTGNGGAATTDN PKNEDAGAQNDMPQNAA-- 

70 80 90 100 110 

110 120 130 140 150 160 169 

m2 87 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 

g287 



170 180 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 
: : I I I : I I I I Mill I M I I I I I I I I I : I I I : : : : I : I : I I I I I I I I I I I I | | | j| 
g287 -ESANQTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDS 
120 130 140 150 160 170 



230 240 250 260 270 280 289 

m287.pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

I : I : I : I I I I : I I I I II I I I : I I : MM : :: I I I I I I I I : I I : I I I M 
g287 CNGDNLLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTD 
180 190 200 210 220 230 



290 300 310 320 330 340 349 

m287.pep KPTSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 
II : M M M M I M I : M M M M M M M II M M II M M I M I M II M M 

g287 KPPT RSARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

240 250 260 270 280 290 



350 360 370 380 390 400 409 

m287 . pep YGAEKLPGGS YALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGS 

I M I I I M I I M I I I I I I I I I I I I I: I : I I I I I I I I I I I I I I II II : I II II I II II I 
g287 YGAEKLPGGS YALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS 

300 310 320 330 340 350 



410 420 430 440 450 4 60 469 

m287 . pep KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 
II II I M II II I II I I II II I I II I I I I I 1 I I I I I II : I I II I : I I II M I II I! I I M I 
g287 KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 

360 370 380 390 400 410 



m287.pep 



470 480 489 

PTDAEKGGFGVFAGKKEQDX 



WO 99/57280 



PCT/US99/09346 



107 



g287 



I I I I I I I I 1 I I I I I M :: I I 
PTDAEKGGFGVFAGKKDRDX 
420 430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3107>: 



a287 . seq 




1 


Al bl I 1 AAAL 


01 


L 1 (jl GbbbGL 


lux 




1 CI 

Id 1 


L 1 (jLCbAAAb 


oni 
ZUl 


L-(j A 1 AC(jC Ab 


Z J JL 


111 LbbLAbA 


jUJ. 


paaaataaap 


"3 S1 


T APAPATAPT 
i. n^nun 1 rib 1 


401 


P AP AT ATPPP 


4 51 


A APPA APPPP 


SOI 

Jul 


PTPPPPAPPP 


SSI 

wJ w> J- 


PTPA AAAPA A 


QUI 


PCTAAPPPPA 


fiST 

OJi 


TPPPATPAAP 




A APAPAA APT 


7 SI 


TPAPAATTTP 




APAPPAPP A A 


fl SI 


AP A ATPP A ZiP 


901 


TCTTCATCTG 


951 


GGCCGAGATG 


1001 


ATGGGGAAGC 


1051 


GAAGGGAATT 


1101 


ATCGTATGCC 


1151 


CGGGCACGGC 


1201 


GGCCGTCCGT 


1251 


CAGCAAATCT 


1301 


GTACGCAAAA 


1351 


TGGACGGAAA 


1401 


CGGCGAAGAA 


1451 


AGGGCGGATT 



1 TGCAATGGCT 
I GATCGCCCGA 
, GTTGTTACTG 
i TGAGGAGGCG 
I CCGGAAAAGG 
: AATGGCGGTG 
: GCAAAATGAT 
k ATCACACCCC 
■ .CCGGATGCCG 
. TGCGGCGGAC 
GCAATACGGC 
GGCTCTCAAA 
CAGCGATTTT 
GTTCGGAAAA 
GATTTCTTAG 
TGATGAAGAA 
TTGTCGGTTT 
GTCATCATTT 
GCGTTCTGCA 
CCGTCAATCA 
ACGGGGCATT 
GACTTACGGG 
AAGGCGAACC 
GGCGAAGTGC 
AGGCAGGTTT 
TTATCGACAG 
GTTATCGATG 
GGATGTTTCC 
AATACAGCTA 
GCCGGCAAAA 



TGTATTGTTG 
TGTTAAGTCG 
AAGATGTCGG 
GTGAGTGGTG 
CGGTCAAGAT 
CGGCAACAAC 
ATGCCGCAAA 
TGCACCGAAT 
GGGAATCGGC 
GGAATGCAGG 
AGATCAAGCT 
ATCCTGCCTC 
GGAAGGATAA 
TGTAACGTTG 
ATGAAGAAGC 
AAAATTAATA 
GGTTGCTGAC 
ATAAAGACAA 
CGGTCGAGGC 
GGCGGATACG 
CCGGCAATAT 
GCGGAAAAAT 
GGCAAAAGGC 
TGCATTTCCA 
GCCGCAAAAG 
CGGCGATGAT 
GAAACGGCTT 
GGAAGGTTTT 
TCGCCCGACA 
AAGAGCAGGA 



CCCTTTCAGC 
GCGGACACGC 
GGAAGAGGTG 
CGCCGCAAGC 
ATGGCGGCAG 
GGATAATCCC 
ATGCCGCCGA 
ATGCCAACCA 
ACAACCGGCA 
GGGACGATCC 
GCAAATCAAG 
TTCAACCAAT 
ATGTAGCTAA 
ACACATTGTA 
ACCACCAAAA 
AATATAAAAA 
AGGGTAGAAA 
GTCCGCTTCA 
GGTCGCTTCC 
CTGATTGTCG 
CTTCGCGCCG 
TGTCCGGCGG 
GAAATGCTTG 
TATGGAAAAC 
TCGATTTCGG 
TTGCATATGG 
TAAGGGGACT 
ACGGCCCGGC 
GATGCGGAAA 
TTGA 



This corresponds to the amino acid sequence <SEQ ID 3108; ORF 287.a>: 

a287 .pep 

1 MFKRSVIAMA CIVALSA CGG GGGGSPDVKS ADTLSKPAAP VVTEDVGEEV 

51 LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 

101 ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 

151 NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNPASSTN 

201 PNATNGGSDF GRINVANGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 

251 SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS 

301 SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 

351 EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN 

4 01 GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT 

451 WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 

m287/a287 ORFs 287 and 287. a showed a 77.2% identity in 501 aa overlap 



m287.pep 
a287 

m287.pep 
a287 



10 20 30 40 49 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSE KETEA 

I I I I I ! I I I I I I I I I I I I I I I II I I I I | | I || | | | | | | | | I : | | : || 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 
MM : I I : : : I : I I I I I I I I I I I I I I I : I : I M : I : I | MINIMI I 
VSGAPQADTQ— DATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADT 
70 80 90 100 110 
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110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 

Nil I I I : : I III I I I I I : I I N I I I I I I ! I I I I I I | I I I I I : I I I I I I 

a287 DSSTPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAG-ENAGNTA 
120 130 140 150 160 170 

170 180 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 

1*1111 I I I : : I I : : I : : | | : I I I I : I I I : : : I | I : : | : I : | : | | I I II 
a287 DQAANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKV 
180 190 200 210 220 230 

230 240 250 260 270 280 289 

m287 .pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKEVGLVADSVQMKGINQYI I FYKP 

I : MINI: I I II I I I I I : I I : : | | | | : :: | | I I I I I I : : I I : I : I : I I 
a287 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKD 
240 250 260 270 280 290 

290 300 310 320 330 340 

m287 . pep KP- -TS FARFRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGN I FAPEGNYRY 

I • I N II II II I I I I I | | | | | | | | |j | | | | | | M I I I I I I I | | | | M I I I I I I I I I 
a287 KSASSSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRY 

300 310 320 330 340 350 

350 360 370 380 390 400 

m287 . pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

IN MMM M I I I I I I I i I I I : I I I I I ! I I i II I I I I I I :■ I I I I I I I I 1 

a287 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
360 370 380 390 400 410 

410 420 430 440 450 460 

m287.pep GSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS 

I I M M M I I M ! t I M ! ! t I 1 I I : | | | M I M I I I I M : I I I I j : MINI 

a2 8 7 GSKSVDG 1 1 DSGDDLHMGTQKFKAVI DGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS 

420 430 440 450 460 470 



470 480 489 

m287 . pep YRPTDAEKGGFGVFAGKKEQDX 

I I I I II I I I I I I I I I I I I II I I 
a 2 8 7 YR PT DAE KGG FG V FAGKKEQDX 

480 490 



406 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3109>: 

xn406 . seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TTGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

2 01 CACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 
251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

3 01 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 
351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

4 01 CTCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 
451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 
501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 
551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 
601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 
651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
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701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACGGAAG GATTAATGGT CGATTTCTCC GATATCCGAC 

8 51 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GTAGTGCGAC AACATAGACA 

951 AGGACAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 31 10; ORF 406>: 

m406 .pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINS PAVRT ■ 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVS KG IK PTEGLMVDFS DIRPYGNHTG NSAPSVEADN 

3 01 SHEGYGYSDE WRQHRQGQP * 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31 1 1>: 

g406 . seq 

1 ATGCGGGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGCAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

2 01 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 
251 TTGATGCACT GATTCGCGGC GAATACATAA ACAGCCCTGC CGTCCGCACC 

3 01 GATTACACCT ATCCGCGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 
351 TTTGACGGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 
401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA GGAGCAGTCT GGGC TTAAAT 
451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CCAACCCGCG 
501 CGACACTGCC TTTCTTTCCC ACTTGGTGCA GACCGTATTT TTCCTGCGCG 
551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 
601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 
651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
701 GAACCAATAA AAAATTGCTC ATCAAACCCA AAACCAATGC GTTTGAAGCT 
751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 
801 AGGAATCAAA CCGACGGAAG GATTGATGGT CGATTTCTCC GATATCCAAC 
851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 
901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC AACATAGACA 
951 AGGGCAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 31 12; ORF 406>: 

g406.pep 

1 MRARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINS PAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSRSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVS KG IK PTEGLMVDFS DIQPYGNHTG NSAPSVEADN 

301 SHEGYGYSDE AVRQHRQGQP * 



ORF 406 shows 98.8% identity over a 320 aa overlap with a predicted ORF (ORF406.a) from 
N. gonorrhoeae: 

g406/m406 

10 20 30 40 50 60 

g4 06 . pep MRARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

IHIIIIMIIIIMIIIMIIIIMIMIIIIMMIIIIIMIIMilllMIMIII 

m4 0 6 MQARLLI PI LFSVFI LSACGTLTGI PSHGGGKRFAVEQELVAAS ARAAVKDMDLQALHGR 
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10 20 30 40 50 60 

70 80 90 100 HO 120 

g4 06 . pep KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

I M I! 1 1 M 1 1 M II 1 1 II II 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 M 1 1 1 1 1 ! M M 1 1 i ; 

m4 06 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 

130 140 150 160 170 180 

g4 0 6 . pep LTTSLSTLNAPALSRTQSDGSGSRSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

M M I ! 1 1 1 1 1 1 1 1 1 1 II 1 , 1 1 hi Ml IN 1 1 M 1 1 II 1 1 1 II I M 1 1 1 1 ! 1 1 1 , 1 1 1 

m4 06 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 

190 200 210 220 230 240 

g4 06 . pep FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

I I M II MMI II IM Mill IMIMI II III M MIIMIMI II II I IM I M 1 1 M 

m406 FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

g4 06 . pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHTGNSAPSVEADN 

I M I M IMM 1 1 1 II IMIMMMII IMMMMIMI MIMI Ml MM II MM 

m406 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 

250 260 270 280 290 300 

310 320 
g4 0 6 . pep SHEGYGYSDEAVRQHRQGQPX 

lllllllllhlMIIIIIII 

m4 0 6 SHEGYGYSDEWRQHRQGQPX 

310 320 



The following partial DNA sequence 



a406. seq 






1 


ATGCAAGCAC 


GGCTGCTGAT 


51 


CGCCTGCGGG 


ACACTGACAG 


101 


TCGCGGTCGA 


ACAAGAACTT 


151 


GACATGGATT 


TACAGGCATT 


201 


AACTATGGGC 


GACCAAGGTT 


251 


TTGATGCACT 


GATTCGTGGC 


301 


GATTACACCT 


ATCCACGTTA 


351 


TTTGACAGGT 


TTAACCACTT 


401 


CGCGCACCCA 


ATCAGACGGT 


451 


ATTGGCGGGA 


TGGGGGATTA 


501 


CGACACTGCC 


TTTCTTTCCC 


551 


GCATAGACGT 


TGTTTCTCCT 


601 


ATCGACGTAT 


TCGGAACGAT 


651 


TGCCGAAACA 


CTGAAAGCCC 


701 


GAACCAATAA 


AAAATTGCTC 


751 


GCCTATAAAG 


AAAATTACGC 


801 


AGGAATTAAA 


CCGACAGAAG 


851 


CATACGGCAA 


TCATATGGGT 


901 


AGTCATGAGG 


GGTATGGATA 


951 


AGGGCAACCT 


TGA 



i identified in N. meningitidis <SEQ ID 31 13>: 

ACCTATTCTT TTTTCAGTTT TTATTTTATC 
GTATTCCATC GCATGGCGGA GGTAAACGCT 
GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 
ACACGGACGA AAAGTTGCAT TGTACATTGC 
CAGGCAGTTT GACAGGGGGT CGCTACTCCA 
GAATACATAA ACAGCCCTGC CGTCCGTACC 
CGAAACCACC GCTGAAACAA CATCAGGCGG 
CTTTATCTAC ACTTAATGCC CCTGCACTCT 
AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 
TCGAAATGAA ACCTTGACGA CTAACCCGCG 
ACTTGGTACA GACCGTATTT TTCCTGCGCG 
GCCAATGCCG ATACGGATGT GTTTATTAAC 
ACGCAACAGA AC CG AAATGC ACCTATACAA 
AAACAAAACT GGAATATTTC GCAGTAGACA 
ATCAAACCAA AAACCAATGC GTTTGAAGCT 
ATTGTGGATG GGACCGTATA AAGTAAGCAA 
GATTAATGGT CGATTTCTCC GATATCCAAC 
AACTCTGCCC CATCCGTAGA GGCTGATAAC 
CAGCGATGAA GCAGTGCGAC GACATAGACA 



This corresponds to the amino acid sequence <SEQ ID 31 14; ORF 406.a>: 

a406.pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 
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251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHMG NSAPSVEADN 
301 SHEGYGYSDE AVRRHRQGQP * 

m406/a406 ORFs 406 and 406. a showed a 98.8% identity in 320 aa overlap 

10 20 30 40 50 60 

m4 06.pep MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAAS7UIAAVKDMDLQALHGR 
I I I I 1 i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I II I 
a406 MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARMVKDMDLQALHGR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m406.pep KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I II I I I I I I 
a4 06 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m4 06.pep LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 
I I I I I I I I I I I I I I II I I I I 1 I I I I I I I I I I I I I I I I I M I I I I I I I I I I 1 I I M I I I ! I 
a4 06 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m406.pep FLRGI DVVS PANADTDVFIN I DVFGT I RNRTEMHLYN AETLKAQTKLE Y FAVDRTNKKLL 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
a4 06 FLRGI DVVS PANADTDVFIN I DVFGT IRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m406 . pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I II I I I 
a406 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHMGNSAPSVEADN 

250 260 270 280 290 300 

310 320 
m4 0 6 . pep SHEGYGYSDEWRQHRQGQPX 
I I I I I I I I I I : I I : I I II I I I 
a 4 0 6 SHEGYGYSDEAVRRHRQGQPX 

310 320 



EXAMPLE 2 
Expression of ORF 919 
The primer described in Table 1 for ORF 919 was used to locate and clone ORF 919. 
The predicted gene 919 was cloned in pET vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 919-His fusion protein purification. Mice were immunized with the purified 919- 
His and sera were used for Western blot (panel B), FACS analysis (panel C), bactericidal 
assay (panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; PP, 
purified protein, TP, N. meningitidis total protein extract; OMV, K meningitidis outer 
membrane vesicle preparation. Arrows indicate the position of the main recombinant protein 
product (A) and the N. meningitidis immunoreactive band (B). These experiments confirm 
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that 919 is a surface-exposed protein and that it is a useful immunogen. The hydrophilicity 
plots, antigenic index, and amphipatic regions of ORF 919 are provided in Figure 10. The 
AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 
143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, 
Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 919 and the amino acid 
sequence encoded thereby is provided in Example 1 . 

EXAMPLE 3 
Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
The predicted gene 279 was cloned in pGex vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 279-GST purification. Mice were immunized with the purified 279-GST and sera 
were used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay 
(panel D), and ELISA assay (panel E). Symbols: Ml , molecular weight marker; TP, N. 
meningitidis total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. 
Arrows indicate the position of the main recombinant protein product (A) and the N. 
meningitidis immunoreactive band (B). These experiments confirm that 279 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 279 are provided in Figure 1 1 . The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 279 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 4 
Expression of ORF 576 and 576-1 
The primer described in Table 1 for ORF 576 was used to locate and clone ORF 576. 
The predicted gene 576 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 576- 
GST fusion protein purification. Mice were immunized with the purified 576-GST and sera 
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were used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D) 5 
and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis 
total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. Arrows 
indicate the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B).. These experiments confirm that ORF 576 is a surface-exposed 
protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and 
amphipatic regions of ORF 576 are provided in Figure 12. The AMPHI program is used to 
predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 576 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 5 
Expression of ORF 519 and 519-1 
The primer described in Table 1 for ORF 519 was used to locate and clone ORF 519. 
The predicted gene 519 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 519- 
His fusion protein purification. Mice were immunized with the purified 519-His and sera were 
used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), and 
ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis total 
protein extract; OMV, N. meningitidis outer membrane vesicle preparation. Arrows indicate 
the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B). These experiments confirm that 519 is a surface-exposed protein 
and that it is a usefiil immunogen. The hydrophilicity plots, antigenic index, and amphipatic 
regions of ORF 519 are provided in Figure 13. The AMPHI program is used to predict 
putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, AIDS Res 
Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1 :9). The nucleic 
acid sequence of ORF 519 and the amino acid sequence encoded thereby is provided in 
Example 1 . 
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EXAMPLE 6 
Expression of ORF 121 and 121-1 
The primer described in Table 1 for ORF 121 was used to locate and clone ORF 121. 
The predicted gene 121 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 121- 
His fusion protein purification. Mice were immunized with the purified 121 -His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 121 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 121 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 121 are provided in 
Figure 14. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 121 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 7 
Expression of ORF 128 and 128-1 
The primer described in Table 1 for ORF 128 was used to locate and clone ORF 128. 
The predicted gene 128 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 128- 
His purification. Mice were immunized with the purified 128-His and sera were used for 
Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel D) and 
ELISA assay (panel E). Results show that 128 is a surface-exposed protein. Symbols: Ml, 
molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. meningitidis 
outer membrane vesicle preparation. Arrows indicate the position of the main recombinant 
protein product (A) and the K meningitidis immunoreactive band (B). These experiments 
confirm that 128 is a surface-exposed protein and that it is a useful immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 128 are provided in 
Figure 15. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
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Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1:9). The nucleic acid sequence of ORF 128 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 8 
Expression of ORF 206 
The primer described in Table 1 for ORF 206 was used to locate and clone ORF 206. 
The predicted gene 206 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 206- 
His purification. Mice were immunized with the purified 206-His and sera were used for 
Western blot analysis (panel B). It is worthnoting that the immunoreactive band in protein 
extracts from meningococcus is 38 kDa instead of 17 kDa (panel A). To gain information on 
the nature of this antibody staining we expressed ORF 206 in E. coli without the His-tag and 
including the predicted leader peptide. Western blot analysis on total protein extracts from E. 
coli expressing this native form of the 206 protein showed a recative band at a position of 38 
kDa, as observed in meningococcus. We conclude that the 38 kDa band in panel B) is specific 
and that anti-206 antibodies, likely recognize a multimeric protein complex. In panel C is 
shown the FACS analysis, in panel D the bactericidal assay, and in panel E) the ELISA assay. 
Results show that 206 is a surface-exposed protein. Symbols: Ml, molecular weight marker; 
TP, N. meningitidis total protein extract; OMV, N. meningitidis outer membrane vesicle 
preparation. Arrows indicate the position of the main recombinant protein product (A) and the 
N. meningitidis immunoreactive band (B). These experiments confirm that 206 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 5 19 are provided in Figure 16. The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDSResHuman Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 11:9). 
The nucleic acid sequence of ORF 206 and the amino acid sequence encoded thereby is 
provided in Example 1. 
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EXAMPLE 9 
Expression of ORE 287 
The primer described in Table 1 for ORF 287 was used to locate and clone ORF 287. 
The predicted gene 287 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 287- 
GST fusion protein purification. Mice were immunized with the purified 287-GST and sera 
were used for FACS analysis (panel B), bactericidal assay (panel C), and ELISA assay (panel 
D). Results show that 287 is a surface-exposed protein. Symbols: Ml, molecular weight 
marker. Arrow indicates the position of the main recombinant protein product (A). These 
experiments confirm that 287 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 287 are provided in 
Figure 17. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1 992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 287 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 10 
Expression of ORF 406 
The primer described in Table 1 for ORF 406 was used to locate and clone ORF 406. 
The predicted gene 406 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 406- 
His fusion protein purification. Mice were immunized with the purified 406-His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 406 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, TV. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 406 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 406 are provided in 
Figure 18. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1 992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 406 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 11 



Table 2 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 225 among different strains. 
Table 2 



225 gene variability: List of used Neisseria strains 


Identification Strains 


Source / reference 


number 




Group B 




zoOl 225 NG6/88 


R. Moxon / Seiler et al., 1996 


zo02 225 BZ198 


R. Moxon / Seiler et al, 1996 


zo03 225 NG3/88 


R. Moxon / Seiler et al, 1996 


zo04_225 297-0 


R. Moxon / Seiler et al, 1996 


zo05 225 1000 


R. Moxon / Seller et al, 1996 


zo06 225 BZ147 


R. Moxon / Seiler et al , 1996 


zo07 225 BZ169 


R. Moxon / Seiler et al, 1996 


zo08 225 528 


R. Moxon / Seiler et al, 1996 


zo09_225 NGP165 


R. Moxon / Seiler et al, 1996 


zolO 225 BZ133 


R. Moxon / Seiler et al, 1996 


zoll 225 NGE31 


R. Moxon / Seiler et al, 1996 


zol2 225 NGF26 


R. Moxon / Seiler et al, 1996 


zol3 225 NGE28 


R. Moxon / Seiler et al, 1996 


zol4 225 NGH38 


R. Moxon / Seiler et al, 1996 


zol5 225 SWZ107 


R. Moxon / Seiler et al, 1 996 


zol6 225 NGH15 


R. Moxon / Seiler et al, 1996 


zol7 225 NGH36 


R. Moxon / Seiler et al, 1996 


zol8 225 BZ232 


R. Moxon / Seiler et al, 1996 


zol9 225 BZ83 


R. Moxon / Seiler et al, 1996 


zo20 225 44/76 


R. Moxon / Seiler et al, 1996 


zo21 225 MC58 


R. Moxon 


zo96_225 2996 


Our collection 


Group A 




zo22 225 205900 


R. Moxon 


zo23 225 F6124 


R. Moxon 


z2491 Z2491 


R. Moxon / Maiden et al, 1998 


Group C 




zo24 225 90/18311 


R. Moxon 


zo25_225 93/4286 


R. Moxon 
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Others 

zo26_225 A22 (group W) R. Moxon / Maiden et al., 1998 
zo27_225 E26 (group X) R. Moxon / Maiden et al, 1998 
zo28_225 860800 (group Y) R. Moxon / Maiden et al, 1998 
zo29_225 E32 (group Z) R. Moxon / Maiden et al, 1998 

Gonococcus 

zo32_225 Ng F62 R. Moxon / Maiden et al, 1998 

zo33J225 Ng SN4 R. Moxon 

fal090 FA1090 R. Moxon 



References: 

Seiler A. et al, Mol. Microbiol., 1996, 19(4): 84 1-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 

>FA1090 <SEQ ID 3115> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z2491 <SEQ ID 3116> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

ZO01_225 <SEQ ID 3117> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO02_225 <SEQ ID 3118> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO03_225 <SEQ ID 3119> 

MDSFFKPAVWAVLWLMFAVRLALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 
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ZO04_225 <SEQ ID 3120> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPV'LPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO05_225 <SEQ ID 3121> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO06_225 <SEQ ID 3122> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO07_225 <SEQ ID 3123> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO08_225 <SEQ ID 3124> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

ZO09_225 <SEQ ID 3125> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO10_225 <SEQ ID 3126> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

ZOU_225 <SEQ ID 3127> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQ I LRQFAE DEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z012_225 <SEQ ID 3128> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z013_225 <SEQ ID 3129> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFIQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z014_225 <SEQ ID 3130> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z015_225 <SEQ ID 3131> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z016_225 <SEQ ID 3132> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z017_225 <SEQ ID 3133> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
N ADE L I G S AMG LNE Q P VL PVNR V P ARRAGN ADEL I GNAMG LNE QPVL P VNRAPARRAGN A 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z018_225 <SEQ ID 3134> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z019_225 <SEQ ID 3135> 

MDSFFKPAWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO20_225 <SEQ ID 3136> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPINRAPARRAGNADELIGSAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z021_225 <SEQ ID 3137> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z022_225 <SEQ ID 3138> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
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SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z023_225 <SEQ ID 3139> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z024_225 <SEQ ID 3140> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAE DEQPVL P INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKN IEI TSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

Z025_225 <SEQ ID 3141> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRI SHVGLY I GNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z026_225 <SEQ ID 3142> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKNIE I TSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z027_225 <SEQ ID 3143> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKN IE ITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z028_225 <SEQ ID 3144> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKNIE ITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z029_225 <SEQ ID 3145> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKNIE ITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z032_225 <SEQ ID 3146> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z033_225 <SEQ ID 3147> 

MDS FFKPAVWAVLWLMFAVRSALADELTNLLSSREQI LRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRI SHVGLY IGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRIKKNDPSRFLN* 

Z096 225 <SEQ ID 3148> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 



Figure 19 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 225, nirther confirming its utility as an antigen for both 



Table 3 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 235 among different strains. 
Table 3 

235 gene variability: List of used Neisseria strains " 1 



vaccines and diagnostics. 



EXAMPLE 12 



Identification Strains 
number 

Group B 



Reference 



gnmzqOl NG6/88 

gnmzq02 BZ198 

gnmzq03 NG3/88 

gnmzq04 1000 



Seiler al, 1996 
Seiler etal, 1996 
Seiler et al, 1996 
Seiler etal, 1996 
Seiler et al, 1996 
Seiler et al, 1996 
Seiler et al, 1996 
Seiler etal, 1996 
Seiler et al, 1996 
Seiler et al, 1996 
Seiler et al, 1996 
Seiler etal, 1996 
Seiler et al, 1996 
Seiler etal, 1996 
Seiler et al, 1996 
Seiler et al, 1996 
Seiler et al, 1996 
Virji etal, 1992 



gnmzq05 1000 



gnmzq07 BZ169 
gnmzq08 528 



gnmzq09 NGP165 
gnmzqlO BZ133 
gnmzqll NGE31 
gnmzql3 NGE28 
gnmzql4 NGH38 
gnmzql5 SWZ107 
gnmzql6 NGH15 
gnmzql7 NGH36 
gnmzql8 BZ232 
gnmzql9 BZ83 
gnmzq21 MC58 



Group A 

gnmzq22 205900 



Our collection 
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gnmzq23 F6124 
z2491 Z2491 



Our collection 
Maiden etaL, 1998 



Group C 

gnmzq24 90/18311 Our collection 
gnmzq25 93/4286 Our collection 

Others 

gnmzq26 A22 (group W) Maiden et aL, 1998 
gnmzq27 E26 (group X) Maiden et aL, 1998 
gnmzq28 860800 (group Y) Maiden et aL, 1998 
gnmzq29 E32 (group Z) Maiden et aL, 1998 
gnmzq31 N. lactamica Our collection 

Gonococcus 

gnmzq32 Ng F62 Maiden et aL, 1998 

gnmzq33 Ng SN4 Our collection 

fal090 FA1090 Dempsey et aL 1991 



References: 

Seiler A. et aL, Mol. Microbiol., 1996, 19(4):841-856. 

Maiden R. etaL, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. etaL, Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. et aL, J. Bacterid., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3149> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST- 
AAPISEAGYYVFPAAVVEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ01 <SEQ ID 3150> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAK7VRLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKT/^AYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ02 <SEQ ID 3151> 

MKPLI LGLAAVLALS ACQVQKAP DFD YTS FKE SKPAS I LWPPLNES PDVNGTWG VLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 



GNMZQ03 <SEQ ID 3152> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
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AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKGPRFVEEQPK* 

GNMZQ04 <SEQ ID 3153> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ05 <SEQ ID 3154> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ07 <SEQ ID 3155> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ08 <SEQ ID 3156> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ09 <SEQ ID 3157> 

MKPLILGLAAALVLSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVQPEKLHQIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
• DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ10 <SEQ ID 3158> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG ILKGPRFVEEQPK* 

GNMZQ11 <SEQ ID 3159> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKGPRFVEEQPK* 

GNMZQ13 <SEQ ID 3160> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ14 <SEQ ID 3161> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
, YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKGPRFVEEQPK* 

GNMZQ15 <SEQ ID 3162> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 



GNMZQ16 <SEQ ID 3163> 
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MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG ILKGPRFVEEQPK* 

GNMZQ17 <SEQ ID 3164> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSE AG YYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG ILKGPRFVEEQPK* 

GNMZQ18 <SEQ ID 3165> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAAD I HAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRG YQ VS KT AA YNLLS P Y S HNG I LKG PR FVEEQPK * 

GNMZQ19 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG ILKGPRFVEEQPK* 

GNMZQ21 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKG PR FVEEQPK* 

GNMZQ22 <SEQ ID 3167> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRG YQVSKTAAYN LLS PYSHNG I LKG PR FVEEQPK * 

GNMZQ23 <SEQ ID 3168> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAA YNLLS PYSHNG ILKGPRFVEEQPK* 

GNMZQ24 <SEQ ID 3169> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG ILKGPRFVEEQPK* 

GNMZQ25 <SEQ ID 3170> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPAS ILVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKG PR FVEEQPK* 

GNMZQ26 <SEQ ID 3171> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILVVPPLNES PDVNGTWGMLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 

GNMZQ27 <SEQ ID 3172> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 



WO 99/57280 



126 



PCT/US99/09346 



GNMZQ28 <SEQ ID 3173> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ29 <SEQ ID 3174> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ31 <SEQ ID 3175> 

MKPLILGLAAVLALSACQVQKAPDFDYTAFKESKPASILVVPPLNESPDVNGTWGMLAST 
AEPLSEAGYYVFP7VAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITITEYGTS 
YQI LDSVTTVSARARLVDSRNGKVLWSGSAS IREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKAAAYDLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ32 <SEQ ID 3176> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST 
AAP I SEAGYYVFPAAVVEETFKENGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ33 <SEQ ID 3177> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAP I SEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

Z2491 <SEQ ID 3178> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 



Figure 20 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORP 235, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 13 



Table 4 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 287 among different strains. 
Table 4 



287 gene variability: List of used Neisseria strains 

Identification Strains Reference 
number 
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Group B 

287^2 BZ198 
287_9 NGP165 
287J4 NGH38 
287 21 MC58 



Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
VivjietaL, 1992 



Group A 

z2491 Z2491 



Maiden etaL, 1998 



Gonococcus 

fal090 FA1090 



Dempsey etaL 1991 



References: 

Seiler A. etaL, Mol. Microbiol., 1996, 19(4):841-856. 

Maiden R. etaL, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. et aL, Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. et aL, J. Bacterid., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



287_14 <SEQ ID 3179> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 

QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLTPNHTPAS 

NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDD'PSAGGENAGNTAAQGTNQAENNQ 

TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 

QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 

FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 

GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 

DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_2 <SEQ ID 3180> 

MFKRS VIAMAC I FALSACGGGGGGS PDVKSADTLSKPAAPWSEKETEAKE DAPQAGSQG 

QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLTPNHTPAS 

NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 

TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 

QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 

FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 

GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 

DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_21. <SEQ ID 3181> 

MFKRS VIAMACI FALSACGGGGGGS PDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS 
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ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 

ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGD 

DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 

287_9 <SEQ ID 3182> 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA 

VSGAPQADTQDATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADTDS 

STPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAGENAGNTADQA 

ANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKVCDR 

DFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKDKSAS 

SSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYG 

AEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDFGSKS 

VDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPT 
DAEKGGFGV FAGKKEQD* 

FA1090 <SEQ ID 3183> 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA 
AGGAPQADTQDATAGEG SQDMAAVS AENTGNGGAATT DN PKNE DAGAQN DM PQN AAE SAN 

QTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSVVIDGPSQNITLTHCKGDSCNGDN 

LLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTDKPPTR 

SARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGS 

YALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGSKSVDGIIDSG 

DDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPTDAEKGGFG 
VFAGKKDRD* 

Z2491 <SEQ ID 3184> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 

QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 

NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 

AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 

QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS 

ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 

ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGD 

DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 



Figure 21 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amin 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 287, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 14 

Table 5 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 519 among different strains. 
Table 5 



519 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 








Group B 




Ij V \J L 




NG6/88 


Jv. iVIUAUIl / OCIICI ei u/., \.yy\j 




J17 


R7198 


IV. IVIUAUIl / JCUgl ei ul,, iyy\j 


7v0^ 


S 1 Pass 


NGV88 


P A/TnYfvn / ^<=»i1pr ot nl 1 QQA 
xv. IVIUAUIl / odlCl ei CIL.j 1 yyO 


7v04 


<j i y 


297-0 


XV. IVIUAUIl / OCIICI e-t t//., l"i/0 


7v0S 


S1Q 

J17 


1000 


xv. iviuauii / ociicr ei ut., iyyo 


WU 




R7147 


xv. iviuauii / seller t?t Cll.y lyyO 




"519 


n>z-i i \jy 


iv. iviuauii / ociicr ei u/., i^yo 


zvlf 


519 


NGE31 


P MnYnn / Qpi1f*r ot nl 1 QQ£ 
Iv. IVIUAUIl / oCIICi ei Cll.j lyyO 


zvl2_ 


519 


NGF26 


Iv. IVIUAUIl / oeilci CI ul. y lyyO 


zvl8_ 


519 


BZ232 


R. Moxon / Seiler et al, 1996 


zvl9_ 


.519 


BZ83 


R. Moxon / Seiler et al., 1996 


zv20 


519ass 


44/76 


R. Moxon / Seiler et al., 1996 


zv2f 


~519ass 


MC58 


R. Moxon 


zv96 519 


2996 


Our collection 



Group A 

zv22_519ass 205900 
z2491 519 Z2491 



R. Moxon 
R. Moxon / Maiden et al, 1998 



Others 

zv26_519 A22 (group W) R. Moxon / Maiden et al, 1998 
zv27_5 1 9 E26 (group X) R. Moxon / Maiden et al, 1 998 
zv28_519 860800 (group Y) R. Moxon / Maiden et al. 9 1998 
zv29_5 1 9ass E32 (group Z) R. Moxon / Maiden et al, 1998 



Gonococcus 

zv32_519 Ng F62 

fal090 519 FA1090 



R. Moxon / Maiden et al, 1998 
R. Moxon 



References: 

Seiler A. et aL, Mol. Microbiol, 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090_519 <SEQ ID 3185> 

MEFFI I LLAAVAV FGFKS FVV I PQQEVHVVERLGRFHRALTAGLN ILI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 
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Z2491_519 <SEQ ID 3186> 

MEFFIILLAAVVVFGFKSFVVIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV01_519 <SEQ ID 3187> 

MEFFIILLVAVAVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV02_519 <SEQ ID 3188> 

MEFFI ILLVAVAVFGFKS FWI PQQEVHVVERLGRFHRALTAGLNI LI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV03_519 <SEQ ID 3189> 

MEFFI ILLVAVAVFGFKS FWI PQQEVHVVERLGRFHRALTAGLNI LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV04_519 <SEQ ID 3190> 

ME FFI I LLVAVAVFG FKS FWI PQQEVH WERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV05_519 <SEQ ID 3191> 

MEFFIILLVAVAVFGFKSFWIPQQEVHVVERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV06_519ASS <SEQ ID 3192> 

ME FFI I LLVAVAVFGFKS FW I PQQEVH WERLGRFHRALT AG LN I L I P FI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTVFSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERK 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV07_519 <SEQ ID 3193> 

ME FFI I LLVAVAVFG FKS FWI PQQEVH WERLGRFHRALTAGLN I LI P FI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 
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ZV11_519 <SEQ ID 3194> 

ME FFI I LLAAVAV FG FKS FVV I PQQE VH VVERLGRFHRALTAGLN I L I PFI DRVA YRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I S AGMK 1 1 DS SKTAK * 

ZV12_519 <SEQ ID 3195> 

MEFFIILLVAVAVFGFKSFWIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DS SKTAK* 

ZV18_519 <SEQ ID 3196> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DS SKTAK* 

ZV19_519 <SEQ ID 3197> 

ME FFI I LLVAVAVFGFKS FWI PQQEVHVVERLGRFHRALTAGLNI LI PFI DRVAYRH S L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI IDS SKTAK* 

ZV20_519ASS <SEQ ID 3198> 

ME FFI I LLVAVAVFGFKS FWI PQQEVHVVERLGRFHRALTAGLNI LI PFI DRVAYRH SL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN ST WAALDE AAGAWGVKVLRYE IKDLVPPQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSM 
I SAGMKI IDS SKTAK* 

ZV21_519ASS <SEQ ID 3199> 

ME FFI I LLVAVAV FG FK S FVV I PQQE VH VVERLGRFHRALT AG LN I L I P F I DRVAYRH S L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN STVVAALDEAAGAWGVKVLRYE IKDLVPPQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I S AGMK 1 1 D S S KT AK * 

ZV22_519ASS <SEQ ID 3200> 

ME FFI I LLAA VWFG FKS FVVI PQQE VHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAKIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI IDS SKTAK* 

ZV26_519 <SEQ ID 3201> 

MEFFIILLAAVWFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INST WAALDE AAGAWGVKVLRYE IKDLVPPQE I LRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV27 519 <SEQ ID 3202> 
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ME FFI I LLVAVAVFGFKS FW I PQQEVHVVERLGRFHRALTAGLNI LI PFI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV28_519 <SEQ ID 3203> 

ME FFI I LLAAVAV FG FKS FW I PQQE VH WE RLGR FHRALT AGLN I L I P FI DRVAYRH S L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN ST VVAALDE AAGAWGVKVLRYE IKDLVPPQE I LRSMQAQIT AERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV29_519ASS <SEQ ID 3204> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDE INS I WSALDEAAGAWGVKVLR YE IKDLVPPQE ILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREPEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSNKTAK* 

ZV32_519 <SEQ ID 3205> 

ME FFI I LLAAVAVFG FKS FW I PQQE VHWERLGRFHRALT AGLN I L I P FI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDE INSTWSALDEAAGAWGVKVLRYEIKDLVPPQE I LRAMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV96_519 <SEQ ID 3206> 

MEFFI I LLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNI LI PFI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDE INST WAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEAN AE AI RQI AAALQTQGGADAVNLK I AEQ YVAAFNN LAKE SNT L I M PAN VAD I G S L 
ISAGMKIIDSSKTAK* 

Figure 22 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 519, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 15 



Table 6 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 919 among different strains. 
Table 6 

919 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 






Group B 




zmOl 


NG6/88 


iv, iviuAuii / ociict 61 ui.y I Wo 


zm02 


BZ198 


R MnYnn / Q/=>1 1^»r- r>i nl 1 OO/C 

jv. iviuauii / ociicr 6i ui. 9 ly^o 


zm03 


NG3/88 


jv. iviuAuii / oeiier 6i cLi.^ lyyo 


zm04 


297-0 


XV. iviuAuii / ociicr 61 (XL. , lyyo 


zm05 


1000 


iv. iviuAuii / ociier lyyo 


zm06 


BZ147 


R MViYfVn / 9Ipi1pr ot nl 1 QO/C 

IV. iviuauii / ociicr ct cZt.j 1 yyo 


zm07 


BZ169 


R A/fnYOn / ^Ipilpr ot nl 1 QQ/C 

iv, iviuauii / ociicr 6i cn. 9 iyyo 


zm08n 528 


R Moyoti / ^pilpr ot nl 1 QQA 
xv. xvxua.uh / OL/iioi ct C/t.j I yy\j 


zm09 


NGP165 


R MnYnn / 9pi1pr £>/ /// 1 QQA 
av. / ocuci tit, cii.y iyyo 


zmlO 


BZ133 


R Mnynn / ^1 pi Ipr ot nl 1QQA 

XV. IVIUAUII / uCUCI fit £*/., I77O 


zmllasbc NGE31 


R MnYnn / <spilpr ot nl 1 QQ/C 
iv. IVIUAUII / OCIICI 61 Cll.y I77O 


zml2 


NGF26 


R A/foYon / ^pilpr ot nl 1 QQ/C 

-lv. iviuauii / ociici ct c*/., 1 yyo 


zml3 


NGE28 


R lvflYY nn / ^Ipilpr ot nl 1 GQ£ 

iv« iviuauii / ociicr ci u/., iyyo 


zml4 


NGH38 


R A/fnYon / ^[pilpr /p/ /?/ 1 QQ/C 

J-V. iviuauii / OCIICI tit ti/.j 1 y yO 


?ml5 


SWZ107 


R MoYOn / ^Ipilpr ot nl 1 QQA 
xv. iviuauii / ociici et t*/, 5 iyyo 


zml6 


NGH15 


R A/ToYOn / ^Pil^r ot nl 1 OO/C 

iv. iviuauii / ociicr t?t u/.j iyyo 


zml7 


NGH36 


R. Moxon / Seiler et aL, 1996 


zml8 


BZ232 


R. Moxon / Seiler et aL, 1996 


zml9 


BZ83 


R Moxon / Seiler nl 1 QQfi 


zm20 


44/76 


iv. iviuxon / oeiier 61 ul, iyyo 


zm21 


MC58 


R. Moxon 


zm96 


2996 


Our collection 




Group A 




zm22 


205900 


R. Moxon 



zm23asbc F6124 R. Moxon 

z249 1 Z2491 R. Moxon / Maiden et aL, 1 998 

Group C 

zm24 90/18311 R Moxon 
zm25 93/4286 R Moxon 

Others 

zm26 A22 (group W) R. Moxon / Maiden et aL, 1998 
zm27bc E26 (group X) R. Moxon / Maiden et aL, 1 998 
zm28 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zm29asbc E32 (group Z) R. Moxon / Maiden et aL, 1998 
zm31asbc N. lactamica R. Moxon 

Gonococcus 

zm32asbc Ng F62 R. Moxon / Maiden et aL, 1998 

zm33asbc Ng SN4 R. Moxon 



fal090 FA1090 R. Moxon 
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Seiler A. etaL, Mol. Microbiol, 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3207> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

YTVVPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 

IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPE YRP * 

Z2491 <SEQ ID 3208> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPE YRP* 

ZM01 <SEQ ID 3209> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPE YRP* 

ZM02 <SEQ ID 3210> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPE YRP* 

ZM03 <SEQ ID 3211> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPE YRP* 



ZM04 <SEQ ID 3212> 
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MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLL PNGMKPE YR P * 

ZM05 <SEQ ID 3213> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLSCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQT SMQG I KAYMRQN PQRLAE VLGQN PS Y I FFRE LAGS SN DG PVGALGT PLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM06 <SEQ ID 3214> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM07 <SEQ ID 3215> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM08N <SEQ ID 3216> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGT PLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM09 <SEQ ID 3217> 

MKKYLFRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGS LAGTVTGYYE PVLKGDDRRTAQARFPI YGI PDDFI S VPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQG IKS YMRQN PQRLAE VLGQNPS Y I FFRELTGSGNDG PVGALGT PLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM10 <SEQ ID 3218> 

MKKYLFRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGS LAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM11ASBC <SEQ ID 3219> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVGGGGAV 

■YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM12 <SEQ ID 3220> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

' ZM13 <SEQ ID 3221> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAEQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM14 <SEQ ID 3222> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSRNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM15 <SEQ ID 3223> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDLAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNHQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM16 <SEQ ID 3224> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPGRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM17 <SEQ ID 3225> 



WO 99/57280 



PCT/US99/09346 



137 



MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 

YT VVPHLS LPHWAAQDFAKS LQS FRLGCANLKNRQGWQDVCAQAFQT PVH S FQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM18 <SEQ ID 3226> 

MKKYL FRAALYG I AAAI LAACQSKS I QT FPQPDTS VING PDR PVG I PDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHS FQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM19 <SEQ ID 3227> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YT WPHLS L PHWAAQDFAKS LQS FRLGCANLKNRQGWQDVCAQAFQT P VH S FQAKQ FFE R 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKS YMRQN PQRLAE VLGQN PS Y I FFRELAG S SN DG PVGALGT PLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM20 <SEQ ID 3228> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 

YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHS FQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQG IKS YMRQNPQRLAEVLGQNPSY I FFRE LAGS SNDGPVGALGTPLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM21 <SEQ ID 3229> 

MKKYLFRAALYG I AAAI LAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 

YTWPHLS LPHWAAQDFAKS LQS FRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQG IKSYMRQNPQRLAEVLGQNPSYIFFRELAGS SNDGPVGALGTPLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM22 <SEQ ID 3230> 

MKKYLFRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKS LQS FRLGCANLKNRQGWQDVCAQAFQT PVHS VQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM23ASBC <SEQ ID 3231> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGC AN LKNRQGWQDVCAQAFQTPVHS FQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLGQT SMQG IKS YMRQN PQRLAEVLGQN PS YI FFRELAGS SNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTSKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP* 

ZM24 <SEQ ID 3232> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLL PN GMKPEYRP* 

ZM25 <SEQ ID 3233> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQG IKS YMRQN PQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM26 <SEQ ID 3234> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMQQN PQRLAEVLGQN PSYIFFRELTGS SNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM27BC <SEQ ID 3235> 

MKKYLFRAALYGISAAILAACQSKSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
Y FT PWQVAGNGSLAGTVTGY YE PVLKGDDRRTAQARFP I YGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAP I LG YAE D PVE LFFMH I QG S GRLKT PSGKY I R I G YADKNEH P YV S I GRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP* 

ZM28 <SEQ ID 3236> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDF7VKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGY YE PVLKGDDRRTAQARFP I YGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQT SMQG I KAYMRQN PQRLAEVLGQN PS Y I FFREL AG S SN DG PVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM29ASBC <SEQ ID 3237> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS I GRYMADKGYL 
KLGQTSMQGIKSYMRQN PQRLAEVLGQN PSYIFFRELTGSGNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATTHPITRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM31ASBC <SEQ ID 3238> 
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MKKHLFRAALYGIAAAILAACQSKSIQTFPQPDTSIIKGPDRPAGIPDPAGTTVGGGGAV 

YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYVFFRELAGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM32ASBC <SEQ ID 3239> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTVVPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFT PWQVAGNGS LAGTVTGYYE PVLKGDGRRTERARFPI YG I PDDFI SVPLPAGLRGGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGGDGPVGALGTPLMGGYAGA 

IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM33ASBC <SEQ ID 3240> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPIHSFQAKRFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGI PDDFI SVPLPAGLRGGKN 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPHKLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 

IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM96 <SEQ ID 3241> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLS L PHWAAQD FAKS LQS FRLGCANLKNRQGWQDVC AQAFQT P VHS FQAKQFFER 
YFT PWQVAGNGS LAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



Figure 23 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 919, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 16 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 7: Oligonucleotides used for PCR to amplify complete or partial ORFs 
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UKr primer Sequence 


Restriction 
sites 


001 


Forward CGCGGATCCC ATATG-TGGATGGTGCTGGTr' A T 


rsamrll- 






Ndel 
Xhol 


003 


Forward CGCGGATCCCATATG-GTCGTATTCGTGGG 


T~) TJT 

Jtsararll- 




Dpyprnp PPPOPTPO A A A A A Tr* A T/^ A A r* a /~»^ ^ 

xvcvcibc v^^^o^ i \^^/\\^-/\J\j\j\ 1 LA I LrAACACGCGC! 


Ndel 
Xhol 


005 


Forward CGCGGATCCCATATG-GACAATATTGArATGT 


rsamiil- 




Reverse CCCGCTCGAG-CATCACATCCGCCCG 


Ndel 
Xhol 


006 


Forward CGCGGATCCCATATG-CTGCTGGTGCTGG 


r>amrll- 




Reverse CCCGCTCGAG-AGTTCCGGCTTTGATGT 


Ndel 
Xhol 


007 


Forward CGCGGATCCCATATG-GCCGAC AACAGC ATG AT 


D otri n r 

jDaiTLtii- 




Reverse CCCGCTCGAG-AAGGCGTTCATGATATA AG 


Ndel 
Xhol 


008 


Forward CGCGGATCCCATATG-AACAACAGACATTTTG 


BamHI- 




Reverse CCCGCTCGAG-CCTGTCCGGTAAAAGAC 


Ndel 
Xhol 


009 


Forward CGCGGATCCCATATG-CCCCGCGCTGCT 


BamHI- 




Reverse CCCGCTCGAG-TGGCTTTTGCCACGTTTT 


Ndel 
Xhol 


Oil 


Forward CGCGGATCCCATATG-AAGACACACCGCAAG 


BamHI- 




Reverse CCCGCTCGAG-GGCGGTCAGTACGGT 


Ndel 
Xhol 


012 


Forward CGCGGATCCCATATG-CTCGCCCGTTGCC 


BamHI- 




Reverse CCCGCTCGAG-AGCGGGGAAGAGGCAC 


Ndel 
Xhol 


013 


Forward CGCGGATCCCATATG-CCTTTGACCATGCT 


BamHI- 




Reverse CCCGCTCGAG-CTGATTCGGCAAAAAAATCT 


Ndel 
Xhol 


018 


Forward CGCGGATCCCATATG-CAGCAGAGGCAGTT 


BamHI- 




Reverse CCCGCTCGAG-GArGAGGPnA A.rnrr 


Ndel 
Xhol 


019 


Forward AAAGAATTC-CTGCCAGCCGGCAAGACCCCGGr 


EcoRJ 




Reverse AAACTGCAG-TCAGCGGGCGGGGACAATGCGCAT 


PstI 


023 


Forward AAAGAATTC-AAAGAATATTCGGCATGGCAGGC. 


EcoRI 




Reverse AAACTGCAG-TTACCCCCAAATCACTTTAACTGA 


PstI 


025 


Forward AAAGAATTC-TGCGCCACCCAACAGCCTGCTCC 


EcoRI 




Reverse AAACTGCAG-TCAGAACGCGATATAGCTGTTCGG 


PstI 


03 1 


Forward CGCGGATCCCATATG-GTCTCCCTTCGCTT 


BamHI- 




Reverse CCCGCTCGAG-ATGTAAGACGGGGACAAC 


Ndel 
Xhol 


032 


Forward CGCGGATCCCATATG-CGGCGAAACGTGC 


xjairirii- ! 
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Ndel 




Reverse 


CCCGCTCGAG-CTGGTTTTTTGATATTTGTG 


Xhol 




T7 nrward 

J. til W Cu VA 


CGCGGATCCCATATG-GCGGCGGCAGACA 


BamHI- 








Ndel 




Reverse 

l V\_/ T VI U V 


CCCGCTCGAG-ATTTGCCGCATCCCGAT 


Xhol 


014 


Forward 


CGCGGATCCCATATG-GCCGAAAACAGCTACGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGACGATTTGGTTCAATT 


Xhol 




Forward 


CGCGGATCCCATATG-CTGAAGCCGTGCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CCGGACTGCGTATCGG 


Xhol 


038 

\J -J O 


Forward 


CGCGGATCCCATATG-ACCGATTTCCGCCA 


BamHI- 








Ndel 




R pverse 


CCCGCTCGAG-TTCTACGCCGTACTGCC 


Xhol 




X Ul W CU IX 


CGCGGATCCCATATG-CCGTCCGAACCGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TAGGATGACGAGGTAGG 


Xhol 


041 


Forward 


CGCGGATCCCATATG-TTCGTGCGCGAACCGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCCCAAAAACTCTTTCAAA 


Xhol 


042 


Forward 


CGCGGATCCCATATG-ACGATGATTTGCTTGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGCAGCCTGCATTTGAC 


Xhol 


043 


Forward 


AAAAAAGGTACC-ATGGTTGTTTCAAATCAAAATATC 


Kpnl 




Reverse 


AAACTGCAG-TTATTGCGCTTCACCTTCCGCCGC 


PstI 


043a 


Forward 


AAAAAAGGTACC-GCAAAAGTGCATGGCGGCTTGGACGGTGC 


Kpnl 




Reverse 


AAAAAACTGCAG- 


PstI 






TTAATCCTGCAACACGAATTCGCCCGTCCG 




044 


Forward 


CGCGGATCCCATATG-CCGTCCGACTAGAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATGCGCTACGGTAGCCA 


Xhol 


046 


Forward 


AAAGAATTC-ATGTCGGCAATGCTCCCGACAAG 


EcoRI 




Reverse 


AAACTGCAG-TCACTCGGCGACCCACACCGTGAA 


PstI 


047 


Forward 


CGCGGATCCCATATG-GTCATCATACAGGCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCCGAAAAAGCCCATTTTG 


Xhol 


048 


Forward 


AAAGAATTC- ATGCTC AAC AAAGGCG AAG AATTGC C 


EcoRI 




Reverse 


AAACTGCAG-TCAAGATTCGACGGGGATGATGCC 


PstI 


049 


Forward 


AAAGAATTC-ATGCGGGCGCAGGCGTTTGATCAGCC 


EcoRI 




Reverse 


AAACTGCAG-AAGGCGTATCTGAAAAAATGGCAG 


PstI 


050 


Forward 


CGCGGATCCCATATG-GGCGCGGGCTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AATCGGGCCATCTTCGA 


Xhol 


052 


Forward 


AAAAAAGAATTC-ATGGCTTTGGTGGCGGAGGAAAC 


EcoRI 




Reverse 


AAAAAAGTCGAC-TCAGGCGGCGTTTTTCACCTTCCT 


Sail 


052a Forward 


AAAAAAGAATTC-GTGGCGGAGGAAACGGAAATATCCGC 


EcoRI 
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Rgyprse AAAA AA CTGCAG -TTAGCTGTTTTTGGAAACGCCGTCCAACCC PstI 

073 Forward CGC GGATCCCATATG -TGTATGCCATATAAGAT BamHI- 

" Ndel 
Reverse CCC GCTCGAG -CACCGGATTGTCCGAC Xhol 
075 Forward CGC GGATCCCATATG -CCGTCTTACTTCATC BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATCACCAATGCCGATTATTT Xhol 
077a Forward AAAAAA GAATTC -GGCGGCATTTTCATCGACACCTTCCT Eco RJ 

Reverse AAAAAA CTGCAG -TCAGACGAACATCTGCACAAACGCAAT PstI 

080 Forward AAA GAATTC -GCGTCCGGGCTGGTTTGGTTTTACAATTC EcoRI 
Reverse AA ACTGCAG -CTATTCTTCGGATTCTTTTTCGGG PstI 

081 Forward AAA GAATTC -ATGAAACCACTGGACCTAAATTTCATCTG Eco RI 
Reverse AAA CTGCAG -TCACTTATCCTCCAATGCCTC PstI 

082 Forward AAA GAATTC -ATGTGGTTGTTGAAGTTGCCTGC Eco RI 
Reverse AA ACTGCAG -TTACGCGGATTCGGCAGTTGG Pst I 

084 Forward AA AGAATTC -TATCACCCAGAATATGAATACGGCTACCG EcoRI 
Reverse AAA CTGCAG -TTATACTTGGGCGCAACATGA Pst I 

085 Forward CGC GGATCCCATATG -GGTAAAGGGCAGGACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CAAAGCCTTAAACGCTTCG Xhol 

086 Forward AAAAAAGGTACC-TATTTGGCATCAAAAGAAGGCGG Kpnl 
Reverse AAA CTGCAG -TTACTCCACCCGATAACCGCG PstI 

087 Forward AAA GAATTC -ATGGGCGGTAAAACCTTTATGC EcoRI 
Reverse AAA CTGCAG -TTACGCCGCACACGCAATCGC Pst I 

087a Forward AAAAA AGAATTC -AAGCTATTAGGCGTGCCGATTGTGATTCA EcoRI 
Reverse AAAAAA CTGCAG -TTACGCCTGCAAGATGCCCAGCTTGCC PstI 

088 Forward AAAAAA GAATTC -ATGTTTTTATGGCTCGCACATTTCAG Eco RI 
Reverse AAAAAA CTGCAG -TCAGCGGATTTTGAGGGTACTCAAACC Pst I 

089 Forward CGC GGATCCCATATG -CCGCCCAAAATCAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TGCGCATACCAAAGCCA Xhol 

090 Forward CGC GGATCCCATATG -CGCATAGTCGAGCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGCAAAACGGCGGTACG Xhol 

091 Forward AA AGAATTC -ATGGAAATAGCCGTACCGCCGAGTCC Eco RI 
Reverse AA ACTGCAG -TCAGCGCAGGGGGTAGCCCAAGCC Pst I 

092 Forward AA AGAATTC -ATGTTTTTTATTTCAATCCG Eco RI 
Reverse AA ACTGCAG -TCAAATCTGTTTCGACAATGC PstI 

093 Forward AAA GAATTC -ATGCAGAATTTTGGCAAAGTGGC Eco RI 
Reverse AAA CTGCAG -CTATGGCTCGTCATACCGGGC Pst I 

094 Forward AAA GAATTC -ATGCCGTCACGGAAGCGCATCAACTC Eco RI 
Reverse AAA CTGCAG -TTATCCCGGCCATACCGCCGAACA PstI 

095 Forward AA AGAATTC -ATGTCCTTTCATTTGAACATGGACGG Eco RI 
Reverse AA ACTGCAG -TCAACGCCGCAGGCACTAACGCCC Pst I 

096 Forward AAAGAATTC-ATGGCTCGTCATACCGGGCAGGG EcoRI 
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Reverse AA ACTGC AG -TC AA AGGAA AAGGCCGTCTG A A A AClCd pit! — 

097 Forward AAAGAATTC-ATGGACACTTCAAAACAAACACTGTTG Eco RI 
Reverse AAACTGCAG-TCAGCCCAAATACCAGAATTTCAG PstI 

098 Forward AAAGAATTC-GATGAACGCAGCCCAGCATGGATACG Eco RI 
Reverse AAACTGCAG-TTACGACATTCTGATTTGGCA PstI 

102 Forward AAAAAAGAATTC-GGCCTGATGATTTTGGAAGTCAACAC Eco RI 

Reverse AAAAAACTGCAG-TTATCCTTTAAATACGGGGACGAGTTC PstI 

105 Forward CGC GGATCCCATATG -TCCGCAAACGA AT A C.G BamHI- 

Ndel 

Reverse CCCGCTCGAG-GTGTTCTGCCAGTTTCAG Xhol 

107 Forward AAAAA AGAATTC - EcoRI 

CTGATGATTTTGGAAGTCAACACCCATTATCC 

Reverse AAAAAACTGCAG-TTATCCTTTAAATACGGGGACGAGTTC PstI 

107b Forward AAAAA AGAATTC - EcoRI 

GATACCCAAGCCCCCGCCGGCACAAACTACTG 
Reverse AAAAAA CTGCAG - PstI 

TTACGCGTCGCCTTTAAAGTATTTGAGCAGGCTGGAGAC 

108 Forward AAAQAATTC-ATGTTGCCGGGCTTCAACCG EcoRI 
Reverse AAACTGCAG-TTAGCGGTACAGGTGTTTGAAGCA p s t I 

108a Forward AAAAAAGAATTC-GGTAACACATTCGGCAGCTTAGACGGTGG EcoRI 
Reverse AAACTGCAG-TTAGCGGTACAGGTGTTTGAAGCA p s tl 

109 Forward AAAGAATTC-ATGTATTATCGCCGGGTTATGGG EcoRI 
Reverse AAA CTGC AG -CTAGCCCAAAGATTTGA A GTGTTC PstI 

111 Forward CGCGGATCCCATATG-TGTTCGGAACAAACCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGGAGCAGTTTTTC A A A Xhol 
114 Forward CGC GGATCCCATATG -GCTTCCATCACTTCGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CATCCGCGAAATCGTC Xhol 

117 Forward AAAAA AGGTACC -ATGGTCGAAGAACTGGA ACTGCTG Kpn I 
Reverse AAACTGCAG-TTAAAGCCGGGTAACGCTCAATAC Pst I 

118 Forward AAAGTCGACATGTGTGAGTTCAAGGATATTATAAG Sail 
Reverse AAAGCATGC-CTATTTTTTGTTGTAATAATCAAATC Sph I 

121 Forward CGCGGATCCCATATG-GAAACACAGCTTTACAT BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATAATAATATCCCGCGCCC Xhol 

122 Forward CGCGGATCCCATATG-GTCATGATTAAAATCCGCA BamHI- 

Ndel 

Reverse CCCGCTCGAG-AATCTTGGTAGATTGGATTt Xhol 

125 Forward AAAGAATTC-ATGTCGGGCAATGCCTCCTCTCC Eco RI 
Reverse AAACTGCAG-TCACGCCGTTTCAAGACG PstI 

125a Forward AAAAAAGAATTC- ACGGCAGGCAGCACCGCCGC AC AGGTTTC Eco RI 
Reverse AAAAA ACTGCAG - Pst j 

TTATTTTGCCACGTCGGTTTCTCCGGTGAACAACGC 

126 Forward CGCGGATCCCATATG-CCGTCTGAAACCC BamHI 
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Ndel 

Reverse CCCGCTCGAG-ATATTCCGCCGAATGCC Xhol 

127 Forward AAAGAATTC-ATGGAAATATGGAATATGTTGGACACTTG EcoRI 
Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC Psti 

127a Forward AAAAAAGAATTC- AAGGAACTGATTATGTGTCTGTCGGG Eco RI 
Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC P st I 

128 Forward CGC GGATCCCATATG -ACTGAC A ACfiC A CT BamHI- 

Ndel 

Reverse CCCGCTCGAG-GACCGCGTTGTCGAAA Xhol 
130 Forward CGCGGATCCCATATG-AAACAACTCCGCGA BamHI- 

Ndel 

Reverse CCCGCTCGAG-GAATTTTGCACCGGATTG Xhol 
132 Forward AAAGAATTC-ATGGAACCCTTCAAAACCTTAATTTG EcoRI 
Reverse AAAAAACTGCAG-TCACCATGTCGGCATTTGAAAAAC Pst I 

134 Forward CGC GGATCCCATATG -TCCCAAGA A ATCCTC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CAGTTTGACCGAATGTTC Xhol 

135 Forward CGCGGATCCCATATG-AAATACAAAAGAATCGTATT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAATTCGGTCAGAAGCAGG Xhol 

137 Forward AAAAAAGGTACC-ATGATTACCCATCCCCAATTCGATCC Kpnl 
Reverse AAAAAACTGCAG-TCAGTGCTGTTTTTTCATGCCGAA Pst I 

137a Forward AAAAAAGAATTC-GGCCGCAAACACGGCATCGGCTTCCT EcoRI 
Reverse AAAAAACTGCAG-TTAAGCGGGATGACGCGGCAGCATACC Pst I 

138 Forward AAAAAAGAATTC-AACTCAGGCGAAGGAGTGCTTGTGGC EcoRI 
Reverse AAAAAATCTAGA^TCAGTTTAGGGATAGCAGGCGTAC Xbal 

141 Forward AAAGAATTC-ATGAGCTTCAAAACCGATGCCGAAATCGC Eco RI 
Reverse AAACTGCAG-TCAGAACAAGCCGTGAATCACGCC Pst I 

142 Forward CGC GGATCCC ATATG -CGTGCCG ATTTC A TG BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAACTGCTGCACATGGG Xhol 

143 Forward AAAAA AGAATTC - Eco RI 

ATGCTCAGTTTCGGCTTTCTCGGCGTTCAGAC 

Reverse AAAAAACTGCAG-TCAAACCCCGCCGTGTGTTTCTTTAAT Pst I 

144 Forward AAAAAAGAATTC-GGTCTGATCGACGGGCGTGCCGTAAC Eco RI 
Reverse AAAAAATCTAGA-TCGGCATCGGCCGGCATATGTCCG Xbal 

146 Forward AAAAA AGAATTC - Eco pj 

CGCCAAGTCGTCATTGACCACGACAAAGTC 

Reverse AAAAAACTGCAG-TTAGGCATCGGCAAATAGGAAACTGGG Pst I 

147 Forward AAAAAAGAAJTC-ACTGAGCAATCGGTGGATTTGGAAAC EcoRI 
Reverse AAAAAAJCTAGA-TTAGGTAAAGCTGCGGCCCATTTGCGG Xbal 

48 Forward AAAAA AGAATTC - Eco RT 

ATGGCGTTAAAAACATCAAACTTGGAACACGC 

Reverse AAAAAATCTAGA-TCAGCCCTTCATACAGCCTTCGTTTTG Xba I 

49 Forward CGCGGATCCCATATG-CTGCTTGAC A AC A A AfTT BamHI- 
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Ndel 

Reverse CCCG CTCGAG -AAACTTCACGTTCACGCC Xhol 

150 Forward CGC GGATCCCATATG -CAGAACACAAATCCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATAAACATCACGCTGATAGC Xhol 

151 Forward AAAAAA GAATTC - Eco RI 

ATGAAACAAATCCGCAACATCGCCATCATCGC 
Reverse AAAAA ACTGCAG -TCAATCCAGCTTTTTAAAGTGGCGGCG Pst I 

152 Forward AAAAA AGAATTC - Eco RI 

ATGAAAAACAAAACCAAAGTCTGGGACCTCCC 
Reverse AAAAA ACTGCAG -TCAGGACAGGAGCAGGATGGCGGC Pst I 

153 Forward AAAAAAGAATTC- ATGGCGTTTGCTTACGGTATGAC EcoRI 
Reverse AAAAA ACTGCAG -TCAGTCATGTTTTTCCGTTTCATT Pst I 

153a Forward AAAAAAGAATTC-CGGACTTCGGTATCGGTTCCCCAGCATTG EcoRI 
Reverse AAAAA ACTGCAG - Psti 
TTACGCCGACGAAATACTCAGACTTTTCGG 

154 Forward CGC GGATCCCATATG -ACTGACAACAGCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TCGGCTTCCTTTCGGG Xhol 

155 Forward AAAAAAGAATTC- ATGAAAATCGGTATCCCACGCGAGTC EcoRI 
Reverse AAAAA ACTGCAG -TTACCCTTTCTTAAACATATTCAGCAT Pst I 

156 Forward AAAAAA GAATTC -GCACAGCAAAACGGTTTTGAAGC EcoRI 
Reverse AAAAAA CTGCAG -TCAAGCAGCCGCGACAAACAGCCC Pst I 

157 Forward CGC GGATCCCATATG -AGGAACGAGGAAAAAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAAACACAATATCCCCGC Xhol 

158 Forward AAAAAAGAATTC-GCGGAGCAGTTGGCGATGGCAAATTCTGC EcoRI 
Reverse AAAAA ATCTAGA -TTATCCACAGAGATTGTTTCCCAGTTC Xbal 

160 Forward CGC GGATCCCATATG -GACATTCTGGACAAAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTTTGCCCGCCTTCTTT Xhol 

163 Forward AAAAA AGGTACC -ACCGTGCCGGATCAGGTGCAGATGTG Kpn I 
Reverse AAAAAATCJAGA-TTACTCTGCCAATTCCACCTGCTCGTG Xbal 

163a Forward AAAAAAGAATTC-CGGCTGGTGCAGATAATGAGCCAGAC EcoRI 
Reverse AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG Xbal 

164 Forward CGC GGATCCCATATG -AACCGGACTTATGCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGTTTCCGTCAAACTGC Xhol 

165 Forward CGC GGATCCGCTAGC -GCTGAAGCGACAGACG BamHI- 

Nhel 

Reverse CCCG CTCGAG -AATATCCAATACTTTCGCG Xhol 
206 Forward CGC GGATCCCATATG -AAACACCGCCAACCGA BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTCTGTAAAAAAAGTATGTGC Xhol 
209 Forward CGC GGATCCCATATG -CTGCGGCATTTAGGA BamHI- 
Ndel 
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Reverse CCCG CTCGAG -TACCCCTGAAGGCAAC Xhol 

211 Forward AAAAA AGAATTC -ATGTTGCGGGTTGCTGCTGC EcoRI 
Reverse AAAAA ACTGCAG -CTATCCTGCGGATTGGCATTGAAA PstI 

212 Forward CGCGGATCCCATATG-GACAATCTCGTATGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGGGGTTAGATCCTTCC Xhol 

215 Forward CGC GGATCCCATATG -GCATGGTTGGGTCGT BamHI- 

Ndel 

Reverse CCC GCTCGAG -CATATCTTTTGTATCATAAATC Xhol 

216 Forward CGC GGATCCCATATG -GCAATGGCAGAAAACG BamHI- 

Ndel 

Reverse CCC GCTCGAG -TACAATCCGTGCCGCC Xhol 

217 Forward CGC GGATCCCATATG -GCGGATGACGGTGTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -ACCCCGAATATCGAATCC Xhol 

218 Forward CGC GGATCCCATATG -GTCGCGGTCGATC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TAACTCATAGAATCCTGC Xhol 

219 Forward CGC GGATCCGCTAGC -ACGGCAAGGTTAAG BamHl- 

Nhel 

Reverse CCCG CTCGAG -TTTAAACCATCTCCTCAAAAC Xhol 
223 Forward CGC GGATCCCATATG -GAATTCAGGCACCAAGTA BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCTTCCCGCGTGTC Xhol 

225 Forward CGC GGATCCCATATG -GACGAGTTGACCAACC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GTTCAGAAAGCGGGAC Xhol 

226 Forward AAA GAATTC -CTTGCGATTATCGTGCGCACGCG . Eco RI 
Reverse AA ACTGCAG -TCAAAATCCCAAAACGGGGAT PstI 

228 Forward CGC GGATCCCATATG -TCGCAAGAAGCCAAACAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGGCGGCATCTTTCAT Xhol 

229 Forward CGC GGATCCCATATG -CAAGAGGTTTTGCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACACAATATAGCGGATGAAC Xhol 

230 Forward CGC GGATCCCATATG -CATCCGGGTGCCGAC BamHI- 

Ndel 

. Reverse CCCG CTCGAG -AAGTTTGGCGGCTTCGG Xhol 

232 Forward AAAAAA GAATTC -ATGTACGCTAAAAAAGGCGGTTTGGG Eco RI 
Reverse AAAAA ACTGCAG -TCAAGGTTTTTTCCTGATTGCCGCCGC PstI 

232a Forward AAAAA AGAATTC -GCCAAGGCTGCCGATACACAAATTGA Eco RI 
Reverse AAAAAA CTGCAG -TTAAACATTGTCGTTGCCGCCCAGATG PstI 

233 Forward CGC GGATCCCATATG -GCGGACAAACCCAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GACGGCATTGAGCAG Xhol 

234 Forward CGCGGATCCCATATG-GCCGTTTCACTGACCG BamHI- 
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Ndel 




Reverse 


GCCCAAGCTT-ACGGTTGGATTGCCATG 


Hind III 


235 


Forward 


CGCGGATCCCATATG-GCCTGCCAAGTTCAAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGGGCTGCTCTTC 


Xhol 


236 


Forward 


CGCGGATCCCATATG-GCGCGTTTCGCCTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATGGGTCGCGCGCCGT 


Xhol 


238 


Forward 


CGCGGATCCGCTAGC-AACGGTTTGGATGCCCG 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-TTTGTCTAAGTTCCTGATATG 


Xhol 


239 


Forward 


CCGGAATTCTACATATG-CTCCACCATAAAGGTATTG 


EcoRJ- 








Ndel 




Reverse 


CCCGCTCGAG-TGGTGAAGAGCGGTTTAG 


Xhol 


240 


Forward 


CGCGGATCCCATATG-GACGTTGGACGATTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAACGCCATTACCCGATG 


Xhol 


241 


Forward 


CCGGAATTCTACATATG-CCAACACGTCCAACT 


EcoRJ- 








Ndel 




Reverse 


CCCGCTCGAG-GAATGCGCCTGTAATTAATC 


Xhol 


242 


Forward 


CGCGGATCCCATATG-ATCGGCAAACTTGTTG 


BamHI- 








Ndel 




Reverse 


GCCCAAGCTT-ACCGATACGGTCGCAG 


Hindlll 


243 


Forward 


CGCGGATCCCATATG-ACGATTTTTTCGATGCTGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGACTTGGTTACCGCG 


Xhol 


244 


Forward 


CGCGGATCCCATATG-CCGTCTGAAGCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTTTTCGGTAGGGGATTT 


Xhol 


246 


Forward 


CGCGGATCCCATATG-GACATCGGCAGTGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CCCGCGCTGCTGGAG 


Xhol 


247 


Forward 


CGCGGATCCCATATG-GTCGGATCGAGTTAC 


BamHT- 








Ndel 




Reverse 


CCCGCTCGAG-AAGTGTTCTGTTTGCGCA 


Xhol 


248 


Forward 


CGCGGATCCCATATG-CGCAAACAGAACACT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTCATCATTATTGCTAACA 


Xhol 


249 


Forward 


CGCGGATCCCATATG-AAGAATAATGATTGCTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCCGACCTCCGAC 


Xhol 


2M 


Forward 


LOLOuA 1 ULC A 1 A 1 G-(_G 1 CjL 1 GCGCr 1 AGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TACGAAAGCCGGTCGTG 


Xhol 


253 


Forward 


AAAAAAGAATTC-ATGATTGACAGGAACCGTATGCTGCG 


EcoRI 




Reverse 


AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG 


PstI 
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253a Forward AAAAAAG AATTC- A AAATCCTTTTG AAAAC AAGC G AAAACGG Eco RI 



Reverse AAAAAA CTGCAG -TTATTGGTCTTTCAAACGCCCTTCCTG PstI 

254 Forward AAAAAA GAATTC -ATGTATACAGGCGAACGCTTCAATAC Eco RI 
Reverse AAAAA ATCTAGA -TCAGATTACGTAACCGTACACGCTGAC Xbal 

255 Forward CGC GGATCCCATATG -GCCGCGTTGCGTTAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCGCAATACCGACCAG Xhol 

256 Forward CGC GGATCCGCTAGC -TTTTAACACCGCCGGAC BamHI- 

Nhel 

Reverse CCC GCTCGAG -ACGCCTGTTTGTGCGG Xhol 

257 Forward CGC GGATCCCATATG -GCGGTTTCTTTCCTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCGCGTGAATATCGCG Xhol 

258 Forward AAAAAAGAATTC-GATTATTTCTGGTGGATTGTTGCGTTCAG Eco RI 
Reverse AAAAAA CTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG PstI 

258a Forward AAAAAA GAATTC -GCGAAGGCGGTGGCGCAAGGCGA Eco RI 

Reverse AAAAAA CTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG PstI 

259 Forward CGC GGATCCCATATG -GAAGAGCTGCCTCCG Bamffl- 

Ndel 

Reverse CCCG CTCGAG -GGCTTTTCCGGCGTTT Xhol 

260 Forward CGC GGATCCCATATG -GGTGCGGGTATGGT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACAGGGCGACACCCT Xhol 

261 Forward AAAAA AGAATTC -CAAGATACAGCTCGGGCATTCGC Eco RI 
Reverse AAAAAA CTGCAG -TCAAACCAACAAGCCTTGGTCACT PstI 

263 Forward CGC GGATCCCATATG -GCACGTTTAACCGTA BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCGTAAGCCTGCAATT Xhol 

264 Forward AAAAAA GGTACC -GCCGACGCAGTGGTCAAGGCAGAA Kpn I 
Reverse AAA CTGCAG -TCAGCCGGCGGTCAATACCGCCCG PstI 

265 Forward AAAAA AGAATTC -GCGGAGGTCAAGAGAAGGTGTTTG Eco RI 
Reverse AAAAA ACTGCAG -TTACGAATACGTCGTCAAAATGGG PstI 

266 . Forward AAA GAATTC -CTCATCTTTGCCAACGCCCCCTTC Eco RI 

Reverse AA ACTGCAG -CTATTCCCTGTTGCGCGTGTGCCA PstI 

267 Forward AAA GAATTC -TTCTTCCGATTCGATGTTAATCG EcoRI 
Reverse AAA CTGCAG -TTAGTAAAAACCTTTCTGCTTGGC PstI 

269 Forward AA AGAATTC -TGCAAACCTTGCGCCACGTGCCC Eco RI 
Reverse AAA CTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG PstI 

269a Forward AAAAAA GAATTC -GACTTTATCCAAAACACGGCTTCGCC Eco RI 
Reverse AA ACTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG Pst I 

270 Forward AAA GAATTC -GCCGTCAAGCTCGTTTTGTTGCAATG Eco RI 
Reverse AAA CTGCAG -TTATTCGGCGGTAAATGCCGTCTG PstI 

271 Forward CGC GGATCCCATATG -CCTGTGTGCAGCTCGAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TCCCAGCCCCGTGGAG Xhol 
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272 Forward AAAGAATTC-ATGACCGCAAAGGAAGAACTGTTCGC Eco RI 
Reverse AA ACTGCAG -TCAGAGCAGTTCCAAATCGGGGCT PstI 

273 Forward AAAGAATTC-ATGAGTCTTCAGGCGGTATTTATATACCC Eco RI 
Reverse AA ACTGCAG -TTACGCGTAAGAAAAAACTGC PstI 

274 Forward CGC GGATCCCATATG -ACAGATTTGGTTACGGAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGCTTTCAGTATTATTGAA Xhol 

276 Forward AAAAAA GAATTC - Eco RI 

ATGATTTTGCCGTCGTCCATCACGATGATGCG 
Reverse AAAAAACTGCAG-CTACACCACCATCGGCGAATTTATGGC PstI 

277 Forward AAAAAAGAATTC- ATGCCGCGCTTTGAGGACAAGCTCGTAGG Eco RI 
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA PstI 

277a Forward AAAAAAGAATTC-GGGGCGGCGGCTGGGTTGGACGTAGG Eco RI 
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA PstI 

278 Forward AAAAAAGGTACC-GTCAAAGTTGTATTAATCGGGCCTTTGCC Kpn I 
Reverse AAAAAACTCCAG-TCATTCAACCATATCAAATCTGCC PstI 

278a Forward AAAAA AGAATTC -AAAACTCTCCTAATTCGTCATAGTCG Eco RI 
Reverse AAAAAACTGCAG-TCATTCAACCATATCAAATCTGCC PstI 

279 Forward CGC GGATCCCATATG -TTGCCTGCAATCACGATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTAGAAGCGGGCGGCAA Xhol 

280 Forward AAAAAA GGTACC -GCCCCCCTGCCGGTTGTAACCAG Kpn I 
Reverse AAAAAACTGCAG-TTATTGCTTCATCGCGTTGGTCAAGGC PstI 

281 Forward AAAAAAGAATTC-GCACCCGTCGGCGTATTCCTCGTCATGCG EcoRJ 
Reverse AAAAAA TCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

281a Forward AAAAAAGAATTC-TCCTACCACATCGAAATTCCTTCCGG EcoRI 
Reverse AAAAAAJCTAGA-GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

282 Forward AAAAAAGAATTC-CTTTACCTTGACCTGACCAACGGGCACAG Eco RJ 
Reverse AAAAAACTGCAG-TCAACCTGCCAGTTGCGGGAATATCGT PstI 

283 Forward CGC GGATCCCATATG -GCCGTCTTTACTTGGAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGGCAGTATTTGTTTACG Xhol 

284 Forward CGC GGATCCCATATG -TTTGCCTGCAAAAGAATCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -CCGACTTTGCAAAAACTG Xhol 

286 Forward CG CGGATCCCATATG -GCCGACCTTTCCGAAAA BamHI- 

Ndel 

Reverse CCC GCTCGAG -GAAGCGCGTTCCCAAG Xhol 

287 Forward CCG GAATTC TA GCTAGC -CTTTCAGCCTGCGGG EcoRI- 

Nhel 

Reverse CCCG CTCGAG -ATCCTGCTCTTTTTTGCC Xhol 

288 Forward CGC GGATCCCATATG -CACACCGGACAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGTATCAAAGACTTGCGT Xhol 
290 Forward CGCGGATCCCATATG-GCGGTTTGGGGCGGA BamHI- 
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Ndel 

Reverse CCCG CTCGAG -TCGGCGCGGCGGGC Xhol 
292 Forward CGC GGATCCCATATG -TGCGGGCAAACGCCC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TTGATTTTTGCGGATGATTT Xhol 

294 Forward AAAAAA G AATTC -GTCTGGTCGATTCGGGTTGTC AG A A C Eco RI 
Reverse AAAAAACTGCAG-TTACCAGCTGATATAAAACATCGCTTT PstI 

295 Forward CGC GGATCCCATATG -AACCGGCCGGCCTCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGATATTTGATTCCGTTGC Xhol 

297 Forward AAAAAA GAATTC -GCATACATTGCTTCGACAGAGAG Eco RI 
Reverse AAAAA ACTGCAG -TCAATCCGATTGCGACACGGT PstI 

298 Forward AAAAAA G AATTC -CTGATTGCCGTGTGGTTC AGCC AA A A CC.C. Eco RI 
Reverse AAAAAACTGCAG-TCATGGCTGTGTACTTGATGGTTGCGT PstI 

299 Forward CGC GGATCCGCTAGC -CTACCTGTCGCCTCCG BamHI- 

Nhel 

Reverse CCC GCTCGAG -TTGCCTGATTGCAGCGG Xhol 
302 Forward AAAAAA GAATTC -ATGAGTCAAACCGATACGCAACG Eco RI 

Reverse AAAAA ACTGCAG -TTAAGGTGCGGGATAGAATGTGGGCGC PstI 

305 Forward AAAAA AGGTACC -GAATTTTTACCGATTTCCAGCACCGGA Kpn I 
Reverse AAAAA ACTGCAG -TCATTCCCAACTTATCCAGCCTGACAG PstI 

305a Forward AAAAAA GGTACC -TCCCGTTCGGGCAGTACGATTATGGG Kpn I 
Reverse AAAAAACTGCAG-TTACAAACCGACATCATGCAGGGTGAA PstI 

306 Forward CGC GGATCCCATATG -TTTATGAACAAATTTTCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCGCATCGGCAGAC Xhol 
308 Forward CGC GGATCCCATATG -TTAAATCGGGTATTTTATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCGCCATTCCCTGC Xhol 

311 Forward AAAAAA GGTACC -ATGTTCAGTTTTGGCTGGGTGTTT Kpn I 
Reverse AAA CTGCAG -ATGTTCATATTCCCTGCCTTCGGC PstI 

312 Forward AAAAAA GGTACC -ATGAGTATCCCATCCGGCGAAATT Kpn I 
Reverse AA ACTGCAG -TCAGTTTTTCATCGATTGAACCGG PstI 

313 Forward AAAAA AGAATTC -ATGGACGACCCGCGCACCTACGGATC Eco RI 
Reverse AAAAA ACTGCAG -TCAGCGGCTGCCGCCGATTTTGCT PstI 

401 Forward CGC GGATCCCATATG -AAGGCGGCAACACAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCTTACGTTTTTCAAAGCC Xhol 

402 Forward AAAAAAGAATTC-GTGCCTCAGGCATTTTCATTTACCCTTGC Eco RI 
Reverse AAAAA ATCTAGA -TTAAATCCCTCTGCCGTATTTGTATTC Xbal 

402a Forward AAAAAAGAATTC-AGGCTGATTGAAAACAAACACGG Eco RI 

Reverse AAAAA ATCTAGA -TTAAATCCCTCTGCCGTATTTGTATTC Xbal 
406 Forward CGC GGATCCCATATG -TGCGGGACACTGACAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGGTTGTCCTTGTCTATG Xhol 



WO 99/57280 



PCTAJS99/09346 



151 



501 


Forward CGCGGATCCCATATG-GCAGGCGGAGATGGC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GGTGTGATGTTCACCC 


Xhol 


502 


Forward CGCGGATCCCATATG-GTAGACGCGCTTAAGCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AGCTGCATGGCGGCG 


Xhol 


503 


Forward CGCGGATCCCATATG-TGTTCGGGGAAAGGCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CCGCGCATTCCTCGCA 


Xhol 


504 


Forward CGCGGATCCCATATG-AGCGATATTGAAGTGACG 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-TGATTCAAGTCCTTGCCG 


Hindlll 


505 


Forward CGCGGATCCCATATG-TTTCGTTTACAATTCAGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CGGCGTTTTATAGCGG 


Xhol 


510 


Forward CGCGGATCCCATATG-CCTTCGCGGACAC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GCGCACTGGCAGCG 


Xhol 


512 


Forward CGCGGATCCCATATG-GGACATGAAGTAACGGT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AGGAATAGCCTTTGACG 


Xhol 


515 


Forward CGCGGATCCCATATG-GAGGAAATAGCCTTCGA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AAATGCCGCAAAGCATC 


Xhol 


516 


Forward CGCGGATCCCATATG-TGTACGTTGATGTTGTGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGCGGGCGGCATC 


Xhol 


517 


Forward CGCGGATCCCATATG-GGTAAAGGTGTGGAAATA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GTGCGCCCAGCCGT 


Xhol 


518 


Forward AAAGAATTC-GCTTTTTTACTGCTCCGACCGGAAGG 


EcoRI 




Reverse AAACTGCAG-TCAAATTTCAGACTCTGCCAC 


PstI 


519 


Forward CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGGCGGTTTTGCTGC 


Xhol 


520 


Forward CGCGGATCCCATATG-CCTGCGCTTCTTTCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ATATTTACATTTCAGTCGGC 


Xhol 


521 


Forward CGCGGATCCCATATG-GCCAAAATCTATACCTGC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CATACGCCCCAGTTCC 


Xhol 


522 


Forward CGCGGATCCCATATG-ACTGAGCCGAAACAC 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-TTCTGATTTCAAATCGGCA 


Hindlll 


523 


Forward CGCGGATCCCATATG-GCTCTGCTTTCCGCG 


BamHI- 






Ndel | 
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Reverse CCCG CTCGAG -AGGGTGTGTGATAATAAGAAG Xhol 
525 Forward CGC GGATCCCATATG -GCCGAAATGGTTCAAATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCCGTGCATATCATAAA Xhol 
527 Forward AA AGAATTC -TTCCCTCAATGTTGCCGTTTTCG Eco RI 

Reverse AAACTGCAG-TTATGCTAAACTCGAAACAAATTC PstI 

529 Forward CGC GGATCCGCTAGC -TGCTCCGGCAGCAAAAC BamHI- 

Nhel 

Reverse GCCC AAGCTT -ACGCAGTTCGGAATGGAG Hindlll 

530 Forward CGC GGATCCCATATG -AGTGCGAGCGCGG BamHI- 

Ndel 

Reverse CCC GCTCGAG -ACGACCGACTGATTCCG Xhol 

531 Forward AAAAA AGAATTC -TATGCCGCCGCCTACCAAATCTACGG Eco RI 
Reverse AAAAAACTGCAG-TTAAAACAGCGCCGTGCCGACGACAAG PstI 

532 Forward AAAAAAGAATTC-ATGAGCGGTCAGTTGGGCAAAGGTGC Eco RI 
Reverse AAAAAA CTGCAG -TCAGTGTTCCAAGTGGTCGGTATC A A A PstI 

532a Forward AAAAAAGAATTC-TTGGGTGTCGCGTTTGAGCCGGAAGT Eco RI 

Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA PstI 

535 Forward AAA GAATTC -ATGCCCTTTCCCGTTTTCAGAC Eco RI 

Reverse AA ACTGCAG -TCAGACGACCCCGCCTTCCCC PstI 

537 Forward CGC GGATCCCATATG -CATACCCAAAACCAATCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCTGCAAATAAAGGGTT Xhol 

538 Forward CGC GGATCCCATATG -GTCGAGCTGGTCAAAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TGGCATTTCGGTTTCGTC Xhol 

539 Forward CGC GGATCCGCTAGC -GAGGATTTGCAGGAAA BamHI- 

Nhel 

Reverse CCC GCTCGAG -TACCAATGTCGGCAAATC Xhol 

542 Forward AAA GAATTC -ATGCCGTCTGAAACCGTGTC Eco RI 
Reverse AA ACTGCAG -TTACCGCGAACCGGTCAGGAT PstI 

543 Forward AAAAAAGAATTC-GCCTTCGATGGCGACGTTGTAGGTAC Eco RI 
Reverse AAAAA ATCTAGA - Xbal 

TTAATG AAGAAG AAC ATATTGGAATTTTGG 

543a Forward AAAAAA GAATTC -GGCAAAACTCGTCATGAATTTGC Eco RI 

Reverse AAAAAA TCTAGA - Xbal 
TTAATGAAGAAGAAC ATATTGGAATTTTGG 

544 Forward AAAGAATTC-GCGCCCGCCTTCTCCCTGCCCGACCTGCACGG Eco RI 
Reverse AAA CTGCAG -CTATTGCGCCACGCGCGTATCGAT Pst I 

544a Forward AAAAAA GAATTC - Eco RI 
GCAAATGACTATAAAAACAAAAACTTCCAAGTACTTGC 

Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT PstI 

547 Forward AAAGAArrC-ATGTTCGTAGATAACGGATTTAATAAAAC Eco RI 
Reverse AA ACTGCAG -TTAACAACAAAAAACAAACCGCTT PstI 

548 Forward AAAGAATTC-GCCTGCAAACCTCAAGACAACAGTGCGGC Eco RI 
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Reverse AAA CTGCAG -TCAGAGCAGGGTCCTTACATCGGC PsTI — 
550 Forward AAAAAA GTCGAC - Sal I 

ATGATAACGGACAGGTTTCATCTCTTTCATTTTCC 

Reverse AAACTGCAG-TTACGCAAACGCTGCAAAATCCCC PstI 
550a Forward AAAAAAGAATTC-GTAAATCACGCCTTTGGAGTCGCAAACGG Eco RJ 

Reverse AAA CTGGAG -TTACGCAAACGCTGGAAAATCCCC p s tl 
552 Forward AAAAA AGAATTC -TTGGCGCGTTGGCTGGATAC Eco RI 

Reverse AAA CTGCAG -TTATTTCTGATGCCTTTTCCCAAC p s tl 
554 Forward CGC GGATCCCATATG -TCGCCCGCGCCCAAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CTGCCCTGTCAGACAC Xhol 

556 Forward AAAGAATTC-GCGGGCGGTTTTGTTTGGACATCCCG Eco RI 
Reverse AAACTGCAG-TTAACGGTGCGGACGTTTCTGACC Pstl 

557 Forward CGCGGATCCCATATG-TGCGGTTTCCACCTGAA BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTCCGCCTTCAGAAAGG Xhol 

558 Forward AAAGAATTC-GAGCTTTATATGTTTCAACAGGGGACGGC Eco RI 
Reverse AAA CTGC AG -CTAAAC AATGCCGTCTG AAAGTGG A G A Pstl 

558a Forward AAAAAAGAATTC-ATTAGATTCTATCGCCATAAACAGACGGG Eco RI 
Reverse AAAAA ACTGCAG -CTAAACAATGCCGTCTGAAAGTGGAGA Pstl 

560 Forward AAAAAA GAATTC - Eco RI 

TCGCCTTTCCGGGACGGGGCGCACAAGATGGC 
Reverse AAAAAACTGCAG-TCATGCGGTTTCAGACGGCATTTTGGC Pstl 

561 Forward CC GGAATTC T ACATATG -ATACTGCCAGCCCGT EcoRI- 

Ndel 

Reverse CCC GCTCGAG -TTTCAAGCTTTCTTCAGATG Xhol 

562 Forward CGC GGATCCCATATG -GCAAGCCCGTCGAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGACCAACTCCAACTCGT Xhol 

565 Forward CGC GGATCCCATATG -AAGTCGAGCGCGAAATAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GGCATTGATCGGCGGC Xhol 

566 Forward CGC GGATCCCATATG -GTCGGTGGCGAAGAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGCATGGGCGAAGTCA Xhol 

567 Forward CCG GAATTC TA CATATG -AGTGCGAACATCCTTG EcoRI- 

Ndel 

Reverse CCCG CTCGAG -TTTCCCCGACACCCTCG Xhol 

568 Forward CGC GGATCCCATATG -CTCAGGGTCAGACC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CGGCGCGGCGTTCAG Xhol 

569 Forward AAAAAAGAATTC-CTGATTGCCTTGTGGGAATATGCCCG Eco RI 
Reverse AAAAAACTGCAG-TTATGCATAGACGCTGATAACGGCAAT Pstl 

570 Forward CGC GGATCCCATATG -GACACCTTCCAAAAAATCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCGGGCGTTCATTTCTTT Xhol 
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571 Forward AAAAAA GAATTC - EcoRJ 

ATGGGTATTGCCGGCGCCGTAAATGTTTTGAACCC 
Reverse AAAAAA CTGCAG -TTATGGCCGACGCGCGGCTACCTGACG PstI 

572 Forward CGC GGATCCCATATG -GCGCAAAAAGGCAAAACC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGCAGTGTGCCGATA Xhol 

573 Forward CGC GGATCCCATATG -CCCTGTTTGTGCCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GACGGTGTCATTTCGCC Xhol 

574 Forward CGC GGATCCCATATG -TGGTTTGCCGCCCGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -AACTTCGATTTTATTCGGG Xhol 

575 Forward CGC GGATCCCATATG -GTTTCGGGCGAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CATTCCGAATCTGAACAG Xhol 

576 Forward CGC GGATCCCATATG -GCCGCCCCCGCATCT BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATTTACTTTTTTGATGTCGAC Xhol 

577 Forward CGC GGATCCCATATG -GAAAGGAACGGTGTATTT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGGCTGTTTGGTAGATTCG Xhol 

578 Forward CGC GGATCCCATATG -AGAAGGTTCGTACAG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCCAACGCCTCCACG Xhol 

579 Forward CGC GGATCCCATATG -AGATTGGGCGTTTCCAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -AGAATTGATGATGTGTATGT Xhol 

580 Forward CGC GGATCCCATATG -AGGCAGACTTCGCCGA BamHI- 

Ndel 

Reverse CCCG CTCGAG -CACTTCCCCCGAAGTG Xhol 

581 Forward CGC GGATCCCATATG -CACTTCGCCCAGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CGCCGTTTGGCTTTGG Xhol 

582 Forward AAAAA AGAATTC -TTTGGAGAGACCGCGCTGCAATGCGC Eco RI 
Reverse AAAAA ATCTAGA -TCAGATGCCGTCCCAGTCGTTGAA Xbal 

583 Forward AAAAAA GAATTC - ACTGCCGGCAATCGACTGCATAATCG Eco RI 
Reverse AAAAAA CTGCAG -TTAACGGAGGTCAATATGAT GAAATTG PstI 

584 Forward AAAAAA GAATTC - Eco RI 

GCGGCTGAAGCATTGAATTACAATATTGTC 
Reverse AAAAA ACTGCAG -TCAGAACTGAACCGTCCCATTGACGCT PstI 

585 Forward AAAAAA GGTACC -TCTTTCTGGCTGGTGCAGAACACCCTTGC Eco RI 
Reverse AAAAAA CTGCAG -TCAGTTCGCACTTTTTTCTGTTTTGGA PstI 

586 Forward CGC GGATCCCATATG -GCAGCCCATCTCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTCAGCGAATCAAGTTTC Xhol 

587 Forward CGC GGATCCCATATG -GACCTGCCCTTGACGA BamHI- 
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Ndel 

Reverse CCCG CTCGAG -AAATGTATGCTGTACGCC Xhol 

588 Forward AAAAAAGAATTC-GCCGTCCTGACTTCCTATCAAGAACCAGG Eco RI 
Reverse AAAAAA CTGC AG -TTATTTGTTTTTGGGC AGTTTC A C.TTC p st l 

589 Forward AAAAAAGAATTC- Eco pj 

ATGC AAC AAAAAATCCGTTTCC AAATC G AAGG 
Reverse AAAAAACTGCAG-CTAATCGATTTTTACCCGTTTCAGGCG Pstl 

590 Forward AAAAAA GAATTC - ATGAAAAAACCTTTGATTTCA GTTGCGOC Eco RJ 
Reverse AAAAAACTGCAG-TTACTGCTGCGGCTCTGAAACCAT Pstl 

591 Forward AAAAAA GAATTC -CACTACATCGTTGCCAGATTGTGCGG Eco RI 
Reverse AAAAAACTGCAG-CTAACCGAGCAGCCGGGTAACGTCGTT Pstl 

592a Forward AAAAAAGAATTC-CGCGATTACACCGCCAAGCTGAAAATGGG Eco RI 

Reverse AAAAAACTGCAG-TTACCAAACGTCGGATTTGATACG Pstl 
593 Forward CGC GGATCCGCTAGC -CTTGAACTGAACGGACTC BamHI- 

Nhel 

Reverse CCCG CTCGAG -GCGGAAGCGGACGATT Xhol 
594a Forward AAAAA AGAATTC -GGTAAGTTCGCCGTTCAGGCCTTTCA Eco RI 

Reverse AAAAAACTGCAG-TTACGCCGCCGTTTCCTGACACTCGCG Pstl 

595 Forward AAAAAAGAATJC-TGCCAGCCGCCGGAGGCGGAGAAAGC Eco RI 
Reverse AAAAAA CTGC AG -TTATTTCAAGCCGAGT ATGC CGCG p s tl 

596 Forward CGC GGATCCCATATG -TCCCAACAATACGTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGCGTTACCGGTTTGT Xhol 

597 Forward CGC GGATCCCATATG -CTGCTTCATGTCAGC BamHI- 

Ndel 

Reverse GCCCAAGCTT-ACGTATCCAGCTCGAAG Hindlll 

601 Forward CGC GGATCCCATATG -ATATGTTCCCAACCGGCAAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAAACAATCCTCAGGCAC Xhol 

602 Forward CGC GGATCCGCTAGC -TTGCTCCATCAATGC BamHI- 

Nhel 

Reverse CCCG CTCGAG -ATGCAGCTGCTAAAAGCG Xhol 

603 Forward AAAAAAGAATTC-CTGTCCTCGCGTAGGCGGGGACGGGG Eco RI 
Reverse AAAAAACTCCAG-CTACAAGATGCCGGCAAGTTCGGC Pstl 

604 Forward CGC GGATCCGCTAGC -CCCGAAGCGCACTT BamHI- 

Nhel 

Reverse CCC GCTCGAG -GACGGCATCTGCACGG Xhol 
606a Forward AAAAAAGj\ATTC-CGCGAATACCGCGCCGATGCGGGCGC Eco RI 
Reverse AAAAAACTGCAG-TTAAAGCGATTTGAGGCGGGCGATACG Pst I 

607 Forward AAAAAAGAATTC-ATGCTGCTCGACCTCAACCGCTTTTC Eco RI 
Reverse AAAAAACTGCAG-TCAGACGGCCTTATGCGATCTGAC Pstl 

608 Forward AAAAAAGj^ATTC-ATGTCCGCCCTCCTCCCCATCATCAACCG Eco RI 
Reverse AAAAAACTGCAG-TTAGTCTATCCAAATGTCGCGTTC Pstl 

609 Forward CGCGGATCCCATATG-GTTGTGGATAGACTCG BamHI- 
. • Ndel 
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Reverse CCC GCTCGAG -CTGGATTATOATfiTrTfTrr' 

610 Forward CGCG^ATCCCATATG-ATTGGAGGGCTTATGCA 

Reverse CCCGCTCGAG-ACGCTTCAACATCTTTGCC 

611 Forward CGCGGATCCCATWTG-CCGTCTCAAAACGGG 

Reverse CCCGCTCGAG-AACGACTTTGAACGCGCAA 

613 Forward CGCGGATCCCATATG-TCGCGTTCGAGCCG3 

Reverse CCCGCTCGAG-AGCCTGTAAAATAAGCGGC 

614 Forward CGC GGATCCC AT ATG -TCCGTCGTG A nmnr 

Reverse CCCGCTCGAG-CCATACTGCGGCGTTC 
616 Forward AAAAAAGAATTC-ATGTCAAACACAATCAAAATGGTTGTCGG 

Reverse AAAAAATCTAGA-TTAGTCCGGGCGGCAGGCAGCTCG 
619a Forward AAAAAAGAATTC-GGGCTTCTCGCCGCCTCGCTTGC 

Reverse AAAAAACTGCAG-TCATTTTTTGTGTTTTAAAACGAGATA 
622 Forward CGCGGATCCCMATG-GCCGCCCTGCCTAAAG 

Reverse CCCGCTCGAG-TTTGTCCAAATGATAAATCTG 

624 Forward CG CGG ATCCC ATATG -TCCCCGCGCTTTT A CCG 

Reverse CCCGCTCGAG-AGATTCGGGCCTGCGC 

625 Forward CGC GGATCCCATATG -TTTGCA A C.C. AGH A aaatp, 

CCCGCTCGAG-CGGCAAAATTACCGCCTT 

AAAAAAGAATTC-AAAGCAGGCGAGGCAGGCGCGCTGGG 
AAAAA ACTGCAG - 

TTACGAATGAAACAGGGTACCCGTCATCAAGGC 

AAAAAAGGTACC-GCCTTACAAACATGGATTTTGCGTTC 

AAAAAACTGCAG-CTACGCACCTGAAGCGCTGGCAAA 

AAAAAAGAATTC-GCCACCTTTATCGCGTATGAAAACGA 

AAAAAACTGCAG-TTACAACACCGCCGTCCGGTTCAAACC 

AAAAAAGAATTC-GCGGCTTTGGGTATTTCTTTCGG 

AAAAAACTGCAG-TTAGGAGACTTCGCCAATGGAGCCGGG 
AAAAA AGAATTC - 

ATGACCCAGCGACGGGTCGGCAAGCAAAACCG 
AAAAAACTGCAG-TTAATCCACTATAATCCTGTTGCT 
AAAAAAGAATTC-ATGATTGGCGAAAAGTTTATCGTAGTTGG 
AAAAAACTGCAG-TCACGAACCGATTATGCTGATCGG 
CGCGGATCCCATATG-ATGCTTTATTTTGTTCG 

Reverse CCCGCTCGAG-ATCGCGGCTGCCGAC 
642 Forward CGCGGATCCCATATG-CGGTATCCGCCGCAAT 

Reverse CCCGCTCGAG-AGGATTGCGGGGCATTA 



Reverse 
627a Forward 
Reverse 

628 Forward 
Reverse 
629a Forward 
Reverse 
630a Forward 
Reverse 
635 Forward 



638 



Reverse 
Forward 
Reverse 
639 Forward 



Xhol 
BamHI- 
Ndel 
Xhol 
BamHI- 
Ndel 
Xhol 
BamHI- 
Ndel 
Xhol 
BamHI- 
Ndel 
Xhol 
EcoRI 
Xbal 
EcoRI 
PstI 
BamHI- 
Ndel 
Xhol 
BamHI- 
Ndel 
Xhol 
BamHI- 
Ndel 
Xhol 
EcoRI 
PstI 

Kpnl 
PstI 
EcoRI 
PstI 
EcoRI 
PstI 
EcoRI 

PstI 
EcoRI 
PstI 
BamHI- 
Ndel 
Xhol 
BamHI- 
Ndel 
Xhol 
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643 Fonvard CGC GGATCCCATATG -GCTTCGCCGTCGGCAG BamHI- 

Ndel 

Reverse CCC GCTCGAG -AACCGAAAAACAGACCGC Xhol 

644 Forward AAAAA AGAATTC - EcoRI 

ATGCCGTCTGAAAGGTCGGCGGATTGTTGCCC 

Reverse AAAAA ATCTAGA -CTACCCGCAATATCGGCAGTCCAATAT PstI 

645 Forward AAAAAA GAATTC -GTGGAACAGAGCAACACGTTAAATCG Eco RI 
Reverse AAAAA ACTGCAG -CTACGAGGAAACCGAAGACCAGGCCGC PstI 

647 Forward AAAAAA GAATTC -ATGCAAAGGCTCGCCGCAGACGG Eco RI 
Reverse AAAAA ACTGCAG -TTAGATTATCAGGGATATCCGGTAGAA PstI 

648 Forward AAAAA AGAATTC - Eco RI 

ATGAACAGGCGCGACGCGCGGATCGAACG 

Reverse AAAAAA CTGCAG -TCAAGCTGTGTGCTGATTGAATGCGAC PstI 

649 Forward AAAAAA GAATTC -GGTACGTCAGAACCCGCCCACCG Eco RI 
Reverse AAAAAA CTGCAG -TTAACGGCGGAAACTGCCGCCGTC PstI 

650 Forward AAAAAA GAATTC -ATGTCCAAACTCAAAACCATCGC Eco RI 
Reverse AAAAAA CTGCAG -TCAGACGGCATGGCGGTCTGTTTT PstI 

652 Forward AAAAAA GGTACC - Kpn I 

GCTGCCGAAGACTCAGGCCTGCCGCTTTACCG 

Reverse AAAAA ACTGCAG -TTATTTGCCCAGTTGGTAGAATGCGGC Pst I 

653 Forward AAAAAA GAATTC -GCGGCTTTGCCGGTAATTTTCATCGG Eco RI 
Reverse AAAAAA CTGCAG -CTATGCCGGTCTGGTTGCCGGCGGCGA PstI 

656a Forward AAAAAA GAATTC -CGGCCGACGTCGTTGCGTCCTAAGTC Eco RI 

Reverse AAAAA ACTGCAG -CTACGATTTCGGCGATTTCCACATCGT PstI 

657 Forward AAAAAA GAATTC -GCAGAATTTGCCGACCGCCATTTGTGCGC Eco RI 
Reverse AAAAAA CTGCAG -TTATAGGGACTGATGCAGTTTTTTTGC PstI 

658 Forward CGC GGATCCCATATG -GTGTCCGGAATTGTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCAGAATGTTTACCGTT Xhol 

661 Forward AAAAAA GAATTC - EcoRI 
ATGCACATCGGCGGCTATTTTATCGACAACCC 

Reverse AAAAAA CTGCAG -TCACGACGTGTCTGTTCGCCGTCGGGC PstI 

663 Forward CGC GGATCCCATATG -TGTATCGAGATGAAATT BamHI- 

Ndel 

Reverse CCC GCTCGAG -GTAAAAATCGGGGCTGC Xhol 

664 Forward CGC GGATCCCATATG -GCGGCTGGCGCGGT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAATCGAGTTTTACACCAC Xhol 

665 Forward AAAAAA GAATTC -ATGAAATGGGACGAAACGCGCTTCGG Eco RI 
Reverse AAAAA ACTGCAG -TCAATCCAAAATTTTGCCGACGATTTC PstI 

666 Forward AAAAAAGAATTC- AACTCAGGCGAAGGAGTGCTTGTGGC Eco RI 
Reverse AAAAA ATCTAGA -TCAGTTTAGGGATAGCAGGCGTAC Xbal 

667 Forward AAAAAA GAATTC - EcoRI 

CCGCATCCGTTTGATTTCCATTTCGTATTCGTCCG 

Reverse AAAAA ACTGCAG -TTAATGACACAATAGGCGCAAGTC PstI 
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669 Forward AAAAAA GAATTC -ATGCGCCGCATCATTAAAAAACACCAGCC Eco RI 

Reverse AAAAA ACTGCAG -TTACAGTATCCGTTTGATGTCGGC Pst I 

670a Forward AAAAAA GAATTC -AAAAACGCTTCGGGCGTTTCGTCTTC Eco RI 
Reverse AAAAA ACTGCAG - Pst I 

TTAGGAGCTTTTGGAACGCGTCGGACTGGC 

671 Forward CG C GG ATCCC ATATG - ACC AGC AGGGT AAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGCAACTATAAAAACGCAAG Xhol 

672 Forward CGC GGATCCCATATG -AGGAAAATCCGCACC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGGGATAGGCGGTTG Xhol 

673 Forward AAAAAA GAATTC -ATGGATATTGAAACCTTCCTTGCAGG Eco RI 
Reverse AAAAA ACTGCAG -CTACAAACCCAGCTCGCGCAGGAA Pst I 

674 Forward AAAAAA GAATTC -ATGAAAACAGCCCGCCGCCGTTCCCG Eco RI 
Reverse AAAAA ACTGCAG -TCAACGGCGTTTGGGCTCGTCGGG Pst I 

675 Forward CGC GGATCCCATATG -AACACCATCGCCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTCTTCGTCTTCAAACTGT Xhol 
677a Forward AAAAAA GAATTC -AGACGGCATTCCCGATCAGTCGATTTTGA Eco RI 
Reverse AAAAAA CTGCAG -TTACGTATGCGCGAAATCGACCGCCGC Pst I 

680 Forward CGC GGATCCGCTAGC -ACGAAGGGCAGTTCGG BamHI- 

Nhel 

Reverse CCCGCTCGAG-CATCAAAAACCTGCCGC Xhol 

681 Forward AAAAAA GAATTC -ATGACGACGCCGATGGCAATCAGTGC EcoRI 
Reverse AAAAAA CTGCAG -TTACCGTCTTCCGCAAAAAACAGC Pst I 

683 Forward CGC GGATCCCATATG -TGCAGCACACCGGACAA BamHI- 

Ndel 

Reverse CCC GCTCGAG -GAGTTTTTTTCCGCATACG Xhol 

684 Forward CGC GGATCCCATATG -TGCGGTACTGTGCAAAG BamHI- 

Ndel 

Reverse CCC GCTCGAG -CTCGACCATCTGTTGCG Xhol 

685 Forward CGC GGATCCCATATG -TGTTTGCTTAATAATAAACATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CTTTTTCCCCGCCGCA Xhol 

686 Forward CGC GGATCCCATATG -TGCGGCGGTTCGGAAG BamHI- 

Ndel 

Reverse CCC GCTCGAG -CATTCCGATTCTGATGAAG Xhol 

687 Forward CGC GGATCCCATATG -TGCGACAGCAAAGTCCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -CTGCGCGGCTTTTTGTT Xhol 

690 Forward CGC GGATCCCATATG -TGTTCTCCGAGCAAAGAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TATTCGCCCCGTGTTTGG Xhol 

691 Forward CGC GGATCCCATATG -GCCACGGCTT AT ATCCC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TTTGAGGCAGGAAGAAAG Xhol 
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694 


Forward 


CGCGGATCCCATATG-TTGGTTTCCGCATCCGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCTGCGTCGGTGCGGT 


Xhol 


695 


Forward 


CGCGGATCCCATATG-TTGCCTCAAACTCGTCCG 


BamHI- 

UilU. XX 








Ndel 




Reverse 


CCCGCTCGAG-TCGTTTGCGCACGGCT 


Xhol 


696 


Forward 


CGCGGATCCCATATG-TTGGGTTGCCGGCAGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTGATTGCCGCAATGATG 


Xhol 


700a 


Forward 


AAAAAAGAATTC-GCATCGACAGACGGTGTGTCGTGGAC 


EcoRI 




Reverse 


AAAAAACTGCAG-TTACGCTACCGGCACGACTTCCAAACC 


PstI 


701 


Forward 


CGCGGATCCCATATG-AAGACTTGTTTGGATACTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGCCGACAACAGCCTC 


Xhol 


702 


Forward 


AAAAAAGAATTC-ATGCCGTGTTCCAAAGCCAGTTGGATTTC 


EcoRI 




Reverse 


AAAAAACTGCAG-TTAACCCCATTCCACCCGGAGAACCGA 


PstI 


703 


Forward 


CGCGGATCCGCTAGC-CAAACGCTGGCAACCG 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-TTTTGCAGGTTTGATGTTTG 


Xhol 


704a 


Forward 


AAAAAAGAATTC-GCTTCTACCGGTACGCTGGCGCG 


EcoRI 




Reverse 


AAAAAACTGCAG- 


PstI 






TTAGTTTTGCCGGATAATATGGCGGGTGCG 




707 


Forward 


CGCGGATCCGCTAGC-GAAATTATTAACGATGCAGA 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-GAAACTGTAATTCAAGTTGA 


Xhol 


708 


Forward 


CGCGGATCCGCTAGC-CCTTTTAAGCCATCCAAAA 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-TTGACCGGTGAGGACG 


Xhol 


710 


Forward 


CGCGGATCCCATATG-GAAACCCACGAAAAAATC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AACGGTTTCGGTCAG 


Xhol 


714 


Forward 


CGCGGATCCCATATG-AGCTATCAAGACATCTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGGTAGGTAAATCGGAT 


Xhol 


716 


Forward 


CGCGGATCCCATATG-GCCAACAAACCGGCAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTAGAACCGCATTTGCC 


Xhol 


718 


Forward 


CGCGGATCCCATATG-GAGCCGATAATGGCAAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GGCGCGGGCATGGTCTTGTCC 


Xhol 


720 


Forward 


CGCGGATCCCATATG-AGCGGATGGCATACC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTTGCATAGCTGTTGACCA 


Xhol 


723 


Forward 


CGCGGATCCCATATG-CGACCCAAGCCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AATGCGAATCCGCCGCC 


Xhol 
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725 


Forward CGCGGATCCCATATG-GTGCGCACGGTTAAA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTGCTTATCCTTAAGGGTTA 


Xhol 


726 


Forward CGCGGATCCCATATG-ACCATCTATTTCAAAAAC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GCCGATGTTTAGCGTCC 


Xhol 


728 


Forward CGCGGATCCCATATG-TTTTGGCTGGGAACGGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC 


Xhol 


729 


Forward CGCGGATCCCATATG-TGCACCATGATTCCCCA 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-TTTGTCGGTTTGGGTATC 


Hindlll 


731 


Forward CGCGGATCCGCTAGC-GCCGTGCCGGAGG 


BamHI- 






Nhel 




Reverse CCCGCTCGAG-ACGGGCGCGGCAG 


Xhol 


732 


Forward CCGGAATTCTACATATG-TCGAAACCTGTTTTTAAGAA 


EcoRJ- 






Ndel 




Reverse CCCGCTCGAG-CTTCTTATCTTTTTTATCTTTC 


Xhol 


733 


Forward CGCGGATCCCATATG-GCCTGCGGCGGCAA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TCGCTTGCCTCCTTTAC 


Xhol 


734 


Forward CGCGGATCCCATATG-GCCGATACTTACGGCTAT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGAGATTTTGAATCAAAGAG 


Xhol 


735 


Forward CGCGGATCCCATATG-AAGCAGCAGGCGGTCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ATTTCCGTAGCCGAGGG 


Xhol 


737 


Forward CGCGGATCCCATATG-CACCACGACGGACACG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA 


Xhol 


739 


Forward CGCGGATCCCATATG-GCAAAAAAACCGAACA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GAAGAGTTTGTCGAGAATT 


Xhol 


740 


Forward CGCGGATCCCATATG-GCCAATCCGCCCGAAG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AAACGCGCCAAAATAGTG 


Xhol 


741 


Forward CGCGGATCCCATATG-TGCAGCAGCGGAGGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTGCTTGGCGGCAAGGC 


Xhol 


743 


Forward CGCGGATCCCATATG-GACGGTGTTGTGCCTGTT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CTTACGGATCAAATTGACG 


Xhol 


745 


Forward CGCGGATCCCATATG-TTTTGGCAACTGACCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CAAATCAGATGCCTTTAGG 


Xhol 


746 


Forward CGCGGAICCCAIAICj-ICLCjAAAACAAACAAAAC 


BamHl- 
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Ndel 




Reverse CCCGCTCGAG-TTCATTCGTTACCTGACC 


Xhol 


7A7 


Forward CCGGAATTCTAGCTAGC-CTGACCCCTTGGG 


EcoRI- 






Nhel 




x) averse GCCC AAGCTT-TTTTGATTTT AATTGACTAT AGAAC 

JvQ^ y Ov V ,1 V ' V> xx V.VJ V-/ XX X X X X vji x x x x x x xx x x x vji xx-/ x x x x x x v_* x xx x. x — ✓ 


Hindlll 


7AQ 


Forward CGCGGATCCCATATG-TGCCAGCCGCCG 

I* tJI W ft I Vx Vv vJ V-/ VJ VJJTV A >»✓ V-/ V_/x x. x. x x X VJ x VJ V— ' V> VJ V/ Vv VJ 


BamHI- 

x-/ ixixxx xx 






Ndel 




Reverse CCCGCTCGAG-TTTCAAGCCGAGTATGC 


Xhol 




Forward CGCGGATCCCATATG-TGTTCGCCCGxAACCTG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CTTTTTCCCCGCCGCAA 


Xhol 


758 


Forward CGCGGATCCCATATG-AACAATCTGACCGTGTT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TGGCTCAATCCTTTCTGC 


Xhol 


7SQ 


Forward CGCGGATCCGCTAGC-CGCTTCACACACACCAC 

£ VJi. W Ql v»- vy v -i \j x_J a x a. v.*/ x_> x— * x^ a a. x vj \> x— / \j x_> x x x_> -a vv* xx** /a. iwi x v-/ a x x— ' 


BamHI- 

U (XI X 1 M X X 






Nhel 




Reverse CCCGCTCGAG-CCAGTTGTAGCCTATTTTG 


Xhol 


761 


Forward CGCGGATCCCATATG-CTGCCTGAAGCATGGCG 

JL VJ a. W it 1 V* X- > VJ W X^J> V-J a X, X V—/ V— ^ V> A X X. A X A \J X^s A. VJ W X*> X w A 11 X VJ V>1 X X V_-J VJ %»✓ X— J 


BamHI- 

U IXi. -1 XX x X 






Ndel 




Reverse CCCGCTCGAG-TTCCGCAAATACCGTTTCC 


Xhol 


764 


Forward CGCGGATCCCATATG-TTTTTCTCCGCCCTGA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TCGCTCCCTAAAGCTTTC 


Xhol 


765 


Forward CGCGGATCCCATATG-TTAAGATGCCGTCCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ACGCCGACGTTTTTTATTAA 


Xhol 


767 


Forward CGCGGATCCCATATG-CTGACGGAAGGGGAAG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTCTGTACAGCAGGGG 


Xhol 


768 


Forward CGCGGATCCCATATG-GCCCCGCAAAAACCCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTCATCCCTTTTTTGAGC 

^ V. vv V VI O w V^/ vJ X— / X V-/ XJ* A X X—* XXX VV1 X X X-** x«/ X»/ X X X X X X X»J X A \J x.^ 


Xhol 


770 


Forward CGCGGATCCCATATG-TGCGGCAGCGGCGAA 

JL ul W CU Vx V-* VJ V^ VJ VJ Xxx x x x x x n_j x v^* v^* vj x»j v/i x vi v_j x.^ vj x xx x 


BamHI- 

xj iAA A XX X X 






Ndel 




Reverse CCCGCTCGAG-GCGTTTGTCGAGATTTTC 


Xhol 


111 


Forward CGCGGATCCCATATG-TCCGTATATCGCACCTTC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CGGTTCTTTAGGTTTGAG 


Xhol 


772 


Forward CGCGGATCCCATATG-TTTGCGGCGTTGGTGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CAATGCCGACATCAAACG 


Xhol 


774 


Forward CGCGGATCCCATATG-TCCGTTTCACCCGTTCC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TCGTTTGCGCACGGCT 


Xhol 


790 


Forward CGCGGATCCCATATG-GCAAGAAGGTCAAAAAC 


BamHI- 






Ndel 
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Reverse CCC GCTCGAG -GGCGTTGTTCGGATTTCG Xhol 

900 Forward CGC GGATCCCATATG -CCGTCTGAAATGCCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATATGGAAAAGTCTGTTGTC Xhol 

901 Forward CGC GGATCCCATATG -CCCGATTTTTCGATG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAAATGGAACAATACCAGG Xhol 

902 Forward. CCG GAATTC T ACATATG -TTGCACTTTCAAAGGATAATC EcoRI- 

2 Ndel 
Reverse CCCG CTCGAG -AAAAATGTACAATGGCGTAC Xhol 

903 Forward CCG GAATTC TA GCTAGC -CAGCGTCAGCAGCACAT EcoRI- 

Nhel 

Reverse CCC GCTCGAG -GAAACTGTAATTCAAGTTGAA Xhol 

904 Forward AAAAAA GGTACC -ATGATGCAGCACAATCGTTTC Kpn I 
Reverse AAA CTGCAG -TTAATATCGATAGGTTATATG Pst I 

904a Forward AAAAAA GAATTC- CGGCTCGGCATTGTGCAGATGTTGCA Eco RI 
Reverse AA ACTGCAG -TTAATATCGATAGGTTATATG Pst I 

905 Forward CGC GGATCCCATATG -AACAAAATATACCGCATC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CCACTGATAACCGACAGAT Xhol 

907 Forward CGC GGATCCCATATG -GGCGCGCAACGTGAG BamHI- 

Ndel 

Reverse CCC GCTCGAG -ACGCCACTGCCAGCG Xhol 

908 Forward AA AGAATTC -GCAGAGTTAGTAGGCGTTAATAAAAATAC Eco RI 
Reverse AA ACTGCAG -TTAATATGGTTTTGTCGTTCG Pst I 

909 Forward CGC GGATCCCATATG -TGCGCGTGGGAAACTTAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -TCGGTTTTGAAACTTTGGTTTT Xhol 

910 Forward AA AGAATTC -GCATTTGCCGGCGACTCTGCCGAGCG EcoRI 
Reverse AAA CTGCAG -TCAGCGATCGAGCTGCTCTTT Pst I 

911 Forward AAA GAATTC -GCTTTCCGCGTGGCCGGCGGTGC EcoRI 
Reverse AAAAAA CTGCAG -GTCGACTTATTCGGCGGCTTTTTCCGC Pst I 

912 Forward AAAAA AGAATTC - EcoRI 

CAAATCCGTCAAAACGCCACTCAAGTATTGAG 
Reverse AAAAA ACTGCAG -TTACAGTCCGTCCACGCCTTTCGC Pst I 

913 Forward CGC GGATCCCATATG -GAAACCCGCCCCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGGTTGTGTTCCAGGTTG Xhol 

915 Forward CGC GGATCCCATATG -TGCCGGCAGGCGGAA BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGAAAATATAGGTATCAGG Xhol 

914 Forward AA AGAATTC -GACAGAATCGGCGATTTGGAAGCACG Eco RI 
Reverse AAA CTGCAG -CTATATGCGCGGCAGGACGCTCAACGG Pst I 

916 Forward CGC GGATCCCATATG -GCAATGATGGCGGCTG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTGGCGGCATCTTTCAT Xhol 
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917 Forward AAAAAA GAATTC -CCTGCCGAAAAACCGGr A cranr 
Reverse AAAAAACTGCAG-TTATTTCCCCGCCTTCACATCCTG 

919 Forward CG CGGATCCCATATG -TGCCAAAGCAAGAGCATC 

Reverse CCCGCTCGAG-CGGGCGGTATTCGGG 

920 Forward CGCGGArcCCATATG-CACCGCGTCTGGGTC 



921 



Reverse CCCGCTCGAG-ATGGTGCGAATGACCGA 
Forward AAAAAAGAATTC-TTGACGGAAATCCCCGTGAATCC 
Reverse AAAAAACTGCAG-TCATTTCAAGGGCTGCATCTTCAT 

922 Forward. CGCGGArcCGCTAGC-TGTACGGCGATGGAGGC 

2 

Reverse CCCGCTCGAG-CAATCCCGGGCCGCC 

923 Forward CGCGGATCCCATATG-TGTTACGCAATATTGTCCC 

Reverse CCCGCTCGAG-GGACAAGGCGACGAAG 

925 Forward CGC GGATCCC ATATG - AAAC AAATOfTTTT A arm 

Reverse CCCGCTCGAG-GCCGTTGCATTTGATTTC 

926 Forward CGC GGATCCCATATG -TGCGCGC A ATT A CCTC 



927 



Reverse CCCGCTCGAG-TCTCGTGCGCGCCG 
Forward CGCGGATCCCA1ATG-TGCAGCCCCGCAGC 



Reverse 
929a Forward 

Reverse 
93 1 Forward 

Reverse 

935 Forward 

Reverse 

936 Forward 



CCCGCTCGAG-GTTTTTTGCTGACGTAGT 

AAAAAAGAATTC-CGCGGTTTGCTCAAAACAGGGCTGGG 

AAAAAATCTAGA-TTAAGAAAGACGGAAACTACTGCC 

AAAAAAGAATTC-GCAACCCATGTTTTGATGGAAAC 

AAAAAACTGCAG-TTACTGCCCGACAACAACGCGACG 
AAAAAA GAATTC - 

GCGGATGCGCCCGCGATTTTGGATGACAAGGC 

AAAAAACTGCAG-TCAAAACCGCCAATCCGCCGACAC 

CGCGGATCCCATATG-GCCGCCGTCGGCGC 



937 



Reverse CCCGCTCGAG-GCGTTGGACGTAGTTTTG 
Forward AAAAAAGAATTC-CCGGTTTACATTCAAACCGGCGCAAC 
Reverse AAAAAACTGCAG-TTAAAATGTATGCTGTACGCCAAA 
939a Forward AAAAAAGAATTC-GGTTCGGCAGCTGTGATGAAACC 

Reverse AAAAAACTCCAG-TTAACGCAAACCTTGGATAAAGTTGGC 
Forward CGC GGATCCCATATG -GCCAACA A ACCGGC A a a 



950 



Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC 
953 Forward CGC GGATCCCATATG -GCCACCTACA AAGTGGAC 

Reverse CCCGCTCGAG-TTGTTTGGCTGCCTCGAT 
957 Forward CGCGGATCCCATATG-TTTTGGCTGGGAACGGG 



EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Nhel 

Xhol 

BamHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

Xbal 

EcoRI 

PstI 

Eco RI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 
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Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC 

958 Forward CGCGGATCCCATATG-GCCGATGCCGTTGCG 

Reverse GCCCAAGCTT-GGGTCGTTTGTTGCGTC 

959 Forward CGCGGATCCCATATG-CACCACGACGGACACG 

Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA 
961 Forward CGCGGATCCCATATG-GCCACAAGCGACGACG 

Reverse CCCGCTCGAG-CCACTCGTAATTGACGC 

972 Forward AAAAA AGAATTC - 

TTGACTAACAGGGGGGGAGCGAAATTAAAAAC 
Reverse AAAAAATCTAGA-TTAAAAATAATCATAATCTACATTTTG 

973 Forward AAAAAAGAATTC-ATGGACGGCGCACAACCGAAAAC 
Reverse AAAAAACTGCAG-TTACTTCACGCGGGTCGCCATCAGCGT 

982 Forward CGC GGATCCCAT ATG -GCAGC A AAAGACGTAC 

Reverse CCCGCTCGAG-CATCATGCCGCCCATCC 

983 Forward CGCGGATCCCATATG-TTAGCTGTTGCAACAACAC 

Reverse CCCGCTCGAG-GAACCGGTAGCCTACG 

987 Forward CGCGGA1GCCATATG-CCCCCACTGGAAGAAC 

Reverse CCCGCTCGAG-TAATAAACCTTCTATGGGC 

988 Forward CGCGGAjTCCCATATG-TCTTTAAATTTACGGGAAAAAG 

Reverse GCCCAAGCTT-TGATTTGCCTTTCCGTTTT 

989 Forward CCGGAATTCTACATATG-GTCCACGCATCCGGCTA 

Reverse CCCGCTCGAG-TTTGAATTTGTAGGTGTATTGC 

990 Forward. CGCGGATCCGCTAGC-TTCAGAGCTCAGCTT 
2 

Reverse CCCGCTCGAG-AAACAGCCATTTGAGCGA 

992 Forward CGCGGATCCCATATG-GACGCGCCCGCCCG 

Reverse CCCGCTCGAG-CCAAATGCCCAACCATTC 

993 Forward CGCGGATCCCATATG-GCAATGCTGATTGAAATCA 

Reverse CCCGCTCGAG-GAACACATCGCGCCCG 

996 Forward CGCGGATCCCATATG-TGCGGCAGAAAATCCGC 

Reverse CCCGCTCGAG-TCTAAACCCCTGTTTTCTC 

997 Forward CCGGAAJTCTAGGTAGC-CGGCACGCCGACGTT 

Reverse CCCGCTCGAG-GACGGCATCGCTCAGG 



Ndel 

Xhol 

BamHI- 

Ndel 

Hindlll 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

Xbal 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Hindlll 

EcoRI- 

Ndel 

Xhol 

BamHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI- 

Nhel 

Xhol 
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Underlined sequences indicate restriction recognition sites. 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1>: 

gOOl.seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG GTGTCGGCGA ACGAGGTGTC 

51 CGGCAGGGCT TGCGCCCGGA TGGTGCTGGT CATCTGCCAG ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGC TCGGGTACGC ATACTTTACC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTGCG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCGGTTT GTAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCTGAAG CGATGTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCGGCTTCAT CGGGCAGGTG GGACAATACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 2; ORF 001. ng>: 

gOOl .pep 

1 MLPQGKAARR VSANEVSGRA CA RMVLVICO TLPKRDTLNG SGTHTLPVWA 

51 ILPRSLRSKS TIITFSARFF GSVCNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEAMLRKSS GEKHSVHADC PASSGRWDNT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3>: 

mOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGcAssCTT ss . GCTTGGA yGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GATCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCCTCCGCAT CGGGCAGGTG GGACAAGACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 00 1>: 

mOOl .pep 

1 MLPQGKAARR MSANEVCGXL X AWXVLVICO TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PSASGRWDKT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 5>: 

aOOl.seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGCAAGGCT TGGGCTTGGA TGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCTTGTGCAT CGGGCAGGTG GGACAAAACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 6; ORF 001.a>: 

aOOl.pep 

1 MLPQGKAARR MSANEVCGKA W AWMVLVICO TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PCASGRWDKT A* 

mOOl/aOOl 96.2% identity over a 131 aa overlap 

10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 

M M I I I I M I I I M M I II | | | | j | | | I I I I I I I I t I I I I I I I I | | M I 

aOOl . pep MLPQGKAARRMSANEVCGKAWAWMVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
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10 20 30 40 50 60 

70 80 90 100 HO 120 

mOOl.pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 
MINIMI MMIMII ill I I I || I [| || | MM mi M ) in [, , | (||| 

aOOl . pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

70 80 90 100 110 120 

130 

mOOl . pep PSASGRWDKTAX 
I I I I I I I I I II 
aOOl.pep PCASGRWDKTAX 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 001 shows 89.3% identity over a 131 aa overlap with a predicted ORF (ORF OOl.ng) 
from N. gonorrhoeae: 

mOOl/gOOl 



10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
IIIIIIMII:IMMI I M M M M I I I I I I I M I I MM M I I M M M II I 
gOOl MLPQGKAARRVSANEVSGRACARMVLVICQTLPKRDTLNGSGTHTLPVWAILPRSLRSKS 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOOl . pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 
M I I I I I I I I I I : I I I I I I M I I I I I I I I I I I I I I I I I I I I I I : | | M I I I M I I I I I I 
gOOl TIITFSARFFGSVCNSAARRSSCPSPKIGAVPFIGSVLMVPSEAMLRKSSGEKHSVHADC 

70 80 90 100 110 120 

130 

mOOl . pep PSASGRWDKTAX 
I : : I I I I I : I I I 
gOOl PAS SGRWDNTAX 

130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 7>: 

g003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCGTATTC GGTCGCGCTG TTTTGGGTCA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGCTTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAGG TAGTCCTGAT GGTATTCCTC GGCGTCGTAG 

251 AAGTTTTTCA GCGGTTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTGC CGGTGTCGTT ACCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAA TAATATTGCA GGATGTCGTC 

4 51 CAGgCTGagt TTGTCGGCAT CGTaggtcac tTTGACGGTC TCGGCATGAC 

501 CCGTATGGCG GTaggacact tctTCgtanc TcGGGtTTTC CGTGttGCCG 

551 TTGGCgttac cGGATACCGC gtcaACCACG CCGTcgatgc gttggaAATa 

601 ggCTTCCAAg ccccaaaagc agccgccggc gaagtaaatg gtgcccgtgt 

651 tcatgattGC TGa 

This corresponds to the amino acid sequence <SEQ ID 8; ORF 003. ng>: 

g003.pep 

1 MVVFVAEGVF GRAVLGHLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGFG 

51 FARQRFVGFA DVDV AVAVGV FNQVVLMVFL GVVEVFQRFV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG VVH AAAVLRAGVV TLFV EAGRIN DAEIILQDVV 
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151 QAEFVGIVGH FDGLGMTRMA VGHFFVRVFR VAVGVTGYRV NHAVDALEIG 
201 FQAPKAAAGE VNGARVHDC 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 9>: 

m003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGsTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGGG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CAGCkTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTtCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGk CGGCGATGAC GGCTTTTTCG kCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCg . CCGGC GAGGTAAATG GTGCGCGTGT 

651 TCATGATTTT TGA 



This corresponds to the amino acid sequence <SEQ ID 10; ORF 003>: 

m003.pep Length: 221 

1 MVVFVAEGIF GRAVLGNLXL LFGQGAFEFG VTRFFIRCRV EAFALRGGLG 

51 FARQRFVSXA DVDV AVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGXGDD GFFXGVG VVH AAAVLRTGVV ALFV EAGRIN DAEEILQDVV 

151 *AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAXG EVNGARVHDF * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1>: 

a003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATATCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTTCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCCGCCGGC GAGGTAGATG GTGCGCGTGT 

651 TCATGATTTT TGA 

This con-esponds to the amino acid sequence <SEQ ID 12; ORE 003. a>: 

a003 .pep 

1 MVVFVAEGIF GRAVLGNLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGLG 

51 FARQRFVGFA DID VAVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG VVH AAAVLRTGVV ALFV EAGRIN DAEEILQDVV 

151 *AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAAG EVDGARVHDF * 

moo3/a003 95.9% identity over a 220 aa overlap 



10 20 30 40 50 60 

m003.pep MVVFVAEG I FGRAVLGNLXLLFGQGAFEFGVTRFF I RCRVEAFALRGGLG FARQRFVSXA 

N I I I I I I I I I I I I I I I I | | | M I I I I I I I I I I I I || I | i | | ( | | M || | | | | | | : , 
a003 MVVFV AEG I FGRAVLGNLVLLFGQGAFEFGVTRFFIRCRVE A FALRCGLG FARQRFVGFA 
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m003 . pep 
a003 

m003 .pep 
a003 

m003 . pep 
a003 



70 80 90 100 no 120 

WDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGVVH 
I : NillllllllllMI||||||M||||!||||||,||,| M1| |||)M M||| 

DIDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGGGDDGFFGGVGWH 
70 80 90 100 



110 



120 



130 



140 



150 160 170 180 

AAAVLRTGVVALFVEAGRINDAEEILQDVVXAEFVGIVGHFDGFGVARMAVGHVFIARIF 
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AAAVLRTGVVALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 
130 140 150 160 



170 



180 



190 200 210 220 

RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 
MMMMMIIMIIMIIIIIIIIil lli:l!IIM|| 
RVAVGVAG YRVNHAVDALE I GFQAPEAAAGE VDGARVH D FX 



190 



200 



210 



220 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 003 shows 88.6% identity over a 219 aa overlap with a predicted ORF (ORF 003 no) 
from N. gonorrhoeae: 

m003/g003 



m003 . pep 
g003 

m003 .pep 
g003 

m003 . pep 
g003 

m003 . pep 
g003 



10 2 0 30 40 50 60 

MVVFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA 
IIMIIII:|||||ll:j I I I I ! I I I [ I I I I I I I i I | ! | | | | | | ! | 
MVVFVAEGVFGRAVLGHLVLLFGQGAFE FGVTRFFIRCRVEAFALRCG FGFARQRFVGFA 
10 20 30 40 



50 



60 



70 



80 



90 100 HO 120 

DVDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGWH 
1(11111 I M I I I I I ( : | | | | | | : | | | | | | j | , | , | | , , , | I | | | | | | | | | , I 

DVDVAVAVGVFNQVVLMVFLGWEVFQRFVFNNEGQLVFLLLAFEGGGDDGFFGGVGVVH 
70 80 90 100 110 120 

130 140 150 160 170 180 

AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 
I M, : i :l I ;: I I I I I I ; || | If MINI I I i I ! I I I M I I : I : : | | | | | | | 1-1 

AAAVLRAGWTLFVEAGRINDAEIILQDWQAEFVGIVGHFDGLGMTRMAVGHFFV-RVF 
130 140 150 160 170 180 

190 200 210 220 

RVAVGVAG YRVNHAVDALE I G FQAPE AAXGE VNGARVH DFX 
IIINIMIIIIIIIIIIIMiiihii lliiniiii 

RVAVGVTG YRVNHAVDALE I GFQAPKAAAGEVNGARVHDC 
190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEO ID 13>- 

g004.seq 

1 ATGgtagAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GCGCCCATGC CAACAagtga gccaAAtgtT CGGCGGCAGG GCCTacgatT 

101 TCCGCGCCGA TAAagcggcc gGTGgctTTT tcgGCataca ggcgcaTatg 

151 gCCTTTGTTT ACCAgcatca cgcggctgcg accttgaTTT TTGAACGATA 

201 CTTCGCCgaT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCgga ctggtaaACA CCACGCCAAT 

301 GGTgctgcgg cGCAAACCGC TGCCGATATt cgGgtagcgg ccccgcgtta 

351 ttgcccggca atcttacctt ggtcggcggc ttcatGCAGC AGGGGCagtt 

401 ggttggacgc gtcgcccgca ataAAGATAT GCGGAATgct ggtCTGCATg 

4 51 gtCAGCGGAT CGGCAACGGG tacgccgcgc gcgtctttgT CGATATTGAT 

501 GTTTTCCAAA CCGATATtgT CAACGTTCGG ACGGCgACCT ACGGCTGCCA 
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551 A CAT AT ATT C GGCAACAAAT ACGCCTTTTT CGCCATCCTG CTCCCAATGG 

601 ACTtctACAT TGCCGTCTGC GTCGAGTTTG ACCTCGGTTT TAG CAT CC AG 

651 ATGCAGTTTC AATtCtTCTC CGAACACGGC TTTCGCCTCG TCTGAAACAA 

701 CGGGGTCGGA AATGCCGCCG ATGATTCCGC CCAAACCGAA AATTTCAACT 

7 51 TTCACACCCA AACGGTGCAA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 14; ORF 004.ng>: 

g004.pep 

1 MVERHIQHLR NGHLHLMRPC QQVSQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVYQHHAAA TLIFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAN 

101 GAAAQTAADI RVAAPRYCPA ILPWSAASCS RGSW LDASPA IKICGMLVCM 

151 VSGSATGTPR ASLSILMFSK PILSTFGRRP TAANIYSATN TPFSPSCSQW 

201 TSTLPSASSL TSVLASRCSF NSSPNTAFAS SETTGSEMPP MIPPKPKIST 

251 FTPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15>: 

m004.seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCAGG GCCTACGATT 

101 TCCGCGCCGA TAAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTC ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAGAC CGACAAAGCC GATTTGCGGA CTGGTAAACA CCACGCCGAT 

301 GGTGCTGCGC CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCAGCTTC ATGCAGCAGA GGCAGTTGGT 

4 01 TGGACGCATC GCCTGCGATG AAGATATGCG GAATACTGGT CTGCATGGTC 

4 51 AGCGGGTCGG CAACAGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATATT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCCACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCATC GAGTTTGACC TCGGTTTTAG CATCCAGATG 

651 CAGTTTCAAT TCTTCGCCGA ACACGGCGTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATTCCGCCCA AACCGAAAAT TTCAACTTTC 

751 ACGCCCAAAC GGTGCAATGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 16; ORF 004>: 

m004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVHQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAD 

101 GAAPQTAADI RVAAALSPAI LPWSAASCSR GSW LDASPAM KICGILVCMV 

151 SGSATGTPRA SFSILIFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLT SVLASRCSFN SSPNTAFASS ETTGSEMPPM IPPKPKISTF 

251 TPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 17>: 

a004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCCGG ACCTACGATT 

101 TCTGCGCCGA TGAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTT ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCGGA CTGGTGAACA CTACGCCGAT 

301 GGTGCTGCGG CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCGGCTTC ATGCAGCAGG GGCAGTTGGT 

4 01 TGGACGCGTC GCCCGCAATA AAGATATGCG GAATACTGGT CTGCATAGTC 

4 51 AGCGGATCGG CAACGGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATGTT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCTACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCGTC GAGTTTGGCC TCGGTTTTAG CATCCAAATG 

651 CAGTTTCAAT TCTTCACCGA ACACGGCTTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATGCCACCCA AACCGAAAAT TTCAACTTTC 

751 ACGCCCAAAC GGTGCAATGC CTGA 
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This corresponds to the amino acid sequence <SEQ ID 18; ORF 004. a>: 

a004 .pep 

MVERHIQHLR NGHLHLMCPS QQVRQMFGGR TYDFCADEAA GGFFGIQAHM 
AFVYQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGEHYAD 
GAAAQTAADI RVAAALSPAI LPWSAASCSR GSWL DASPAI KICGILVCIV 
SGSATGTPRA SFSILMFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 
STLPSASSLA SVLASKCSFN SSPNTAFASS ETTGSEMPPM MPPKPKISTF 
TPKRCNA* 



1 
51 
101 
151 
201 
251 



moo4/aoo4 94.9% identity over a 257 aa overlap 



m004 .pep 
a004 

m004 . pep 
a004 

m004 .pep 
a004 

m004 .pep 
a004 

m004 . pep 
a004 



10 20 30 40 50 60 

MVERHIQHLRNGHLHLMGPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 

Nil M I I I U Mil Ml M I I I I |:| | I I M II II I M I II II I M : I M I I I 

MVERHIQHLRNGHLHLMCPSQQVRQMFGGRTYDFCADEAAGGFFGIQAHMAFVYQHHAAA 
10 20 30 40 50 ' 60 

70 80 90 100 110 120 

ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAALSPAI 
I I I I M I I I I I I I I U I I I I I I I I I I I I | I : I : | I I | I | | | | | | | [ | { | | | | | j 

ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGEHYADGAAAQTAADIRVAAALSPAI 
70 80 90 100 110 120 

130 1^0 150 160 170 180 

LPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRPT 
' I t ( I I I I r M M I I I I M : I I I I i | | f : M I I I I I M I I J M I I : I f| M M I I I I I I I 
LPWSAASCSRGSWLDASPAIKICGILVCIVSGSATGTPRASFSILMFSKPILSTFGRRPT 

130 140 150 160 170 180 

190 200 210 220 230 240 

AASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPPM 
MUMMMMMiillillliiMllirlMICMMMMMMIIMIMIMM 

AASIYSATNTPFSPSCSQWTSTLPSASSLASVLASKCSFNSSPNTAFASSETTGSEMPPM 
190 200 210 220 230 240 

250 

IPPKPKISTFTPKRCNAX 
: M I f I I I I 1 I I M I I M 
MPPKPKISTFT PKRCN AX 
250 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 004 shows 93.4% identity over a 258 aa overlap with a predicted ORF (ORF 004 .ng) 
from K gonorrhoeae: 

m004/g004 



10 20 30 40 50 60 

m004 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 

I M M I I I I I I I I Ml II M M M M M II M M II M M II M M M II M 

9004 MVERHIQHLRNGHLHLMRPCQQVSQMFGGRAYDFRADKAAGGFFGIQAHMAFVYQHHAAA 

1° 20 30 4 0 50 * 60 

7 0 80 90 100 110 119 

m004 . pep ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAA-LSPA 
: I : I I M I I J II M I II M II M I I I I I I I I I I I M I M : I M II I I II I M I || 
g° 04 TLIFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHANGAAAQTAADIRVAAPRYCPA 

70 80 90 100 110 120 

120 130 140 150 160 170 179 

m004 . pep I^PWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRP 
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1 1 1 1 1 1 1 1 1 1 1 ' I I I :IMI:| JMMMI II I I IN: III: MMI I! MINI 

g004 ILPWSAASCSRGSWLDASPAIKICGMLVCMVSGSATGTPRASLSILMFSKPILSTFGRRP 

130 14 ° 150 160 170 180 

180 190 200 210 220 230 239 

m004.pep TAASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 

HIHIIMIIIIIUM III II | IN Ml I I I I | | | | | | | j | in || Ml || | | Mill 
g004 TAANIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGS^ 

190 200 210 220 230 240 

240 250 
m004 .pep MI PPKPKI STFT PKRCNAX 

I I I I I I I ! I I I I I I I I I I 
g004 MI PPKPKI ST FTPKRCNA 

250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEO ID 19>- 

g005 . seq w 

1 ATGGGGATGG ACAATATTGA TATGTTCATG CCTGAACAAG AGGAAATCCA 

' 51 ATCAATGTGG AAAGAAATTT TACTGAATTA CGGTATTTTC CTGCTCGAAC 

101 TGCTTACCGT GTTCGGCGCA ATTGCGCTGA TTGTGTTGGC TATCGTACAG 

151 AGTAAGAAAC AGTCGGAAAG CGGCAGTGTC GTACTGACAG ATTTTTCGGA 

201 AAATTATAAA AAACAGCGGC AATCGTTTGA AACATTCTTT TTAAGCGAGG 

251 AAGAGACAAA ACATCAGGAA AAAAAAGAAA AGAAAAAGGA AAAGGCGGAA 

301 GCCAAAGCAG AGAAAAAGCG TTTGAAGGAG GGCGGGGAGA AATCTGCCGA 

351 AACGCAAAAA TCCCGCCTTT TTGTGTTGGA TTTTGACGGC GATTTGTATG 

4 01 CACACGCCGT AGAATCCTTG CGTCATGAGA TTACGGCGGT GCTTTTGATT 

4 51 GCCAAGCCTG AAGATGAGGT TCTGCTCAGA TTGGAAAGTC CGGGCGGCGT 

501 GGTTCACGGT TACGGTTTGG CGGCTTCGCA GCTTAGGCGT TTGCGCGAAC 

551 GCAATATTCC GCTGAccgtc gccgTCGATA AGGTCGCGGC AAGCGgcggc 

601 tatatgatgg cgtgtgtgGC GG AT AAAATT GTTTCCGCtc cgtttgcggt 

651 catcggttcg gtgggtgtgg tgGcggaagt gcCGAATATC CAccgCctGT 

701 TGAAAAAACA TGATATTGAT GTGGATGTGA TGACGGCGGG CGAATTTAAG 

751 CGCACGGTTA CTTTTATGGG TGAAAATACG GAAAAGGGCA AACAGAAATT 

801 CCGGCAGGAA CTGGAGGAAA CGCATCAGTT GTTCAAGCAG TTTGTCAGTG 

851 AAAACCGCCC CGGGTTGGAT ATTGAAAAAA TAGCGACGGG CGAGCATTGG 

901 TTCGGCCGGC AGGCGTTGGC GTTGAACTTG ATTGACGAGA TTTCGACCAG 

951 TGATGATTTG TTGTTGAAAG CGTTTGAAAA CAAACAGGtt aTCGAAGTGA 

1001 AATATCAGGA GAAGCGAAGC CTGATCCAGC GCATTGGTTT GCAGGCGGAA 

1051 GCTTCCGTTG AAAAGTTGTT TGCCAAACTT GTCAACCGGC GAGCGGATGT 

1101 GATGTAG 



This corresponds to the amino acid sequence <SEQ ID 20; ORF 005. ng>: 

g005.pep fc 

1 MGMDNIDMFM PEQEEIQSMW KEILLNYG IF LLELLTVFGA IALI VLATVO 

51 SKKQSESGSV VLTDFSENYK KQRQSFETFF LSEEETKHQE KKEKKKEKAE 

101 AKAEKKRLKE GGEKSAETQK SRLFVLDFDG DLYAHAVESL RHEITAVLLI 

151 AKPEDEVLLR LESPGGVVHG YGLAASQLRR LRERNIPLTV AVDKVAASGG 

201 YMMACVAD KI VSAPFAVIGS VGVVAE VPNT HRLLKKHDID VDVMTAGEFK 

251 RTVTFMGENT EKGKQKFRQE LEETHQLFKQ FVSENRPGLD IEKIATGEHW 

301 FGRQALALNL IDEISTSDDL LLKAFENKQV IEVKYQEKRS LIQRIGLQAE 

351 ASVEKLFAKL VNRRADVM* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21>: 

m005.seq ' 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AwAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CACAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 
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301 GCAGAGAAAA A. CGTTTGAA GGAGGGTGGG GAGAAATCTG CCGAAACGCA 

351 nAAATCACGC CTTTTTGTGT TGGANNNNNN NNNNNNNNNN NNNNNNNNNN 

4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGCGAGCGG CGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGCTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTGGTGGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

7 01 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

7 51 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGT A 



This corresponds to the amino acid sequence <SEQ ID 22; ORF 005>: 

m005 .pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVQSK 

51 KQSXSGSVVL TDFSENYKKQ RQSFEAFFLS GEEAQHQEKE EKKKEKAEAK 

101 AEKXRLKEGG EKSAETXKSR LFVLXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXASGGYM 

201 MACVADK IAS APFAIVGSVG VVAEV PNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23>: 

a005.seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AAAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CAAAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 

301 GCAGAGAAAA AGCGTTTGAA GGAGGGTGGG GAGAAATCTT CCGAAACGCA 

351 AAAATCCCGC CTTTTTGTGT TGGATTTTGA CGGCGATTTG TATGCACACG 

4 01 CCGTAGAATC CTTGCGTCAT GAGATTACGG CGGTGCTTTT GATTGCCAAG 

4 51 CCTGAAGATG AGGTTCTGCT TAGATTGGAA AGTCCGGGCG GCGTGGTTCA 

501 CGGTTACGGT TTGGCGGCTT CGCAGCTTAG GCGTTTGCGC GAACGCAATA 

551 TTCCGCTGAC CGTCGCCGTC GATAAGGTGG CGGCGAGCGG TGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGTTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTTGTAGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

7 01 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

751 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGTA 



This corresponds to the amino acid sequence <SEQ ID 24; ORF 005. a>: 

a005.pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVQSK 

51 KQSESGSVVL TDFSENYKKQ RQSFEAFFLS GEEAKHQEKE EKKKEKAEAK 

101 AEKKRLKEGG EKSSETQKSR LFVLDFDGDL YAHAVESLRH EITAVLLIAK 

151 PEDEVLLRLE SPGGVVHGYG LAASQLRRLR ERNIPLTVAV DKVAASGGYM 

201 MACVADK IVS APFAIVGSVG VVAEV PNTHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 
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301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI ORIGLOAEAS 
351 VEKLFAKLVN RRADVM* 

raoos/aoos 79.2% identity over a 366 aa overlap 

10 20 30 



±u Z{J 30 40 50 60 

mOOS . pep MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSWL 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 r 1 1 1 1 j i j i j 1 1 1 1 r j 1 1 mini 

MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSVVL 
10 20 30 40 50 . 60 



a005 



a005 



a005 



a005 



30 
90 



70 80 90 100 110 120 

m005 . pep TDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKXRLKEGGEKSAETXKSR 

"iiiiiiiiiiiiiiiiiiMihiiiinmmmm iiiiimii:ii m 

TDFSENYKKQRQSFEAFFLSGEEAKHQEKEEKKKEKAEAKAEKKRLKEGGEKSSETQKSR 
70 80 90 inn 11Q 120 . 



100 
160 



130 140 150 160 170 inn 

m005 . pep lfvlxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

a005 LFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGVVHGYGLAASQLRRLR 

130 140 15 ° 160 170 180 

190 200 2 10 220 230 240 

m005 . pep XXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 
: 1 " I I I I I I I I I I I : I I I I | | | | | | M I I I I I I I | | | | | | | I I I I I 

ERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 
190 2 »0 210 220 230 240 

250 260 270 280 290 300 

m005 . pep VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 
1111 111111 I Ml II I II Nil | IIIIM II III MM III I II MM III I I II MM 
a005 VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQEVSENRPQLDIEEVATGEHWFG 

250 260 270 280 290 300 

310 320 330 340 350 360 

m005.pep RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 

11 HHIIIIINIIIIIIIIIIIIIINIIIIIIIIIIIIIIIIIIIII 

RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 
310 320 330 340 350 360 



m005.pep 
a005 



RRADVMX 

I II II I I 
RRADVMX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 005 shows 77.0% identity over a 366 aa overlap with a predicted ORF (ORF 005.ng) 
from N. gonorrhoeae: 

m005/g005 



m005 .pep 
g005 

m005 .pep 
g005 



10 20 30 40 50 

MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSV 

I I I I I I I II I I I I I I III I I I II I I I I I I I I II I I II I I I I I I M I II I I || 

MGMDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSV 
10 20 30 40 50 60 

60 7 0 80 90 100 110 

VLTDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKKRLKEGGEKSAETXK 
I I I I I I I I I I I I I I I I I : I II I M : : I I I I : II I I I | II M II II I I I I I II | 

VLTDFSENYKKQRQSFETFFLSEEETKHQEKKEKKKEKAEAKAEKKRLKEGGEKSAETQK 
70 80 90 100 110 120 
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120 130 140 150 160 170 

m005 . pep SRLFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
I I I I I I : 

g005 SRLFVLDFDGDLYAHAVESLRHEITAVLLIAKPE DEVLLRLESPGGVVHGYGLAASQLRR 

130 140 150 160 170 180 

180 190 200 210 220 230 

m005 . pep XXXXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDID 

MMMM l:MIII::IIIIIIMIIIIII|||||||| 

9005 LRERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAVIGSVGVVAEVPNIHRLLKKHDID 

I 90 200 210 220 230 240 

240 250 260 270 280 290 

mOOS.pep VDVMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHW 
I N M ( I I I I I i I I I I I I I i M f I I I I I I I I l l 1 m I i I i I I I I I I I ||tl::|||||| 
9005 VDVMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPGLDIEKIATGEHW 

250 260 270 280 290 300 

300 310 320 330 340 350 

m005.pep FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I : 1 1 1 1 1 j ,,,,,,,,,, 1 ,,, , 

g005 FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKRSLIQRIGLQAEASVEKLFAKL 

310 3 20 330 340 350 360 

360 

mOOS.pep VNRRADVMX 
I I I I I I I I 1 
g005 VNRRADVMX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 25>: 

g006.seq 

1 ATGCTGCTGG TGCTggaatt ttggttCGGc gtGtCGGCGG TGGGCatact 

51 tgCGTTGTTT TTATGGCttt TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC gcgACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GTACCGCCAT TACGGACTGG TTTCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCG 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACGGCA GCGCGGGGCA TATTTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ATGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 

4 51 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 26; ORF 006.ng>: 

g006.pep 

1 MLLVLEFWFG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 

51 KGDERQLYRH YGLVSRLRVL I SNREAFGYL CVGAAMGILF GFAFVM MTLK 

101 GYGSAGHIYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 27>: 

m006.seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG ' TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACC GGCGGCAGCT GTACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

2 51 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 
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351 GTTTGCCATG AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 
4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 
4 51 GCCGGAACTTGA 



This corresponds to the amino acid sequence <SEQ ID 28; ORF 006>: 

m00 6 .pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 

51 KGDRRQLYRH YGLLARLRVL ISNREAFGYL CVGTAMGILF GFAFVM MTLK 

101 GYSSAGHVYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29>: 

a00 6.seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAAGAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GGACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATA AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGAAACG GAACATCAAA 

4 51 GCCGGAACTT GA 



This corresponds to the amino acid sequence <SEQ ID 30; ORF 006.a>: 

a006.pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLKN SLERDNHFIR 

51 KG DERQLDRH YGLLARLRVL ISNREAFGY L CVGTAMGILF GFAFVM MTLK 

101 GYSSAGHVYS VGTYLWMFAI SLDDVPRLVE QYSNLKDIGQ RIEWSKRNIK 

151 AGT* 



m006/a006 96.7% identity over a 153 aa overlap 

10 20 30 40 50 60 

m006.pep MLLVLEFWVGVSAVGILALFLWLLPRFAAI SENLYFRLNN SLERDNHFIRKGDRRQLYRH 

M I I I i I I I T I I I I I I I I I I M I I ! I I I I I | | | | | | | | : | [ | M I i I 1 I I 1 I I : I I I II 
a006 MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERDNHFIRKGDERQLDRH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m006.pep YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYS VGTYLWMFAM 

M I I I I I I I I M I I I I I I I I II II I I I I I I I I I | | | | | | | | | | || | | | | | | | | |: 

a006 YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYS VGTYLWMFAI 

7 0 80 90 100 110 120 

130 140 150 

m006.pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I I I I I I I I I I I I I I I I I I I I I I I I : M I I I I I I 
a006 SLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 

130 140 150 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 006 shows 95.4% identity over a 153 aa overlap with a predicted ORF (ORF 006.ng) 
from N. gonorrhoeae: 

m006/g006 

10 20 30 40 50 60 

m006 . pep MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 
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g006 



IIIIIMI lllll I Mill MMIIIMMMMM I MIIIIMM I MIIHMIII 

MLLVLEFWFGVSAVGI LALFLWLLPRFAAI SENLYFRLNNSLERDNHFI RKGDERQLYRH 
10 20 30 40 50 60 



70 80 90 100 110 120 

m006 pep YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAM 

|||::|||||||||||!lllll|:|||!lllllllllllllhlll|:|IIIIIIIIIM 
g00 6 YGLVSRLRVLI SNREAFGYLCVGAAMGI LFGFAFVMMTLKGYGS AGHI YS VGTYLWMFAM 

70 80 90 100 110 120 

130 140 150 

tn006 . pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

I II I M 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g 1 1 8 SLDDVPRLVEQYSNLKD I GQRI EWSERNI KAGT 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31>: 

g006-l.seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCAAA ACCCACCGCA AG CG ATT GAT 

51 TGGCACATTT TCCCCGGTCG GACTGGAAAA CCTTTTGATG CTGGGGTATC 

101 CGGTGTTTGG CGGCTGGGCG ATTAATGCCG TGATTGCGGG GAGGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTATTT TTGATGTGGC TGGTCGGTGC 

201 GGCACGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTTGTGTTG GAACAACGGC AGCGGCAAGT CCCGCATTCA 

301 GCGGTAACTG CACGGGTTGC CCTGTCGCGT GAATTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

401 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACGAGCGG CAGCTGTACC GCCATTACGG ACTGGTTTCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCGCGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CGGCAGCGCG GGGCATATTT ATTCGGTCGG CACTTATCTG 

7 51 TGGATGTTTG CCATGAGTTT GGACGATGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 32; ORF 006-l.ng>: 

g006-l .pep 

1 MWKMLKHIAK THRKRLIGTF SPVGLENLLM LGYPVFGGWA INAVIAGRVW 

51 Q ALLYALVVF LMWLVGAA RR IADTRTFTRI YTEIAVPVVL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PIA ATSVVSI FGACIMLLVL E FWVGVSAVG 

151 ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDER QLYRHYGLVS 

201 RLRVLISNRE AFGY LCVGAA MGILFGFAFV M MTLKGYGSA GHIYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33>: 

m006-l.seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCCAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCTGGTCG GACTGGAAAA CCTTTTGATG CTGGTGTATC 

101 CGGTGTTTGG CGGCCGGGCG ATCAATGCCG TGATTGCGGG GGAGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTGCTT TTGATGTGGC TGGTCGGTGC 

201 GGTGCGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

2 51 TCGCCGTGCC GGTCGTGTTG GAACAGCGGC AGCGACAAGT CCCGCATTCG 

301 GCGGTAACTG CGCGGGTTGC CCTGTCGCGT GAGTTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACCGGCGG CAGCTGTACC GCCATTACGG ACTGCTTGCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 
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651 CGGCACGGCG 

701 TCAAAGGCTA 

7 51 TGGATGTTTG 
801 TTCCAATTTG 

8 51 TCAAAGCCGG 



ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 
CAGCAGCGCG GGGCATGTCT ATTCGGTCGG CACTTATCTG 
CCATGAGTTT GGACGACGTG CCGCGATTGG TCGAACAATA 
AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 
AACTTGA 



This corresponds to the amino acid sequence <SEQ ID 34; ORF 006-l>: 

m006-l .pep 

1 MWKMLKHIAQ THRKRLIGTF SLVGLENLLM LVYPVFGGRA INAVIAGEVW 
Q ALLYALVVL LMWLVGAV RR IADTRTFTRI YTEIAVPVVL EQRQRQVPHS 
AVTARVALSR EFVSFFEEHL PI AATSVVSI FGACIMLLVL E FWVGVSAVG 
ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDRR QLYRHYGLLA 
RLRVLISNRE AFGY LCVGTA MGILFGFAFV M MTLKGYSSA GHVYSVGTYL 
WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 



51 
101 
151 
201 
251 



moo6-i/goo6-i 95.5% identity in 288 aa overlap 



10 20 30 40 50 60 

mO 06-1. pep MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
I I I I I I I I I = I I I I I I I I I I I I I I I I I I I I MINI I I I ! I I I I : I I I I | | | | | I | : 
g006-l MWKMLKHIAKTHRKRLIGTFSPVGLENLLMLGYPVFGGWAINAVIAGRVWQALLYALWF 

10 20 30 40 50 60 

7 0 80 90 100 110 120 

m006-l.pep LMWLVGAVRR I ADTRT ETRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
I I I i M I : M I I I I I I I M ! I M II I I II I I I I I I I I I I M I f I I M I I I M I I M I I I I 
g006-l LMWLVGAARRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m006-l.pep P I AATS VVS I FGAC IMLLVLE FWVGVSAVG I LALFLWLLPRFAAI SENLYFRLNNS LERD 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | t 
g006-l PIAATSVVS I FGACIMLLVLEFWVGVSAVGI LALFLWLLPRFAAI SENLYFRLNNSLERD 

130 140 150 160 170 180 

190 200 210 220 230 240 

m006-l.pep NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 

II M II II: ! I II I Ml I:: I M ! I Ml I I I I t II M f:| II I | | | | | | | M | I II I: M 
g006-l NHFIRKGDERQLYRHYGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSA 

190 200 210 220 230 240 

250 260 270 280 289 

m006-l.pep GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I : M I I I I I M II I I M I I I II I I M I I M I M I I I I I M I I I I I I I I 
g006-l GHIYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35>: 

a006-l.seq (partial) 

1 . . AGCCAAAACC ACCGCAAGCG ATTGATTGGC ACATTTTTTC TGGTCGGACT 

51 GGAAAACCTT TTGATGCTGG TGTATCCGGT GTTTGGCGGC TGGGCGATTA 

101 ATGCCGTGAT TGCGGGGCAG GCGTGGCAGG CGTTGCTGTA CGCTTTGGTT 

151 GTGCTTTTGA TGTGGCTGGT CGGTGCGGCG CGGCGGATTG CCGATACGCG 

201 CACGTTTACG CGGATTTATA CCGAAATCGC CGTGCCGGTT GTGTTGGAAC 

251 AGCGGCAGCG GCAAGTCCCG CATTCGGCGG TAACTGCGCG GGTTGCCCTG 

301 TCGCGTGAGT TTGTCAGCTT TTTTGAAGAA CACCTGCCGA TTGCCGCGAC 

351 ATCCGTCGTA TCCATATTCG GCGCGTGCAT CATGCTGCTG GTGCTGGAAT 

4 01 TTTGGGTCGG CGTGTCGGCG GTGGGCATAC TTGCGTTGTT TTTATGGCTT 

4 51 TTGCCACGTT TTGCCGCCAT CAGCGAAAAC CTGTATTTCC GCCTGAAGAA 

501 CAGCTTGGAA CGCGACAACC ACTTTATCCG AAAAGGCGAC GAGCGGCAGC 

551 TGGACCGCCA TTACGGACTG CTTGCGCGCC TGCGTGTGCT GATTTCCAAC 

601 CGCGAAGCCT TCGGCTATCT CTGCGTCGGC ACGGCGATGG GTATTTTGTT 

651 CGGCTTTGCT TTTGTGATGA TGACGCTCAA AGGCTACAGC AGCGCGGGGC 
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701 ATGTCTATTC GGTCGGCACT TATCTGTGGA TGTTTGCCAT AAGTTTGGAC 
7 51 GACGTGCCGC GATTGGTCGA ACAATATTCC AATTTGAAAG ACATCGGACA 
801 ACGGATAGAG TGGTCGAAAC GGAACATCAA AGCCGGAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 36; ORF 006-1. a>: 



a006-l 



pep 
1 



51 
101 
151 
201 
251 

a006-l/m006-l 

a006-l . pep 
m006-l 



a006-l .pep 



m006-l 



a006-l.pep 



m006-l 



a006-l . pep 



m006-l 



a006-l.pep 



m006-l 



(partial) 

. SQNHRKRLIG TFFLVGLENL LMLVYPVFGG WAINAVIAGQ AW QALLYALV 
VLLMWLVGAA RRIADTRTFT RIYTEIAVPV VLEQRQRQVP HSAVTARVAL 

SREFVSFFEE HLPI AATSW SIFGACIMLL VL E FWVGVSA VGILALFLWL 

LPRFAAISEN LYFRLKNSLE RDNHFIRKGD ERQLDRHYGL LARLRVLISN 

REAFG YLCVG TAMGILFGFA FVM MTLKGYS SAGHVYSVGT YLWMFAISLD 

DVPRLVEQYS NLKDIGQRIE WSKRNIKAGT * 

95.7% identity in 280 aa overlap 

10 . 20 30 40 50 

SQNHRKRLIGTFFLVGLENLLMLVYPVFGGWAINAVIAGQAWQALLYALWL 

:I:IMMIIII MM! IIMIM I I I I I I I I :: I I I I I I I I I I I 

MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
10 20 30 40 50 60 

60 70 80 90 100 110 

LMWLVGAARRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSRE FVSFFEEHL 
' HMIhll Mil UNI IIIIMMIIMI I Ml I IIMMUIM || | | | Mum 
LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSRE FVSFFEEHL 
70 80 90 100 110 120 

120 130 140 150 160 170 

PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERD 
M M II I II M I I II I I I I I I M I M I i I M II I II I I M M I I I I M I I M M M II I I 

PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 
130 140 150 160 170 180 

180 190 200 210 220 230 

NHFIRKGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
MMMIhlM M M I I I I M II M I M I II M I I I I I I M M M II M M I M M M 
NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
190 200 210 220 230^ 240 

240 250 260 270 280 

GHVYSVGTYLWMFAISLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 
I M I I I I I I M i M M I M M M I I M M I I I I M I I I M : I I M I I II 
GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 37>: 

g007.seq 

1 atgaACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGcgC 

51 CGCcGCTTCT GCCGccgaca acAGCatcat gaCaAAAGGG CAAAAAGTGT 

101 ACGAATCcAa ctGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGtTTC CTccgctTTT CCggtcgGac tgtattatga acaAACCGCa 

201 cgTCCtgctg cacagcatgg tcaaaggcAt cgacgggaca ttcaaagtgg 

251 agcggcaaaa cctacgacgg atttatgCcc gcaaccgcca tcagcgATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 38; ORF 007.ng>: 

g007.pep 

1 MNTTRLPTAF ILCCLCAAAS A ADNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD CIMNKPHVLL HSMVKGIDGT FKVERQNLRR IYARNRHQRC 

101 GHCRRRHLYH ERL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 39>: 
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m007 . seq 

1 
51 
101 
151 
201 



ATGAACACAA 
CGCCGCTTCT 
ACGAATCCAA 
ACCATGTTTC 
GGTGCTGCTG 
251 AACGGCAAAA 
301 GGACATTGCC 



CCCGACTGCC 
GCCGCCGACA 
CTGCGTCGCC 
CGCCGCTCTA 
CACAGCATGG 
CCTACAACGG 
GCCGTCGCCA 
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GACCGCCCTC 
ACAGCATCAT 
TGCCACGGCA 
CCGCTCCGAC 
TCAAAGGCAT 
ATTCATGCCC 
CTTATATCAT 



GTCTTGGGCT 
GACAAAAGGG 
AAAAGGGCGA 
TTCATCATGA 
CAACGGTACA 
GCAACCGCCA 
GAACGCCTTT 



GCTTCTGCGC 
CAAAAAGTGT 
AGGCCGCGGA 
AAAAACCGCA 
ATCAAAGTC . 
TCAGCGATGC 
GA 



This corresponds to the amino acid sequence <SEQ ID 40; ORF 007>: 

m007 .pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARNRHQRC 

101 GHCRRRHLYH ERL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41>: 

a007.seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA 

251 AACGGCAAAA CCTACAACGG ATTCATGCCC GCCACTGCCA 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT 



GCCTCTGCGC 
CAAAAAGTGT 
AGGCCGCGGA 
AAAAACCGCA 
ATCAAAGTC . 
TCAGCGATGC 
GA 



This corresponds to the amino acid sequence <SEQ ID 42; ORF 007.a>: 

a007 .pep 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARHCHQRC 

101 GHCRRRHLYH ERL* 

moo7/aoo7 97.3% identity over a 1 13 aa overlap 

10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I i | | | | | | | | | | | | | | | j | | | | | | | | | | 
a007 MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

70 80 90 100 110 

m007.pep FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I : M I I I I I I I I I I I I | | I I 
a007 FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARHCHQRCGHCRRRHLYHERLX 

70 80 90 100 110 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 007 shows 86.7% identity over a 1 13 aa overlap with a predicted ORF (ORF 007.ng) 
from N. gonorrhoeae: 



m007/g007 

m007 .pep 
g007 



10 20 30 40 50 60 

MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
MINIMI::! I : i M I I II M I I I I I I f I I I I I I I : I I I I i I I II 1 I I lllhlil 
MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 



10 



20 



30 



40 



50 



60 



m007 . pep 



70 80 90 100 110 

FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
M:|I:|IMIMIII:||:|| I I I I : I I : I I I I I I I I I I I I I I M I I I I I 
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g007 CIMNKPHVLLHSMVKGIDGTFKVERQNLRRIYARNRHQRCGHCRRRHLYHERL 

70 80 90 100 110 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 43>: 

g007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGTTTC CTCCGCTTTT CCGGTCGGAC T AT AT TAT G A ACAAACCGCA 

201 CGTCCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT AC C GAAAAAG ACGTAAAACA GGCAAAAGGC AAAAAAAAC . 

This corresponds to the amino acid sequence <SEQ ID 44; ORF 007-Lng>: 

g007-l .pep (partial) 

1 MNTTRLPTAF ILCCLCAAAS A ADNSIMTKG QKVYESNCIA CHGKKGEGRG 
51 TAFPPLFRSD YIMNKPHVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKG KKN... 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 45>: 

m007-l .seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 
201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 
251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 
351 CGGAAGCGTT AC C GAAAAAG ACGTAAAACA GGCAAAAAGC AAAAAAAACT 
401 AA 

This corresponds to. the amino acid sequence <SEQ ID 46; ORF 007- 1> 

m007-l .pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKS KKN* 

m007-i / g007-i 91 .7% identity in 133 aa overlap 

10 20 30 40 50 60 

m007-l .pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I I I 1 I I I I I : I I M I I i I I I I I I I I I I I I I I I t : I I I I I I I I I I I I I j I I : I I I 
g007-l MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 



70 80 90 100 110 120 

m007-l .pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
: I I : I I : I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I t I I i I I I I I I I I I I I II I I i 
g007-l YIMNKPHVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 



130 

m007-l . pep TEKDVKQAKS KKNX 

I I I I I I I I I : II I 
g0 0 7 - 1 TEKDVKQAKGKKN 

130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47>: 

a007-l . seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 



r 
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151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA 



ttcatcatga 
caacggtaca 
ccactgccat 
aacgcctttg 
ggcaIaaaaac 



AAAAACCGCA 
ATCAAAGTCA 
CAGCGATGCG 
ACAACGGCGG 
AAAAAA. . 



This corresponds to the amino acid sequence <SEQ ID 48; ORF 007-1. a>: 

a007-l.pep (partial) 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKN KK. . 

moo7-i/aoo7-i 98.5% identity in 132 aa overlap 

10 20 30 40 50 60 

m007~l . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I t I I I ! M I I i I I I : I I I I I I I I I I j I I | | | | | | | | | | | | | | I | | | | | | | | | | | | | | | | | 
aO 0 7 - 1 MNTTRLPTALVLGCLCAAASAADNS IMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

™ 80 90 100 110 120 

m007-l.pep FIMKKPQVLLHSMVKGINGT IKVNGKTYNG FMPATAISDADIAAVATYIMNAFDNGGGSV 
I M I II II I II I I I I I I I I I I I I I I I I I I I I I | | | | | | | i | | M I I i I I I I I I I | I I | M 
a007-l FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 

130 

m007-l .pep TEKDVKQAKSKKNX 

111111111:11 
a007-l TEKDVKQAKNKK 

130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 49>: 

g008.seq 

1 ATGAACAACA GACATTTTGC CGTCAtcgCC TTGGGCAGCA ACCTTGACAA 

51 CCCCGCACAA CAAATacgcg gcgcattaga cgcgctctcg tcccatcctg 

101 acatccggct tgaaCaggtt tcctcactgt aTatgaccgc acctgtcggt 

151 tacgAcaaTC agcccgATTT CATCaatgcc gTCTgcaccg TTTCCACCAC 

201 CtTGGACGGC ATTGcccTGC TTGCCgaACT CAAccgTATC GAAGCCGATT 

251 TCGGACGCGA aCGCAGTTTC CGCAATGCAC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGCC TTACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAATACG GAAAGGTTGT CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGACA GGTAA 

This corresponds to the amino acid sequence <SEQ ID 50; ORF 008.ng>: 

g008.pep 

1 MNNRHFAVIA LGSNLDNPAQ QIRGALDALS SHPDIRLEQV SSLYMTAPVG 

51 YDNQPDFINA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKYGKVVELS 

151 KRLGNQGIRL LPDR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 51>: 

m008 .seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCTCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT TGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 TCTGGACGGC ATTGCCyTGC TTGCCGAACT CAACCGTATC GAGGCTGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GkATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACACsCGAC TcACCtTGCC 
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351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATCCGCCCT TTGGCAGAAA 
4 01 TCCTCCCTGA TTTTGTTTTA GGAAAACACG GAAAGGTTGC CGAATTGTCA 
4 51 AAACGGyTGG GCAATCAAGG TATCCGTCTT TTACCGGACA GGTAATT 



This corresponds to the amino acid sequence <SEQ ID 52; ORF 008>: 

m008 .pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLXLD 

101 IIDFDGISSD DTRLTLPHPR AHERSFVIRP LAEILPDFVL GKHGKVAELS 

151 KRLGNQGIRL LPDR* 



The following partial DNA sequence was identified in N. meningitidis<SEQ ID 53>: 

a008. seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCCCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT CGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 CTTGGACGGC ATTGCCCTGC TTGCCGAACT CAACCGTATC GAAGCCGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGAC TCACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAACACG GAAAGGTTGC CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGATA AGTAA 

This corresponds to the amino acid sequence <SEQ ID 54; ORF 008. a>: 

a008.pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKHGKVAELS 

151 KRLGNQGIRL LPDK* 



moo8/a008 97.6% identity over a 164 aa overlap 

10 20 30 40 50 60 

m008.pep MNNRHFAV I ALGSNLENPAQQVRAALDTLSSHPD I RLKQASSLYMTAPVG YDNQPDFVNA 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I | | | | | | 
a008 MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVG YDNQPDFVNA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m008.pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 

II I I I I I I I I I I I I I I I I I | | | | | I I I I I I I I I I I I I ! I I I I I i | | | | 

a008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 

70 80 90 100 110 120 

130 140 150 160 

m008.pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 
I I I I I I I M I II I I I I I I : I I I I | | | | | | | | | | | | | | | | | | | | : | 
a008 AHERSFVIRPLAEILPDFILGKHGKVAELSKRLGNQGIRLLPDKX 

130 140 150 160 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 008 shows 92.7% identity over a 164 aa overlap with a predicted ORF (ORF008.ng) 
from N. gonorrhoeae: 

m008/g008 



10 20 30 40 50 60 

m008 . pep MNNRHFAVI ALGSNLENPAQQVRAALDTLSSHPD I RLKQASSLYMTAPVG YDNQPDFVNA 

I I I I I I I I I I I N I I : I I I I I : | : | | | : | | | | | | | | | : | : | | | | | | | | | j I I : If 

900B MNNRHFAVIALGSNLDNPAQQIRGALDALSSHPDIRLEQVSSLYMTAPVGYDNQPDFINA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m008 . pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 
I I I I I I I i I M I I I I I I I ! I I I I I I I I I I ! I I I I I I I I I I I 1 I I | M I I I I I I I I I I I 

g008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 

70 80 90 100 110 120 

130 140 150 160 

m008.pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 
I I M I I M I M I I I M I I: I i I: I ||:| M | | M II I I M I 1 II I 
g008 AHERSFVIRPLAEILPDFILGKYGKWELSKRLGNQGIRLLPDRX 

130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 55>: 

g009 . seq 

1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AG AG AT AG CC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AaaaGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 56; ORF 009.ng>: 

g009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARNQSVMAV 
51 QLPLVAFSDK VWAFQAVVQ AEIQVFA DGG KTWQKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 57>: 

m009.seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC CGGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

2 01 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 58; ORF 009>: 

m009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPPVAFSDK VWAFQAVVQ AEIQVFA DGG KTWQKP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 009 shows 97.7% identity over a 86 aa overlap with a predicted ORF (ORF 009.ng) 
from TV. gonorrhoeae: 

ro009/g009 

10 20 30 40 50 60 

m009.pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I i I I I M : | | I | | | | | | | | | | 1 | 
g009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARNQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 80 
m009 . pep VVVAFQAVVQAEIQVFADGGKTWQKPX 
I I I I I I I I I I I I II II I I I I I I | I | I I 
g009 VVVAFQAVVQAEIQVFADGGKTWQKPX 

70 80 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 59>: 

a009.seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CAT CATC AC A AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTC 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTCTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 60; ORF 009. a>; 

a009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPLVAFSDK VWAFQAVLQ AEIQVFA DGG KTWQKP* 

m009/a009 97.7% identity over a 86 aa overlap 

10 20 30 40 50 60 

m009 . pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
I I I t I I I I I I ! I I I I 1 I I I I I I I I I I I I I | | | | ( | | ! | | | | | | | | | | | | | | | | nun 
a009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 80 
VVVAFQAVVQAEIQVFADGGKTWQKPX 
I I I I I I I I : ! I I I I I I I I I I I I I I | 1 I 
VVVAFQAVLQAEIQVFADGGKTWQKPX 
70 80 



m009.pep 
a009 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 61>: 

g010 . seq 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAAC AT GG TAAACGTGCG GTAGAACGTG 

4 01 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGT.TT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAacgc 

801 cgacggcgaA cgcTTTATGG AAcgctatgc GCcgACCGta aAagaCTTGG 

851 CTTCTCGCga cgtGGTTTCA CgcgcGatgG CGatggaAAt ctatgaaggt 

901 cgcggctgTG GtaaAAAcaA agaCCacgtC TTACTGAAAA TCGACcAtAt 

951 cggtGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCagtttgc cGGTATCGAT CCGATTAAAG ACCCGATTcc ggttgTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCcg aCCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

■ 1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 

1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 62; ORF 010.ng>: 

gOlO.pep 
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1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLWF RPTPR* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 63>: 

mOlO.seq (PARTIAL) 

1 . . nTCCAATTAT CCAAATCCGG TCTGAATTGT GCCGTTTTGT CTAAAGTGTT 

51 CCCGACCCGT TCGCATACCG TAGCGGCGCA GGGCGGTATT TCCGCCTCTn 

101 TGGGTAATGT GCAGGAAGAC CGTTGGG ACT GGCACATGTA CGATACCGTG 

151 AAAGGTTCCG ACTGGTTGGG CGACCAAGAT GCGATTGAGT TTATGTGCCG 

201 CGCCGCGCCT GAAGCCGTAA TTGAGTTGGA ACACATGGGT ATGCCTTTTG *i 

251 ACCGTGTGGA AAGCGGTAAA ATTTATCAGC GTCCTTTCGG CGGCCATACT 

301 GCCGAACACG GTAAACGCGC GGTAGAACGC GyCTGTGCGG TTGCCGACCG 

351 TACAGGTCAT GCGATGCTGC ATACTTTGTA CCAACAAAAC GTCCGTGCCA 

4 01 ATACGCAATT CTTTGTGGAA TGGACGGCAC AAGATTTGAT TCGTGATGAA 

4 51 AACGGCGATG TCGTCGGCGT AACCGCCATG GAAATGGAAA CCGGCGAAgT 

501 TTATATTTTC CACGCTAAAG CTGTGATGTT TGCTACCGGC GGCGGCGGTC 

551 GTATTTATGC GTCTTCTACC AATGCCTATA TGAATACCGG CGATGGTTTG 

601 GGTATTTGTG CGCGTGCAGG TATCCCGTTG GAAGACATGG AATTCTGGCA 

651 ATTCCAGCCG ACCGGCGTGG CGGGTGCGGG CGTGTTGATT ACCGAA. 

This corresponds to the amino acid sequence <SEQ ID 64; ORF 010>: 

mOlO.pep (PARTIAL) 

1 . . XQLSKSGLNC AVLSKVFPTR SHTVAAQGGI SASXGNVQED RWDWHMYDTV 

51 KGSDWLGDQD AIEFMCRAAP EAVIELEHMG MPFDRVESGK IYQRPFGGHT 

101 AEHGKRAVER XCAVADRTGH AMLHTLYQQN VRANTQFFVE WTAQDLIRDE 

' 151 NGDVVGVTAM EMETGEVYIF HAKAVMFATG GGGRIYASST NAYMNTGDGL 

201 GICARAGIPL EDMEFWQFQP TGVAGAGVLI TE . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65>: 

aOlO.seq 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
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1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA 

14 01 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

17 01 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGA 

This corresponds to the amino acid sequence <SEQ ID 66; ORF 010.a>: 

a010 .pep 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY* 

moio/aoio 98.7% identity over a 23 1 aa overlap 

10 20 30 

mOlO .pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml 

a010 MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO.pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
M I I M II I I I I I I I I I I I I I I I I I I I I I I I | | | | I | | | | | | j | | | | | | | j | | | | | | | , | 
aOlO QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDL I RDENGDVVGV 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | ) | | | | | | | | | | j | | | | | | | | | 
aOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO . pep TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

I I M I I I I I ! I I I i ! I I I I I I I I I I I I ! I | | | | | l | | | | | j | | || | | | j | || | | | | | | | | 
aOlO TAMEMETGE VYI FHAKAVMFATGGGGRI YAS STNAYMNTGDGLGI CARAGI PLE DMEFWQ 

190 200 210 220 230 240 

220 230 
mOlO . pep FQPTGVAGAGVLITE 
I : I I I I I I I I I I I I I 

aOlO FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG 

250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 010 shows 98.7% identity over a 231 aa overlap with a predicted ORF (ORF OlO.ng) 
from N. gonorrhoeae: 

m010.pep/g010 .pep 

10 20 30 

XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 
I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 
10 20 30 40 50 60 

40 50 60 70 80 90 

QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i | j | | f | | | 
QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
70 80 90 100 110 120 

100 110 120 130 140 150 

GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 
I I I M I I 1 I I I I I I I I I I I I I I I I I I I I I I || | | M | | | | | | | | | | | | | | | | | | | | | | | 
GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 
130 140 150 160 170 180 

160 170 180 190 200 210 

TAMEMETGEV Y I FHAKAVMFATGGGGRI YAS STNAYMNTGDGLG I C ARAG I PLE DME FWQ 
I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I | I | I | | | | | | | | | | | || | | | | | | | | | | 
TAMEMETGEVY I FHAKAVM FATGGGGR I YAS S TN AYMNTG DGLG I CARAG I PLE DME FWQ 
190 200 210 220 230 240 

220 230 
FQ PTG VAGAG V LITE 
I : I I I I I I I I II I I I 

FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 
250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 67>: 

g010-l . seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

4 01 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAACGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTGGTTTCA CGCGCGATGG CGATGGAAAT CTATGAAGGT 

901 CGCGGCTGTG GTAAAAACAA AGACCACGTC TTACTGAAAA TCGACCATAT 

951 CGG TGCAGAA AAAATT AT GG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTCC GGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 



mOlO . pep 
gOlO 

mOlO .pep 
gOlO 

mOlO.pep 
gOlO 

mOlO . pep 
gOlO 

mOlO.pep 
gOlO 
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1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 68; ORF 010-1. ng>: 

g010-l.pep 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF RPTPR* 

g010-l / P10444 

sp| P10444 |DHSA_ECOLI SUCCINATE DEHYDROGENASE FLAVOPROTEIN SUBUNIT 

gnl | PID| dl015210 (D90711) Succinate dehydrogenase, flavoprotein [Escherichia coli] gi | 1786942 
(AE000175) succinate dehydrogenase flavoprotein subunit [Escherichia coli] Length =588 

Score = 1073 (495-6 bits), Expect = 6.7e-169, Sum P{2) = 6.7e-169 

Identities = 191/303 (63%) , Positives = 238/303 (78%) 



Query : 


1 


MGFPVRKFDAVIVXXXXXXXXXXXXXSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 


60 




M PVR+FDAV++ S+SG CA+LSKVFPTRSHTV+AQGGI + +LGN 




Sbjct : 


1 


MKLPVREFDAWIGAGGAGMRAALQISQSGQTCALLSKVFPTRSHTVSAQGGITVALGNT 


60 


Query : 


61 


QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 


120 






ED W+WHMYDTVKGSD++GDQDAIE+MC+ PEA++ELEHMG+PF R+ + G+IYQRPFG 




Sbjct : 


61 


HEDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFG 


120 


Query : 


121 


GHTAEHGKRAVERAC^VADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 


180 




G + G R A ADRTGHA+LHTLYQQN++ +T F EW A DL+++++G WG 




Sb j ct : 


121 


GQS KNFGGEQAARTAAAADRTGHALLHTLYQQNLKNHTT I FS EWYALDLVKNQDGAWGC 


180 


Query: 


181 


TAMEMETGEVYI FHAKAVM FATGGGGRI YAS STNAYMNTGDGLG I CARAG I PLEDMEFWQ 


240 




TA+ +ETGEV F A+A + ATGG GRIY S+TNA++NTGDG+G+ RAG+P++DME WQ 




Sbjct: 


181 


TALC I ETG EWYF KARAT VLATGG AG R I YQS TTN AH I NTGDG VGMA I RAG V P VQDME MWQ 


240 


Query : 


241 


FHPTGVAGAGVLITEGWGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG 


300 




FH PTG + AG AG V L+ TEG RGEGG LLN GERFMERYAP KDLA RDW+R++ +EI EG 




Sbjct: 


241 


FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDLAGRDWARSIMIEIREG 


300 


Query: 


301 


RGC 303 








RGC 




Sb j Ct : 


301 


RGC 303 




Score 


= 249 


(115.0 bits), Expect = 6.7e-169, Sum P(2) = 6.7e-169 




Identities = 53/102 (51%), Positives = 62/102 (60%) 




Query : 


309 


HVLLKI DH IGAEKI MEKLPG I RE I S I QFAGXXXXXXXXXXXXTTHYMMGG I PTNYHGE W 


368 




H LK+DH+G E + +LPGI E+S FA T HYMMGGIPT G+ + 




Sbjct: 


310 


HAKLKLDHLGKEVLESRLPGILELSRTFAHVTJPVKEPIPVIPTCHYMMGGIPTKVTGQAL 


369 


Query : 


369 VPQGD E Y E VP VKG L YAAG E CAC AS VHG ANRLGTN S LLDL W F 410 








+V V GL+A GE AC SVHGANRLG NSLLDLWF 




Sbjct: 


370 


T VN E KGE D VW PG L FA VG E I ACV S VHG ANRLGGN SLLDL WF 411 





The following partial DNA sequence was identified in N. meningitidis <SEQ ID 69>: 

mOlO-l.seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCAGCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAg 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCGGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 
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451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 



CAACAAAACG 
AGATTTGATT 
AAATGGAAAC 
GCTACCGGCG 
GAATACCGGC 
AAGACATGGA 
GTGTTGATTA 
CGACGGCGAA 
CTTCTCGCGA 
CGCGGCTGCG 
CGGCGCAGAA 
TTCAGTTCGC 
ACTACCCACT 
TGTCGTTCCG 
CGGCAGGTGA 
ACCAACTCCC 
CATGATTAAA 
ATGCAGGTGA 
GATGGTGAAA 
ACTGCACGCC 
GAGAAGTCAT 
AAGAGCAAAG 
CCTGATTGAA 
AATCACGCGG 
AACTGGATGA 
CAAACCGGTG 
AGCGCGTTTA 



TCCGTGCCAA 
CGTGATGAAA 
CGGCGAAGTT 
GCGGCGGTCG 
GATGGTTTGG 
ATTCTGGCAA 
CCGAAGGCGT 
CGCTTTATGG 
CGTTGTTTCC 
GTAAAAACAA 
AAAATTATGG 
CGGTATCGAT 
ATATGATGGG 
CAAGGTGAAG 
GTGCGCTTGT 
TGTTGGACTT 
TTCATCAAAG 
GTTGACCCGC 
ACGTTGATGC 
GGCGTGTTCC 
GGCGATTGCC 
TGTGGAATAC 
GTGGCGAAAG 
TGCGCACGCT 
AACATACGCT 
CACACCAAGC 
TTGATGA 



TACGCAATTC 
ACGGCGATGT 
TATATTTTCC 
TATTTATGCG 
GTATTTGTGC 
TTCCAGCCGA 
ACGCGGCGAG 
AACGCTATGC 
CGCGCGATGG 
AGACCATGTC 
AAAAACTGCC 
CCGATTAAAG 
CGGCATTCCG 
ATTACGAAGT 
GCTTCCGTAC 
GGTGGTATTC 
AGCAAAGCGA 
CAACGTATCG 
ATTGCGTCGC 
GTACTGATGA 
GAGCGTGTGA 
CGCGCGTATC 
CGACTTTGGT 
TCAGACGACC 
GTACCATTCA 
CTTTGAGCGT 



TTTGTGGAAT 
CGTCGGCGTA 
ACGCTAAAGC 
TCTTCTACCA 
GCGTGCAGGT 
CCGGCGTGGC 
GGCGGTATTC 
GCCGACCGTA 
CGATGGAAAT 
TTACTGAAAA 
GGGCATCCGC 
ACCCGATTCC 
ACCAATTACC 
GCCTGTAAAA 
ACGGTGCGAA 
GGTAAAGCTG 
CTGGAAACCT 
AGCGTTTGGA 
GAACTGCAAC 
GATTCTGAGC 
AACGTACCGA 
GAGGCTTTGG 
GTCTGCCGAA 
ATCCTGAGCG 
GATATCAATA 
GGAATACATC 



GGACGGCACA 
ACCGCCATGG 
TGTGATGTTT 
ATGCCTATAT 
ATCCCGTTGG 
GGGTGCGGGC 
TGTTGAATGC 
AAAGACTTGG 
CTACGAAGGT 
TCGACCATAT 
GAGATTTCCA 
CGTTGTGCCG 
ACGGCGAAGT 
GGTCTGTATG 
CCGCTTGGGT 
CCGGCGACAG 
TTGCCTGCTA 
CAACCAAACC 
GCTCTGTACA 
AAAGGCGTTC 
AATCAAAGAC 
AATTGGATAA 
GCACGTAAAG 
CGATGATGAA 
CCTTGTCCTA 
AAACCGGCCA 



This corresponds to the amino acid sequence <SEQ ID 70; ORF 010-1> 

mOlO-l.pep. . 

VIVGGGGAGL 
QEDRWDWHMY 
SGKIYQRPFG 
FVEWTAQDLI 
SSTNAYMNTG 
GGILLNADGE 
LLKIDHIGAE 
TNYHGEVWP 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



MGFPVRKFDA 
GGISASLGNV 
HMGMPFDRVE 
QQNVRANTQF 
ATGGGGRIYA 
VLITEGVRGE 
RGCGKNKDHV 
TTHYMMGGIP 
TNSLLDLWF 
DGENVDALRR 
KSKVWNTARI 
NWMKHTLYHS 



RAALQLSKSG 
DTVKGSDWLG 
GHTAEHGKRA 
RDENGDWGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGEDYEVPVK 



GKAAGDSMIK FIKEQSDWKP 
ELQRSVQLHA GVFRTDEILS 
EALELDNLIE 
DINTLSYKPV 



LNCAVLSKVF 
DQDAIEFMCR 
VERACAVADR 
TAMEMETGEV 
IPLEDMEFWQ 
KDLASRDVVS 
EISIQFAGID 
GLYAAGECAC 
LPANAGELTR 
KGVREVMAIA 
VAKATLVSAE ARKESRGAHA 
HTKPLSVEYI KPAKRVY* 



PTRSHTVAAQ 
AAPEAVIELE 
TGHAMLHTLY 
YIFHAKAVMF 
FHPTGVAGAG 
RAMAMEIYEG 
PIKDPIPWP 
ASVHGANRLG 
QRIERLDNQT 
ERVKRTEIKD 
SDDHPERDDE 



m010-l / g010-l 99.5% identity in 410 aa overlap 

10 20 30 40 50 60 

m010-l .pep MGFP VRKTOAVT VGG GGAGLRAALQL SKS GLNCAVL S KVFPTR S HTVAAQ GGI S AS LGNV 

I I I | t I | I I I I I 1 I I I I I I J I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I 
g010-l MGFPVlWFDAVIVGGGGAGIiRAALQLSKSGLNCAV^ 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOlO-l.pep QEDRWDWHMYD TVKG SDWLGDQDAI EEMCRAAPEAVT ELE HMGMPFDRVE S GKI YQRPFG 

I | | | | M 1 M I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I 
gOlO-1 QEDRWDWHMYDTVKGSDWLGDQDAIEEWCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 1 0 - 1 . pep GHTAEHGKRAVERACAVADRTGHAMLHT^ 

I | I I I I I I I 1 I I I II I I I I I II I M I I I I I I I I I I I I I II II I I I I I II I I I I I I I I ! I I 
g010-l GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVlumTQFFVEWTAQDLIRDENGDVVGV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m010-l .pep TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 
I I I I I I I I I I I I I I I I I I 1 M I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I 
g010-l TAMEME TGE VY IFHAKAVMFAT GGGGRI YAS S TNAYMNTGD GLG I CARAGI PLEDME FWQ 
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190 200 210 220 230 240 

250 260 270 280 290 300 

FH PTGVAGAGVLI TEGVRGEGGILLNADGERFMERYAPTVKDLASRD WSRAMAME I YEG 
I | ( I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I t I I 

FH PTGVAGAGVLI TEGVRGEGGILLNADGERFMERYAPTVKDLASRD WSRAMAME I YEG 
250 260 270 280 290 300 

310 320 330 340 350 360 

RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPVVPTTHYMMGGIP 
I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ( I I I I I I I i I 
RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 
310 320 330 340 350 360 

370 380 390 400 410 420 

TNYHGEVVVPQGEDYEVPVKGLYAAGECACASVHGANRLGTN^ 
i I I I I I I I I I I I :: I I I I I I I I I I I I I I i I I I I I I I I I I t I I I I I I I I I I 
TNYHGEVVVPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLVVFRPTPRX 
370 380 390 400 410 

430 440 450 460 470 480 

FI KEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRS VQLHAGVFRTDE ILS 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71>: 

a010-l . seq. . 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

4 01 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA 

14 01 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

1451 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGA 

This corresponds to the amino acid sequence <SEQ ID 72; ORF 010-l.a>: 

a010-l.pep. . 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 



m010-l .pep 
g010-l 

m010-l .pep 
g010-l 

mOlO-l.pep 
g010-l 

m010-l .pep 
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151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPVVP 

351 TTHYMMGGIP TNYHGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY* 

m01O-l / a010-l 99.3% identity in 587 aa overlap 



10 20 30 40 50 60 

aOlO-l.pep MGF P VRKFDAVI VGGGGAGLRAXLQLS KSGLNCAVLS KVF PTRSHTVAAQGG I S AS LGNV 

III Mil III llllllllllll IIMIIIIIIIIIIIIIIIIIIIIIIIII IIIIIMI 

a010-l MG F PVRKFDA VI VGGGGAGLRAALQLS KSGLNCAVLS KVF PTRSHTVAAQGG I S AS LGNV 

10 20 30 40 50 60 



70 80 90 100 110 120 

aO 1 0 - 1 . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKI YQRPFG 

I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I It 
m010-l QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 



130 140 150 160 170 180 

aOlO-l.pep GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQF FVEWTAQDLI RDENGDVVGV 

I I I I I I I I I I I I I I I I I I 1 I I I I I I! I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I 
raO 1 0 - 1 GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQF FVEWTAQDLI RDENGDVVGV 

130 140 150 160 170 180 



190 200 210 220 230 240 

a010-l .pep TAMEMETG E VY I FHAKAVMFATGGGGRI YASSTNAYMNTGDGLG I CARAG I PLEDME FWQ 

I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I | 
m01O-l TAMEMETG EVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

190 200 210 220 230 240 



250 260 270 280 290 300 

a010-l .pep FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I II I I 
m010-l FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 

250 260 270 280 290 300 



310 320 330 340 350 360 

a010-l .pep RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 

III I III III III IMMMIMIMI llll Mill ill! :: 1 1 MM Ml III 1 1 

m010-l RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 

310 320 330 340 350 360 



370 380 390 400 410 420 

aOl 0 - 1 . pep TNYHGEVWPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWFGKAAGDSMIK 
I I I I I I I I I I I h M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I || I I | | 
mO 1 0 - 1 TNYHGEVWPQGEDYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWFGKAAGDSMI K 

370 380 390 400 410 420 



430 440 450 460 470 480 

a010-l .pep FIKEQSDWKPL PAN AGE LTRQR I ERLDNQTDG ENVD ALRRE LQR S VQLHAG V F RTDE I L S 

II MMIIMMMMMMMMMIMIimillllllMIIMIM III II Ml II 

m010-l FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 

430 440 450 460 470 480 



490 500 510 520 530 540 

a010-l .pep KG VRE VMA I AERVKRTE I KD KS KVWNT ARI EALE LDNL I EVA KAT L VS AE ARKE S RGAHA 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II I M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 II I M 1 1 1 1 1 1 1 

m010-l KG VR E VMA I A E R VKRT E I KD KS K VWNT AR I EALE LDN L I E VAKAT L VS AE ARK E S RGAHA 

490 500 510 520 530 540 



a010-l .pep 



550 560 570 580 

SDDHPERDDENWMKHTLYHSDANTLSYKPVHTKPLSVEYIKPAKRVYX 

lllllllllllllllllllll i ■ ■ 



WO 99/57280 



PCT/US99/09346 



192 



m0 1 0 - 1 SDDHPERDDENWMKHTLYHSDINTLSYKPVHTKPLSVEYI KPAKRVYX 

550 560 570 580 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 73>: 

gOll . seq 

1 ATGAAGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG G AC AT CAT G A 

101 GCCTGAAAAC CCGCCTTACC GAAGATATGA AAACCGCGAT GCGCGCCAAA 

151 GATCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAATGCCG CCGTCAAACA 

201 GTTTGAAGTA GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGCAAAG ACGGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGACGT 

351 GCTGCACCGC TACCTGCCGC AAATGCTCTC CGCCGGCGAA ATCCGCACCG 

4 01 CCGTCGAAGC AGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGCAAAGTGA TGGTCGTATT GAAAAcccGC CTCGCCGGCA AAGccgATAT 

501 GGGCGAAGTC AACAAAATCT TGAAAAccGt aCTGACCGCC tga 



This corresponds to the amino acid sequence <SEQ ID 74; ORF 01 l.ng>: 

gOll .pep 

1 MKTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKTRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDGAKIYT 

101 EAGRQDLADK ENAEIDVLHR YLPQMLSAGE IRTAVEAAVA ETGAAGMADM 

151 GKVMVVLKTR LAGKADMGEV NKILKTVLTA * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 75>: 

mOll.seq (partial) 

1 ATGAGGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG G AC AT CAT G A 

101 GCCTGAAAAT CCGCCTTACC GAAGACATGA AAACCGCGAT GCGCGCCAAA 

151 GACCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAACGCCG CCGTCAAACA 

201 GTTTGAAGTG GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGAAAAG ACAGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGAGGT 

351 ACTGCACCGC TACCTTCCCC AAATGCTTTC CGCCGGCGAA ATCCGTACCG 

4 01 AGGTCGAAGC TGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGTAAAGTCA TGGGGCTGCT GAAAACCCGC CTCGCAGGTA AAGCCGA. . . 



This corresponds to the amino acid sequence <SEQ ID 76; ORF 01 1>: 

mOll.pep (partial) 

1 MRTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKIRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDSAKIYT 

101 EAGRQDLADK ENAEIEVLHR YLPQMLSAGE IRTEVEAAVA ETGAAGMADM 

151 GKVMGLLKTR LAGKA 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 01 1 shows 95.8% identity over a 165 aa overlap with a predicted ORF (ORF 01 1 .ng) 
from N. gonorrhoeae: 

mOll/gOll 

10 20 30 40 50 60 

mOll.pep MRTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKIRLTEDMKTAMRAKDQVSLGTIRL 
I : I f I I I I I I II I I I I I I I I I I I I i I i I I 1 I I I I I I I I I I I I I M I I I I M 1 | | I | | | | 
gOll MKTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKTRLTEDMKTAMRAKDQVSLGTIRL 

10 20 30 40 50 60 

70 80 90 100 110 120 

raOll .pep INAAVKQFEVDERTEADDAKITAILTKMVKQRKDSAKIYTEAGRQDLADKENAEIEVLHR 
I I II I I I I I I I I I I I M I I I I M I II I I I I I I I I : I I I I I I I I I I || | | | | | | | | : | | | | 
gOll INAAVKQFEVDERTEADDAKITAILTKMVKQRKDGAKIYTEAGRQDLADKENAEIDVLHR 
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70 80 90 100 110 120 

130 140 150 160 

mOll . pep YLPQMLSAGEIRTEVEAAVAETGAAGMADMGKVMGLLKTRLAGKA 
I I! I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I : I I I I I I I I I 
gOll YLPQMLSAGEIRTAVEAAVAETGAAGMADMGKVMVVLKTRLAGKADMGEVNKILKTVLTA 

130 140 150 160 170 180 

gOll X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 77>: 

g012 . seq 

1 ATGCTCGCCC GTCGCTATTT TTTCAATATC CAACCCGGGG CGGTTTTCAC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGCCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACa 

201 gGcggTGGAT ATTCGgcact tccgCcacca cacccaccga accgatgacc 

251 gcaaacggaG CGGAAACAAT TTTATCCGCc acacacgcca tcatatagcc 

301 gcCGCTTGCC GCGACCTTAT CGAcggcgac ggTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

4 01 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCGCCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CTTTTTTTTC CTGATGTTTT GTCTCTTCCT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 78; ORF 012.ng>: 

g012 .pep 

1 MLARRYFFNI QPGAVFTDKL LEQLMRFLQF LPEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRHFRHHTHR TDDRKRSGNN FIRHTRHHIA 

101 AACRDLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPAL LQTLFLCFGF 

201 RLFLFLFFFF LMFCLFLA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 79>: 



m012 . seq 












1 


ATGCTCGCCC 


GTTGCCACTT 


CCTCAATATC 


CAATTGAGGG 


CGGTTCTCGC 


51 


TGACAAACTG 


CTTGAACAAC 


TGATGCGTTT 


CCTCCAGTTC 


CTGTCGGAAT 


101 


TTCTGTTTGC 


CCTTTTCCGT 


ATTTTCACCC 


ATAAAAGTAA 


CCGTGCGCTT 


151 


AAATTCGCCC 


GCCGTCATCA 


CATCCACATC 


AATATCATGT 


TTTTTCAACA 


201 


GGCGGTGGAT 


ATTCGGTACT 


TCCGCCACCA 


CACCCACCGA 


ACCGACAATC 


251 


GCAAACGGAG 


CGGAAGCAAT 


TTTATCCGCC 


ACACACGCCA 


TCATATAACC 


301 


GCCGCTCGCn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


351 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


401 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


451 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


501 


nnnnnnnnnn 


nnnnnnnnnC 


AACACAAAAA 


GGCGTGATTT 


nTGCGTTTCG 


551 


GCAGATTTCT 


CCCCACCCTC 


CTTCAAACGT 


TTTTCcTCTG 


CTTTGGCTTC 


601 


CGCCTTTTCC 


TTTTTCTTTT 


CCTCTTTTTC 


CTGATGTTGT 


GCCTCTTCCC 


651 


CGCTTAA 











This corresponds to the amino acid sequence <SEQ ID 80; ORF 012>: 

m012 .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARXXXXXXX xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 XXXXXXXXXX XXXXXXXXXX XXXQHKKA*F XRFGRFLPTL LQTFFLCFGF 

2 01 RLFLFLFLFF LMLCLFPA* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 81>: 

a012.seq 

1 ATGCTCGCCC GTTGCCACTT 

51 TGACAAACTG CTTGAACAAC 

101 TTCTGTTTGC CCTTTTCCGT 

151 AAATTCGCCC GCCGTCATCA 

201 GGCGGTGGAT ATTCGGTACT 

251 GCAAACGGAG CGGAAACAAT 

301 ACCGCTCGCC GCCACCTTAT 

351 CGCGCAAACG CCTAAGCTGC 

4 01 CCGCCCGGAC TTTCCAATCT 

4 51 CAAAAGCACC GCCGTAATCT 

501 ACAAATCGCC GTCAAAATCC 

551 GAAGATTTCT CCCCACCCTC 

601 CGCCTTTTCC TTTTTCTTTT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 82; ORF 012. a>: 

a012 .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 R LFLFLFLFF LMFCLFPA * 

moi2/aOi2 64.2% identity over a 218 aa overlap 

10 20 30 40 50 60 

MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I I I I I I I I I i I I I I [ I I I I I I I I I I I I I I I I I I I I I I I I I I I I I [ I I I I I I I I I I | | | 
MLARCH FLN I QLRAVLADKLLEQLMRFLQFLSE FLFALFRI FTHKSNRALKFARRHHI H I 
10 20 30 40 50 60 

70 80 90 100 110 120 

NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I I I M I I I I I I I I I : : I I I I I I I I I I I : I II I I I I I I I I : I I : 
NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
70 80 90 100 110 120 

130 140 150 160 170 180 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 
: : : : I I I I I I 

PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
130 140 150 160 170 180 

190 200 210 219 

XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I II I I I I I I : I I I I I I I I I I I I I I I I I I : I I I I I I 
LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
190 200 210 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 012 shows 58.7% identity over a 21 8 aa overlap with a predicted ORF (ORF 012.ng) 
from N. gonorrhoeae: 

m012/g012 

10 20 30 40 50 60 

m012 . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I I : I : I I I I I : : I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I M I I I I I 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 



CCTCAATATC CAATTGAGGG CGGTTCTCGC 
TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 
ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 
CATCCACATC AATATCATGT TTTTTCAACA 
TCCGCTACAA CACCCACCGA ACCGACAATC 
TTTATCCGCC ACACACGCCA TCATATAACC 
CGACGGCGAC GGTCAGCGGA ATATTGCGTT 
GAAGCCGCCA AACCGTAACC GTGAACCACG 
AAGCAGAACC TCATCTTCAG GCTTGGCAAT 
CATGACGCAA GGATTCTACG GCGTGTGCAT 
AACACAAAAA GGCGGGATTT TTGCGTTTCG 
CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 
CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 



m012.pep 
a012 

m012.pe'p 
a012 

m012 .pep 
a012 

m012 .pep 
a012 
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10 20 ' 30 40 50 60 

70 80 90 100 110 120 

m012 . pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I I i I I I I I I I I : I I I I I I I I I : I I I I I : I If I I I I I I I : I I : 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m012 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

: : : : MINI 

g012 PKLRSRQTVTVNHAART FQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 



190 200 210 219 

m012 . pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I I I I : I I f I : I I 1 I I I I I I I I I I : I I I I : 1 I I II 
g012 LRFGRFLPALLQTLFLCFG FRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 83>; 

m012-l . seq 

1 ATGCTCGCCC GTTGCCACTT 

51 TGACAAACTG CTTGAACAAC 

101 TTCTGTTTGC CCTTTTCCGT 

151 AAATTCGCCC GCCGTCATCA 

201 GGCGGTGGAT ATTCGGTACT 

251 GCAAACGGAG CGGAAGCAAT 

301 GCCGCTCGCC GCCACCTTAT 

351 CGCGCAAACG CyTAAGCTGC 

4 01 CCGCCCGGAC TTTCCAATCT 

4 51 CAAAAGCACC GCCGTAATCT 

501 ACAAATCGCC GTCAAAATCC 

551 GCAGATTTCT CCCCACCCTC 

601 CGCCTTTTCC TTTTTCTTTT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 84; ORF 012-1>: 

m012-l.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARRHLIDGD GQRNIAFAQT XKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 R LFLFLFLFF LMFCLFPA * 

m012-l/g012 91 . 7% . identity in 218 aa overlap 

10 20 30 40 50 60 

m012-l.pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
Mil : I : I I I I I :: I I I I I I I I I I I 1 ( I I I I I I I I I I I I i I I I I I I I I I I II I I I I 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 



CCTCAATATC CAATTGAGGG CGGTTCTCGC 
TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 
ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 
CATCCACATC AATATCATGT TTTTTCAACA 
TCCGCCACCA CACCCACCGA ACCGACAATC 
TTTATCCGCC ACACACGCCA TCATATAACC 
CGACGGCGAC GGTCAGCGGA ATATTGCGTT 
GAAGCCGCCA AACCGTAACC GTGAACCACG 
GAGCAGAACC TCATCTTCAG GCTTGGCAAT 
CATGACGCAA GGATTCTACG GCGTGTGCAT 
AACACAAAAA GGCGGGATTT TTGCGTTTCG 
CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 
CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 



70 80 90 100 110 120 

m012-l . pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 
I I I I I I I I I I M : I I I I I II I I : I I I 1 i: I I I I I ! I I 1 I : I I I I I I I I I I I 1 I I I I I I 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m012-l.pep XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I | | 
g012 PKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
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130 



140 



150 



160 



170 



180 



190 200 210 219 

m012-l . pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I I I I I : I I I I I I I I I I I I I I I I I I : I I I I I I I I II 
g 012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

a012-l. seq 

ATGCTCGCCC GTTGCCACTT 

TGACAAACTG CTTGAACAAC 

TTCTGTTTGC CCTTTTCCGT 

AAATTCGCCC GCCGTCATCA 

GGCGGTGGAT ATTCGGTACT 

GCAAACGGAG CGGAAACAAT 

ACCGCTCGCC GCCACCTTAT 

CGCGCAAACG CCTAAGCTGC 

CCGCCCGGAC TTTCCAATCT 

CAAAAGCACC GCCGTAATCT 

ACAAATCGCC GTCAAAATCC 

GAAGATTTCT CCCCACCCTC 

CGCCTTTTCC TTTTTCTTTT 
CGCTTAA 



identified in N. meningitidis <SEQ ID 85>: 



l 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



CCTCAATATC 
TGATGCGTTT 
ATTTTCACCC 
CATCCACATC 
TCCGCTACAA 
TTTATCCGCC 
CGACGGCGAC 
GAAGCCGCCA 
AAGCAGAACC 
CATGACGCAA 
AACACAAAAA 
CTTCAAACGC 
CCTCTTTTTC 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
CTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCfTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 86; ORF 012-1 .a>: 

a012-l .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAART FQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 RLFLFLFLFF LMFCLFPA* 



a012-l/m012-l 



97.2% identity in 218 aa overlap 



10 20 30 40 50 60 

a012-l.pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKF7VRRHHIHI 
I I! I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m012-l MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 

70 80 90 100 • 110 120 

a012-l . pep NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
I I I I I I I I I I I I I I I : : I I II I I I I I I I : I I I I I I I I I I I : I I I I I I I I I I I I M I II I I 
m012-l NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

a012-l . pep PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
I I I I I I I II I I I I I I I I I I : I I I I I I I I I I I I I I I I U I I I I I I I I I I I I I I I I I I I I I 
m012-l XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 

190 200 210 219 

a012-l.pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
mO 1 2 - 1 LRFGRFLPTLLQTLFLCFG FRLFLFLFLFFLMFCLFPAX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 87>: 

g013.seq 

1 aTgcctttga ccatgctgtg cagcaGGAcg tGCGGTTtgt tcataataca 

51 gtCcgaccGG AAAagcggAG GAAaCGCAGT GCCGCGCCCT TCCCCTTTCT 

101 TGCCGTGGCA GGCGATGCag tTgGATTCGT ACACTTTTTG CCCTTTtGtC 
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151 atgatGCTgt tgtcggCGGC AGAAGCgGCG GcgCAGAGGC AGCACAAGAT 

201 GAAGGCGGTC GGCAGTCGGG TTGTGTtcat tGgcgTTTCC cctaatgttt 

251 tgaaaccttg ttttttgatt Ttgcctttac ggggtgaaaa gtttttTtgg 

301 cccaaatccg gaatttag 

This corresponds to the amino acid sequence <SEQ ID 88; ORF 013.ng: 

g013 .pep 

1 MPLTMLCSRT CGLFIIQSDR KSGGNAVPRP SPFLPWQAMQ LDSYTFCPFV 

51 MMLLSAAEAA AQRQHKMKAV GSR VVFIGVS PNVLKPCFLI LPLRGEKFFW 

101 PKSGI* — — 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 89>: 

m013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAAGC AGCCCAAGAC 

201 GhGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTCATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT TTTGCCGAAT 

301 CAGTAG 

This corresponds to the amino acid sequence <SEQ ID 90; ORF 01 3>: 

m013.pep 

1 MPLTMLCSST CGFFMMKSER XSGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQKQPKTRAV GSR VVFIGVS FMFETLLLI L RSGXKIFLPN 
101 Q* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91>: 

a013. seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAGGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTAATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT CTTGCCGAAT 

301 CGGTAG 

This corresponds to the amino acid sequence <SEQ ID 92; ORF 013.a>: 

a013 .pep 

1 MPLTMLCSST CGFFMMKSER *SGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQPKTRAV GSR VVFIGVS LMFETLLLIL RSG*KIFLPN 
101 R* 

m0i3/aoi3 97.0% identity over a 101 aa overlap 

10 20 30 40 50 60 

m013.pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 1 I I I t I I I I I I I I I I I I I I | | | | | 
a013 MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 

70 80 90 100 

m013 . pep AQKQPKTRAVGSRVVFIGVSFMFETLLLILRSGXKIFLPNQX 
I I : I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I- 1 I 1 I I I : I 
a013 AQRQPKTRAVGSRVVFIGVSLMFETLLLILRSGXKIFLPNRX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 013 shows 73.3% identity over a 101 aa overlap with a predicted ORF (ORF 013.ng) 
from N. gonorrhoeae: 
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m013/g013 

10 20 30 40 50 60 

m013 . pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
I I I I I I I I ! I I : I : : : I : I III) I I I I I I I I I I I I I M I i I I t I I I I I I I II I I I I 
g013 MPLTMLCSRTCGLFIIQSDRKSGGNAVPRPSPFLPWQAMQLDSYTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 



70 80 90 100 

m013 . pep AQKQPKTRAVGSRVVFIGVS F-MFETLLLILR- SGXKI FLPNQX 

i I : I I : I i I I I I I I II I I : : : MM I HI I: 
g 013 AQRQHKMKAVGSRVVFIGVSPNVLKPCFLILPLRGEKFFWPKSGIX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 93>: 

g015 . seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTTTTTTCCT ACTTTGGAAA AAT C C AG AAA 

101 AGCCCTTGGT CGGCTTTTGG AAAGCACTGC CCCACCTCAA CGACACGATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAG ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGCA CAAAAATCCT GCTCCTGTTC G C CT AC AT CG 

251 CACTGGGCAT GGTAATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCTAT GTGTTGCATC GCCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCAT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 94; ORF 015.ng>: 

g015 .pep 

1 MQYLIVKYSH Q IFVTITILV FNIRFFLL WK NPEKPLVGFW KALPHLNDTM 

51 LLFTGLWLMK ITHFSPFNAP WLGT KILLLF AYIALGMVMM RA RPRSTKFY 

101 TVYLLAMCC1 ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 95>: 

m015 . seq (partial) 

1 . . AAAATCAGAA AAGCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA 

51 CGACACCATG CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT 

101 TCTCCCCGTT CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC 

151 GCCTATATCG CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC 

201 CAAGTTCTAC ACCGTTTACC TGCTCGCCAT GTGTTGCGTC GCCTGCATCG 

251 TTTACCTTGC CAAAACCAAA GTCCTGCCTT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 96; ORF 015: 

m015 . pep {partial ) 

1 . . KIRKALAGFW KALPHLNDTM LLFTGLWLMK ITHFSPFNAP WLGTKILLLL 
51 AY I ALGMMMM RARPRSTKFY TVYLLAMCCV ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 97>: 

a015. seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTGTTTTCNT ACTTTGGAAA AAT CC AG AAA 

101 AGCCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA CGACACCATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC GCCTATATCG 

251 CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCCAT GTGTTGCCTC ACCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCTT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 98; ORF 015.a>: 

a015 . pep 
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1 MQYLIVKYSH Q IFVTITILV FNIRVFXL WK NPEKPLAGFW KALPHLNDTM 
51 LLFTGLWLMK ITHFSPFNAP WLGTK ILLLL AYIALGMMMM RA RPRSTKFY 
101 TVYLLAMCCL TCIVYLAKTK VLPF* 



moi5/aoi5 96.7% identity over a 91 aa overlap 



10 20 30 

mO 1 5 . pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
a015 LIVKYSHQIFVTITILVFNIRVFXLWKNPEKPLAGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 



40 50 60 70 80 90 

m015 . pep FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCVACIVYLAKTKVLP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I : : I II I I i I I I I I I 
a015 FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCLTCIVYLAKTKVLP 

70 80 90 100 • 110 120 



m015.pep FX 
I I 

a015 FX 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 015 shows 94.5% identity over a 91 aa overlap with a predicted ORF (ORF 015.ng) 
from TV. gonorrhoeae: 

m015/g015 

10 20 30 

mOlS . pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I I : I I I I I I I I I I I I I I II I I I I I I I I I I 
g015 LIVKYSHQIFVTITILVFNIRFFLLWKNPEKPLVGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 

40 50 60 70 80 90 

FSPFNAPWLGTKILLLLAYIALGMMI^ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 h 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

FSPFNAPWLGTKILLLFAYIALGMVMMRARPRSTKFYTVYLLAMCCIACIVYLAKTKVLP 
70 80 90 100 110 120 



m015 .pep 
g015 



m015.pep FX 
II 

g015 FX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 99>: 

g018 . seq 

1 atGCAGCAGG GGCagttggt tggacgcgtc gcccgcaata AAGATATGCG 

51 GAATgctggt CTGCATggtC AGCGGATCGG CAACGGGtac gccgcgcgcg 

101 tctttgTCGA TATTGATGTT TTCCAAACCG ATATtgTCAA CGTTCGGACG 

151 GCgACCTACG GCTGCCAACA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT tctACATTGC CGTCTGCGTC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT tCtTCTCCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 100; ORF 018.ng>: 

g018 .pep 

1 MQQGQLVGRV ARNKDMRNAG LHGQRIGNGY AARVFVDIDV FQTDIVNVRT 

51 ATYGCQHIFG NKY AFFAILL PMDFYIAVCV EFDLGFSIQM QFQFFSEHGF 
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101 RLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 101>: 

m018 . seq 

1 ATGCAGCAGA GGCAGTTGGT TGGACGCATC GCCTGCGATG AAGATATGCG 

51 GAATACTGGT CTGCATGGTC AGCGGGTCGG CAACAGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATATT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCCACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCATC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT TCTTCGCCGA ACACGGCGTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 102; ORF 018>: 

mOl8 .pep 

1 MQQRQLVGRI ACDEDMRNTG LHGQRVGNRY AARIFFDIDI FQTDIVNVRT 
51 AAHGCQHIFG NKY AFFAILL PMDFYIAVCI EFDLGFSIQM QFQFFAEHGV 
101 RLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 103>: 

a018 . seq 

1 ATGCAGCAGG GGCAGTTGGT TGGACGCGTC GCCCGCAATA AAGATATGCG 

51 GAATACTGGT CTGCATAGTC AGCGGATCGG CAACGGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATGTT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCTACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCGTC GAGTTTGGCC 

251 TCGGTTTTAG CATCCAAATG CAGTTTCAAT TCTTCACCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 104; ORF 018.a>: 

a018.pep 

1 MQQGQLVGRV ARNKDMRNTG LHSQRIGNGY AARIFFDIDV FQTDIVNVRT 

51 AAYGCQHIFG NKY AFFAILL PMDFYIAVCV EFGLGFSIQM QFQFFTEHGF 

101 RLV* ~" 

moi8/aOi8 86.4% identity over a 103 aa overlap 

10 20 30 40 50 60 

m018.pep MQQRQLVGRIACDEDMRNTG LHGQRVGNRY AARI FFDIDI FQTDIVNVRTAAHGCQHI FG 

Ml 11111:1 : : II I I I I I I : I I : I I I I I I I I I I I I : I I I I I I II I I I I : I I I I I I I 
a018 MQQGQLVGRVARNKDMRNTGLHSQRIGNGYAARIFFD I DVFQTD I VNVRT AAYGCQHIFG 

10 20 30 40 50 60 

70 80 90 100 

m018 . pep NKYAFFAI LLPMDFYIAVCIE FDLGFS IQMQFQFFAEHGVRLVX 

I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I : I I I 1 I I I 
a018 NKYAFFAI LLPMDFYIAVCVEFGLGFSIQMQFQFFTEHGFRLVX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 018 shows 84.5% identity over a 103 aa overlap with a predicted ORF (ORF 018.ng) 
from N. gonorrhoeae: 

m018/g018 

10 20 30 40 50 60 

m018 . pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDIFQTDIVNVRTAAHGCQHIFG 
Ml I I I I I : I : : I I I I : I II I I I : I I I I I I : I I I I : I I I I I I I I I I I : : I I I I I I I 
g018 MQQGQLVGRVARNKDMRNAGLHGQRIGNGYAARVFVDIDVFQTDIVNVRTATYGCQHIFG 

10 20 30 40 50 60 



WO 99/57280 



PCT/US99/09346 



201 



70 80 90 100 

m018.pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I I I I I I I I I I M I I I I I I : I I I I I I I I I I I I I I I : I I I I I I I 
g018 NKYAFFAILLPMDFYIAVCVEFDLGFSIQMQFQFFSEHGFRLVX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 105>: 

g019.seq (partial) 

1 . .ctgctggcgg ccctggtgct tgccgcgtgt tcttcgACAA ACAcacTGCC 
51 AGCCGGCAAG ACCCCGGCAG ACAATATAGA AActgcCgAC CTTTCGGCAA 
101 GCGTTCCCAC ccgcCCTGCC GAACCGGAAG GAAAAACGCT GGCAGATTAC 
151 GGCGGCTACC CGTCCGCACT GGATGCAGTG AAACAGAACA ACGATGCGGC 

2 01 AGCCGCCGCC TATTTGGAAA Acgcaggaga cagCGcgatg gcGGAAAatg 
251 tccgcaagga gtgGCTGa 

This corresponds to the amino acid sequence <SEQ ID 106; ORF 019.ng>; 

g019 .pep (partial ) 

1 . . LLAALVLAAC SSTNTLPAGK TPADNIETAD LSASVPTRPA EPEGKTLADY 
51 GGYPSALDAV KQNNDAAAAA YLENAGDSAM AENVRKEWL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 107>: 

m019.seq (partial) 

1 ATGTACCTAC CCTCTATGAA GCATTCCCTG CCGCTGCTGG CGGCCCTGGT 

51 GCTTGCCGCG TGTTCTTCGA CAAACACACT GCCAGCCGGC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCCGCCCT 

151 GCCGAACCCG AAAGAAAAAC GCTGGCAGAT TAGGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCACGCAG ACAGTGGACG CTGTTTGCAC AGGAATACGC 

3 51 CAAACTCGAA CCGGCAGGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCTG AACTGGTCAA AAATACGGGC 
4 51 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 
501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 
551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 
6 01 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 
651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 
701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 
751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 
801 CGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 
851 ACGCCCGCGC CGCCTTGCGC GCCCGACGTT GGGACGAGCT GGCCTCCGTT 
901 ATCTCGCATA TGCCCGAAAA ACTGCAAAAA AGCCCGACCT GGCTCTACTG 
951 GCTGGCACGC AGCCGCGCCG CAACGGGCAA CACGCAAGAG GCGGAAAAAC 

1001 TTTACAAACA GGCGGCAGCG ACGGGCAGGA ATTTTTATGC GGTGCTGGCA 

1051 GGGGAAGAAT TGGGTCGGAA AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAAAC AGCGTCCGCC GCATGGCGGA AGACGGTGCA GTCAAACGCG 

1151 CACTGGTACT GTTCCAAAAC AGCCAATCTG CCGGTGATGC AAAAATGCGC 

12 01 CGTCAGGCTC AGGCGGAATG GCGTTTTGCC ACACGCGGCT TTGACGAAGA 
1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

13 01 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 
1351 CGCTATATTT CGCCGTTTAA AGACACGGTA ATCCGCCACG CGCAAAATGT 

14 01 TAATGTCGAT CCGGCTTGGG TTTATGGGCT GATTCGTCAG GAAAGCCGCT 
14 51 TCGTTATAGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 
1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 
1551 ACAACTTTAC ACCGCCGACG GG. . . 

This corresponds to the amino acid sequence <SEQ ID 108; ORF 01 9>: 

m019 .pep (partial) 

1 MYLPSMKHSL PLLAALVLAA CSSTNTLPAG KTPADNIETA DLSASVPTRP 
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51 AEPERKTLAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LFAQEYAKLE PAGRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKS PNAAAL LSEMESGLSL EQRSFAWGVL 

251 GHYQSQNLNV PAALDYYGKV ADRRQLTDDQ IEWYARAALR ARRWDELASV 

301 ISHMPEKXjQK SPTWLYWLAR SRAATGNTQE AEKLYKQAAA TGRNFYAVLA 

3 51 GEELGRKIDT RNNVPDAGKN SVRRMAEDGA VKRALVLFQN SQSAGDAKMR 

4 01 RQAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
451 RYISPFKDTV IRHAQNVNVD PAWVYGLIRQ ESRFVIGAQS RVGAQGLMQV 
501 MPATAREIAG KIGMDAAQLY TADG . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 109>: 

a019.seq 

1 ATGTACCCAC CCTCTCTGAA GCATTCCCTG CCGCTGCTGG TGGNCCTGGT 

51 GCTTGCCGCG TGTTCTTNGA CAAACACACT GTCAGCCGAC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCNGCCCT 

151 GCCGAACCCG AANGAAAAAC GTNGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCGCGCAG ACAGTGGACG CTGTNTGCAC ANGAATATGC 

351 NAAACTCGAA CCGGCANGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCCG AACTGGTCAA AAATACGGGC 

4 51 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 NGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCNNTNNGC NNNCGNNGTT NGNANGANNT GGCNNCCGNN 

901 ANCNCGNNNN TGCNNGANAA ACNNNNNNAN AGNCNNANNT NGNTNNANTG 

951 NNTGGCACGC AGCCGCGCCG CNACGGGCAA CACGCAANAN GCGGANAAAC 

1001 TNTACAAACA GGCGGCAGCA NCGGGCANGA ATTTTTATGC NGTGCTGNCN 

1051 GGGGAAGAGT TGGGGCGCAN AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAANC AGCGTCCTCC GTATGGCGGA AGACGGCGCG ATTAAGCGCG 

1151 CGCTGGTGCT GTTCCGAAAC AGCCGAACCG CCGGCGATGC GAAAATGCGC 

1201 CGTCNGGCTC AGGCGGAATG GCGTTTCGCC ACACGCGGCT TCGATGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

1351 CGCTACATTT CGNNNNNTNA NGACACGGTA ATCCGCCACG CGCAAAATGT 

1401 TAATGTCGAT CCGGCGTGGG TTTACGGGCT GATTCGTCAG GAAAGCCGCT 

14 51 TCGTTATGGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 

1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 

1551 ACAACTTTAC ACCGCCGACG GCAATATCCG TATGGGGACG TGGTATATGG 

1601 CGGACACCAA ACGCCGCCTG CAAAACAACG AAGTCCTCGC CACCGCAGGC 

1651 TATAACGCCG GTCCCGGCAG GGCGCGCCGA TGGCAGGCGG ACACGCCCCT 

1701 CGAAGGCGCG GTATATGCCG AAACCATCCC GTTTTCCGAA ACGCGCGACT 

1751 ATGTCAAAAA AGTGATGGCC AATGCCGCCT ACTACGCCTC CCTCTTCGGC 

1801 GCGCCGCACA TCCCGCTCAA ACAGCGTATG GGCATTGTCC CCGCCCGCTG 

1851 A 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 019.a>: 

a019.pep 

1 MYPPSLKHSL PLLVXLVLAA CSXTNT LSAD KTPADNIETA DLSASVPTXP 

51 AEPEXKTXAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LXAXEYAKLE PAXRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKS PNAAAL LSEMESGLSL EQRSFAWGVL 

251 GHYQSQNLNV PAALDYXGKV ADRRQLTDDQ IEWYARAAXX XRXXXXXAXX 

301 XXXXXXKXXX XXXXXXXXAR SRAATGNTQX AXKLYKQAAA XGXNFYAVLX 

351 GEELGRXIDT RNNVPDAGKX SVLRMAEDGA IKRALVLFRN SRTAGDAKMR 

4 01 RXAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
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4 51 RYISXXXDTV IRHAQNVNVD PAWVYGLIRQ ESRFVMGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADGNIRMGT WYMADTKRRL QNNEVLATAG 

551 YNAG PGRARR WQADTPLEGA VYAETIPFSE TRDYVKKVMA NAAYYASLFG 

601 APHIPLKQRM GIVPAR* 



m0i9/aoi9 88.9% identity over a 524 aa overlap 



10 20 30 40 50 60 

m019.pep MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD 
II 11:1111111: I I It I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 
a019 MYPPSLKHSLPLLVXLVLAACSXTNTLSADKTPADNIETADLSASVPTXPAEPEXKTXAD 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOl 9 . pep YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
a019 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLXAXEYAKLE 

70 80 90 100 110 120 

130 140 150 160 170 180 

m019 . pep PAGRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a019 PAXRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m019.pep LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 
I I I I I I M I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
a019 LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m019 . pep EQRSFAWGVLGHYQSQNLNVPAALDYYGKVADRRQLTDDQIEWYARAALRARRWDELASV 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a019 EQRSFAWGVLGHYQSQNLNVPAALDYXGKVADRRQLTDDQIEWYARAAXXXRXXXXXAXX 

250 260 270 280 290 300 

310 320 330 340 350 360 

m019 . pep ISHMPEKLQKSPTWLYWLARSRAATGNTQEAEKLYKQAAATGRNFYAVLAGEELGRKIDT 
i : II I I I I I I I I I I I I I If II I : I MINI MINI III 

a019 XXXXXXKXXXXXXXXXXXARSRAATGNTQXAXKLYKQAAAXGXNFYAVLXGEELGRXIDT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m019.pep RNNVPDAGKNSVRRMAEDGAVKRALVLFQNSQSAGDAKMRRQAQAEWRFATRGFDEDKLL 
I I I I I I I I I II I I II I I I : I I I I I I I : I I : : I I I I I I I I I I I I I I I I I I I I I II I I I 
a0l9 RNNVPDAGKXSVLRMAEDGAIKRALVLFRNSRTAGDAKMRRXAQAEWRFATRGFDEDKLL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m019.pep TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISPFKDTVIRHAQNVNVDPAWVYGLIRQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
a019 TAAQT AFDHG FY DMAVN S AERT DRKLN YT LRY I S XXX DT V I RHAQN VN V D PAWV YG L I RQ 

430 440 450 460 470 480 

490 500 510 520 

m019 . pep ESRFVIGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADG 
I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a019 ESRFVMGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADGNIRMGTWYMADTKRRL 

490 500 510 520 530 540 



a019 



QNNEVLATAG YNAG PGRARRWQADT PLEGAVYAET I P FSETRDYVKKVMAN AAY YAS L FG 
550 560 570 580 590 600 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 019 shows 95.5% identity over a 89 aa overlap with a predicted ORF (ORF 019.ng) 
from N. gonorrhoeae: 

10 20 30 40 49 

LLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPEGKTLAD 

1 1 II : II I! 1 1 1 1 1 M I ! 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 M 1 1 I 

MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD 
10 20 30 40 50 60 

50 60 70 80. 89 

YGGYPSALDAVKQNNDAAAAAYLENAGDSAMAENVRKEWL 

I I I I I I I I I I I I I : I I I I : I I I I I I I I I I II I I I I I = I I I 

YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 
70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 1>: 

g023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 AATGCAGCGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCCCTG CCTAAAGAAT ATCCGGCATG GCAGGCATTT 

151 TTTAGTCAAG CTTGGGTAAA AGTATTTACC CAAGTGAGCT TTATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT TGtCtGGCTG 

3 01 GTCGGCTGCC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 023. ng>: 

g023 -pep 

1 MVERKLTGAH YGLRDWVMQR ATAVIMLIYT VALLWLFAL P KEYPAWQAF 

51 FSQAWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGV RL FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

m023.seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 GATGCAACGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTTCCCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTTTGGCTG 

3 01 GTCGGCTGTC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 023>: 

m023 .pep 

1 MVERKLTGAH YGLRDWVMQR ATAV I MLIYT VALLWLFSL P KEYSAWQAF 

51 FSQTWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGVR L FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 15>: 

a023.seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GGGATTGGGC 

51 GATGCAACGT GCGACCGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCTCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 



g019/m019 

g019 -pep 
m019 

g019.pep 
m019 
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201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATNA 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTCTGGCTG 
301 GTCGGCTGCT TGGTGTATTC AATTAAAGTA ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 023. a>: 

a023.pep 

1 MVERKLTGAH YGLRDWAMQR ATA V IMLIYT VALLVVLFAL P KEYSAWQAF 

51 FSOTWVKV FT QVSFIAVFLH AWVGI RDLWM DYXKPFGVR L FLQVATIVWL 

101 VGCLVYSIKV IWG* 

m023/a023 96.5% identity over a 1 13 aa overlap 

10 20 30 40 50 60 

MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFSLPKEYSAWQAFFSQTWVKVFT 

| | | | | | | | | | | I I I I I: I I ill I I I I I M I I I I I I I I I : I I I I I i I M I I I I I I I I I I I I 
MVERKLTGAHYGLRDWAMQRATAVIMLIYTVALLVVLFALPKEYSAWQAFFSQTWVKVFT 
10 20 30 40 50 60 

70 80 90 100 110 

QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 

I | | I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I I I I I I I : I ! I I I I 
QVSFIAVFLHAWVGIRDLWMDYXKPFGVRLFLQVATIVWLVGCLVYSIKVIWGX 
70 80 90 100 110 



n\023 .pep 
a023 

m023 .pep 
a023 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 023 shows 97.3% identity over a 1 13 aa overlap with a predicted ORF (ORF 023 .ng) 
from N. gonorrhoeae: 

g023/m023 

10 20 30 40 50 60 

q023 pep MVERKLTGAHYGLRDVn^QRATAVIMLIYWALLWLFALPKEYPAWQAFFSQAWVKVFT 

I M I ill II MMMII II Ml ill M I IIMMIMMMI I I II 1 1 1 1 h 1 1 1 1 1 1 

m023 MVERKLTGAHYGLRDWVMORATAVIMLIYTVALLWLFSLPKEYSAWQAFFSQTWVKVFT 

10 20 30 40 50 60 



70 80 90 100 110 

q023 pep QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 

^ 1 1 I Ml Mill II II! Ml II MINIMI Ml MUNI III II II MINIMI 

m0 23 QVS F I AVFLHAWVG I RDLWMDYI KP FGVRLFLQVAT I VWLVGCLV YS VKVI WGX 

70 80 90 100 110 



The following partial DNA sequence was 


g025 . seq 






1 


ATGTTGAAAC 


AAAcgACACT 


51 


GTTGGGCGGT 


TGcgCCACCC 


101 


CAGGTATGCA 


GACCGTATCG 


151 


ACGCCGTACA 


ATGCCGCTCC 


201 


CGTGCAAact 


gcgccggttT 


251 


CACCTGCCGT 


TTCGGgtaca 


301 


aacgCGGCGa 


cgCataCTAT 


351 


caaAcgCtac 


CATATCTCTC 


401 


CCGACAATAC 


GTTGAGCATC 


451 


TATGCCGCAC 


CGAAAACCGC 


501 


TGCCGCGCAA 


ACCCCTGTGA 


551 


CGCCGCAACC 


TGCCGCGCCC 


601 


GCGCCCGCCC 


CGCAATCTCC 


651 


CGGCGGCATT 


GTTTGGCAGC 



identified in N. gonorrhoeae <SEQ ID 1 17>: 

TTTGGCAGCT TGTACCGCCG TTGCCGCTCT 
AACAGCCTGC TccTGTCATT GCAGGCAATT 
TCTGCGCCGG TTTACAATCC TTATGGCGCA 
TGCCGCCAac gatgcGCCgT ATGTGCCGCC 
ATTCGCCTCC TGCTTATGTT CCGCcgtCTG 
tatgtTCCTT CTTACGCACC CgtcgACATC 
TGTGCGTGGC GACACgGtgt acaACATTTc 
AAGACGATTT CCGTGCGTGG AACGGCATGA 
GGTCAGATTG TTAAAGTCAA ACCGGCaggA 
AGCCGTAGAA AGCAGGCCCG CCGTACCGGC 
AACCCGCCGC gcaACCGCCC GTTCAGTCCG 
GCTGCGGAAA ATAAAGCGGT TCCCGCCCCC 
TGCCGCTTCG CCTTCCGGCA CGCGTTCGGT 
GTCCGACCCA AGGTAAAGTG GTTGCCGATT 
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701 TCGGCGGCGG CAACAAGGGT GTCGATATTG CCGGCAATGC CGGACAACCC 

751 GTTTTGGCGG CGGCTGACGG CAAAGTGGTT TATGCCGGTT CAGGTTTGAG 

801 GGGATACGGA AACTTGGTCA TCATCCAGCA CAATTCCTCT TTCCTGACCG 

851 CGTACGGGCA CAACCAAAAA TTGCTGGTCG GCGAAGGTCA GCAGGTCAAA 

9 01 CGCGGTCAGC AGGTTGCTTT GATGGGTAAT ACCGATGCTT CCAGAACGCA 

951 GCTTCATTTC GAGGTGCGTC AAAACGGCAA ACCGGTTAAC CCGAACAGCT 

1001 ATATCGCGTT CTGA 

This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 025.ng>: 

g025.pep 

1 MLKQTTLLAA CTAVAALLGG CATQQPAPVI AGNSGMQTVS SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQT APVYSPPAYV PPSAPAVSGT YVPSYAPVDI 

101 NAATHTIVRG DTVYNISKRY HISQDDFRAW NGMTDNTLS I GQIVKVKPAG 

151 YAAPKTAAVE SRPAVPAAAQ TPVKPAAQPP VQSAPQPAAP AAENKAVPAP 

201 APAPQSPAAS PSGTRSVGGI VWQRPTQGKV VADFGGGNKG VDIAGNAGQP 

251 VLAAADGKW YAGSGLRGYG NLVIIQHNSS FLTAYGHNQK LLVGEGQQVK' 

3 01 RGQQVALMGN TDASRTQLHF EVRQNGKPVN PNSYIAF* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 19>: 

m025 . seq (partial) 

1 . . GTGCCGCCGG TGCAAAGCGC GCCGGTTTAT ACGCCTCCTG CTTATGTTCC 

51 GCCGTCTGCA CCTGCCGTTT CGGGTACATA CGTTCCTTCT TACGCACCCG 

101 TCGACATCAA CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC 

151 AACATTTCCA AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA 

201 CGGCATGACC GACAATACGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC 

251 CGGCAGGATA TGCCGCACCG AAAGCCGCAG CCGTAAAAAG CAGGCCCGCC 

3 01 GTACCGGCTG CCGCGCAACC GCCCGTACAG TCCGCACCCG TCGACATTAA 

3 51 CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC AACATTTCCA 

4 01 AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA CGGCATGACC 
451 GACAATATGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC CGGCAGGATA 

5 01 TGCCGCACCG AAAACCGCAG CCGTAGAAAG CAGGCCCGCC GTACCGGCTG 
551 CCGTGCAAAC CCCTGTGAAA CCCGCCGCGC AACCGCCTGT GCAGTCCGCG 
601 CCGCAACCTG CCGCGCCCGC TGCGGAAAAT AAAGCGGTTC CCGCGCCCGC 
651 CCCGCAATCT CCTGCCGCTT CGCCTTCCGG CACGCGTTCG GTCGGCGGCA 
701 TTGTTTGGCA GCGTCCGACG CAAGGTAAAG TGGTTGCCGA TTTCGGCGGC 
751 AACAACAAGG GTGTCGATAT TGCCGGTAAT GCGGGACAGC CCGTTTTGGC 
801 GGCGGCTGAC GGCAAAGTGG TTTATGCCGG TTCAGGTTTG AGGGGATACG 
851 GAAACTTGGT CATCATCCAG CATAATTCTT CTTTCCTGAC CGCATACGGG 
901 CACAACCAAA AATTGCTGGT CGGCGAGGGG CAGCAGGTCA AACGCGGTCA 
951 GCAGGTTGCT TTGATGGGCA ATACCGATGC TTCCAGAACG CAGCTTCATT 

1001 TCGAGGTGCG TCAAAACGGC AAACCGGTTA ACCCGAACAG CTATATCGCG 

1051 TTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 025>: 

m025.pep (partial) 

1 . .VPPVQSAPVY TPPAYVPPSA PAVSGTYVPS YAPVDINAAT HTIVRGDTVY 

51 NISKRYHISQ DDFRAWNGMT DNTLSIGQIV KVKPAGYAAP KAAAVKSRPA 

101 VPAAAQPPVQ SAPVDINAAT HTIVRGDTVY NISKRYHISQ DDFRAWNGMT 

151 DNMLSIGQIV KVKPAGYAAP KTAAVESRPA VPAAVQTPVK PAAQPPVQSA 

201 PQPAAPAAEN KAVPAPAPQS PAASPSGTRS VGGIVWQRPT QGKWADFGG 

251 NNKGVDIAGN AGQPVLAAAD GKWYAGSGL RGYGNLVIIQ HNSSFLTAYG 

3 01 HNQKLLVGEG QQVKRGQQVA LMGNTDASRT QLHFEVRQNG KPVNPNSYIA 

351 F* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

a025.seq 

1 ATGTTGACAC CAACAACACT TTAGGTAGCT TGTACCGCCC TTGCCGCTCA 

51 GTTGGGCGGA TGCCCCACCC AACACCCTTC TCCTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTACCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 
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151 


ACGCCGTACA 


201 


GGTGCAAAGC 


251 


CACCTGCCGT 


301 


AACGCGGCGA 


351 


CAAATGCTAC 


401 


CCGACAATAC 


451 


TATGCCGCAC 


501 


TGCCGCGCAA 


551 


CGCATACTAT 


601 


CATATCTCTC 


651 


GTTGAGCATC 


701 


CGAAAGCCGC 


751 


ACCCCTGTGA 


801 


TGCCGCGCCC 


851 


CTCCTGCCGC 


901 


CAGCGTCCGA 


951 


GGGTGTCGAT 


1001 


ACGGCAAAGT 


1051 


GTCATCATCC 


1101 


AAAATTGCTG 


1151 


CTTTGATGGG 


1201 


CGGCAAAACG 



ATGCCGCTCC 
GCGCCGGTTT 
TTCGGGTACA 
CCCATACTAT 
CATATCTCTC 
GTTGAGCATC 
CGAAAGCCGC 
CCGCTCGTAC 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AACCCGCCGC 
GCTGCGGAAA 
TTCGCCTTCC 
CGCAAGGTAA 
ATTGCAGGAA 
GGTTTATGCA 
AGCATAATTC 
GTCGGCGAAG 
CAATACCGAG 
GCAAACCGGT 



TGCCGCCAAC 
ATANGCCTCC 
TACGTTCCTT 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AGTCCGCACC 
GACACGGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
GCAACCGCCT 
ATAAAGCGGT 
GGCACGCGTT 
AGTGGTTGCC 
ATGCGGGACA 
GGTTCCGGTT 
TTCCTTCCTG 
GCCAGCAGGT 
GCTTCTAGAA 
TAATCCGAAC 



GATGCGCCGT 
TGCTTATGTT 
CTTACGCANC 
GACACCGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
CGTCGACATC 
ACAACATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
GTGCAGTCCG 
TCCCGCGCCC 
CGGTCGGCGG 
GATTTCGGCG 
GCCCGTTTTG 
TGAGGGGATA 
ACCGCATACG 
CAAACGCGGG 
CGCAGCTTCA 
AGCTATATCG 



This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 

a025.pep 

1 MLTPTTL*VA CTALAAQLG G CPTQHPSPVI AGNSGMQTVP 

51 TPYNAAPAAN DAPYVPPVQS APVYXPPAYV PPSAPAVSGT 

101 NAATHTIVRG DTVYKISKCY HISQDDFRAW NGMTDNTLSI 

151 YAAPKAAAVK SRPAVPAAAQ PLVQSAPVDI NAATHTIVRG 

201 HISQDDFRAW NGMTDNTLSI GQIVKVKPAG YAAPKAAAVK 

251 TPVKPAAQPP VQSAPQPAAP AAENKAVPAP APQSPAASPS 

301 QRPTQGKWA DFGGNNKGVD IAGNAGQPVL AAADGKVVYA 

351 VIIQHNSSFL TAYGHNQKLL VGEGQQVKRG QQVALMGNTE 

4 01 RQNGKPVNPN SYIAF* 



ATGTGCCGCC 
CCGCCGTCTG 
CGTCGACATC 
ACAAGATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
AACGCGGCGA 
CAAACGCTAC 
CCGACAATAC 
TATGCCGCAC 
TGCCGTGCAA 
CGCCGCAACC 
GCCCCGCAAT 
CATTGTTTGG 
GCAACAACAA 
GCGGCGGCTG 
CGGCAATTTG 
GGCACAACCA 
CAGCAGGTCG 
TTTCGAGGTG 
CGTTCTGA 

025.a>: 

SAPVYNPYGA 
YVPSYAXVDI 
GQIVKVKPAG 
DTVYNISKRY 
SRPAVPAAVQ 
GTRSVGGIVW 
GSGLRGYGNL 
ASRTQLHFEV 



m025/a025 97.4% identity over a 351 aa overlap 

10 20 30 

m025 pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 
" I I I I 1 I I I I i : I I I I I I I I I I I I I I I II I I 

a02 5 GMQTVPSAPVYNPYGATPYNAAPAANDAPYVPPVQSAPVYXPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

m025 pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
|| | | | | | | | I I I | I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
a025 YAXVDINAATHTIVRGDTVYKISKCYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 

100 110 120 130 140 150 

m025 pep KAAAVKSRPAVPAAAQPPVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 

| | | | 1 | I 1 I I I ! I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

a 025 KAAAVKSRPAVPAAAQPLVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 
160 170 180 190 200 210 

160 170 180 190 200 210 

m025 pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

II I I I I I I II I I I I I I I I I I : I I I : I I M I I I M I I I I I I I I I I I I I I I I I II I 

a 025 DNTLSIGQIVKVKPAGYAAPKAAAVKSRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 
220 230 240 250 260 270 

220 230 240 250 260 270 

m025 . pep KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAAAD 
I I | | | | | I I I I I I II I II I I I I II I I I I I I I I II I I I II 1 I I I I I I I I II I I I I II I I I I 
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a0 25 KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKVVADFGGNNKGVDIAGNAGQPVLAAAD 
280 290 300 310 320 330 

280 290 300 310 320 330 

m025 pep GKVVYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDASRT 
| M 1 | | | | | | | I I 1 I I I I I II I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I : I I I I 
a0 25 GKVVYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTEASRT 
340 350 360 370 380 390 

340 350 
m025.pep QLHFEVRQNGKPVNPNSYIAFX 
I I I I I I I I I I I I I I I I I I I I I I 
a025 QLHFEVRQNGKPVNPNSYIAFX 
400 410 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORP from N. gonorrhoeae 

ORF 025 shows 75.6% identity over a 353 aa overlap with a predicted ORF (ORF 025.ng) 
from N. gonorrhoeae: 

m025/g025 



m025 .pep 
g025 



10 20 30 

VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

M Ml : I I i I :| I I I I I I I I II ! I I I M I I 
GMQTVSSAPVYNPYGATPYNAAPAANDAPYVPPVQTAPVYSPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 



40 50 60 70 80 90 

m02 5 pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g02 5 YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 



m025 .pep 
g025 



100 110 120 130 140 150 

KAAAVKS R P AV P AAAQ P PVQS AP VD I N AATHT I VRGDTV YN I S KR YH I S QDD FRAWNGMT 



m025 .pep 
g025 



160 170 180 190 200 210 

DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

II II III M I II h I II I II 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 Ml I 

TAAVES RPAV P AAAQT P VKP AAQ P P VQS APQ P AAP AAEN 

160 170 180 190 



220 230 240 250 260 

m025 pep KAVPAPAP- -QSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAA 

M I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I I I I I 
g 025 KAVPAPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGGNKGVDIAGNAGQPVLAA 

200 210 220 230 240 250 



270 280 290 300 310 320 

m0 2 5 pep ADGKWYAGSGLRGYGNLVI I QHNS S FLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

| | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I II II I II 
g 025 ADGKWYAGSGLRGYGNLVI I QHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

260 270 280 290 300 310 



m025 .pep 



330 340 350 

RTQLHFEVRQNGKPVNPNSYIAFX 
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RTQLH FE VRQNG KP VN PNS Y I AFX 
320 330 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 13>: 

g031 . seq 

1 ATGGTGTCCC TCCGCTTCAG ATTCGGCAAC CACTTTAAAC GCCGACATTC 

51 TGACAATTTC CTTTTCCGCC AGCCAAATAT CATGCGTATC TTTCGGTTCG 

101 GGCTTGTTGG GCATGGCAAC CTTCAACAGC CGCGCCATCA CAGGAATCGT 

151 CGTTCCCTGA ATCAGCAGCG ACAGCACCAC CACGGCAAAC GCCACATCAA 

201 ACAGCAGGTG CGAATTGGGA ACGCCCATCA CCAGCGGCAT CATCGCCAGC 

251 GAAATCGGTA CGGCTCCTCG CAAGCCCAAC CAACTGATAT ACGCCTTTTC 

301 ACGCAGGCTG TAATTGAATT TCCACAAACC GCCGAACACT GCCAGCGGAC 

351 GCGCGACCAG CATCAGGAAC GCCGCAATCG CCAAGGCTTC CGCCGCCCTG 

4 01 TCCAACACGC CGGCGGGAGA AACCAGCAGA CCGAGCATGA CGAACAAAGT 

4 51 TGCCTGCGCC AGCCAAGCCA AACCGTCCAT CACACGCAAA ACGTGTTCCG 

501 TcgcACGGTT GCGCTGGTTA CCGACAATGA TGCCGGCAAG GTAAACCGCC 

551 AAAAAGCCGC TGCCGCCTAT GGTATTGGTA AACGCAAACA CAAGCAGCCC 

601 GCCCGACACA ATCATCAGCG CGTACAGACC TTCCGtacac acctccaatt 

651 cccaatcaac gtcatagctg tctcccgtgt taaaatgttc ttcacttcag 

701 aatccccccc ttcttcccag cccgaaacct tcatgtgtta naccctgggg 

751 tgccccaacg gatttagtaa cctcccaatg actctgcttg tcgccccctt 

801 cgcccgcttt ctccttccgg gaaaacttgt tgtccccgtc ttacattaa 

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 031.ng>: 

g031 .pep 

1 MVSLRFRFGN HFKRRHSDNF LFRQPNIMRI FRFGLVGHGN LQQPRHHRNR 

51 RSLNQQRQHH HGKRHIKQQV RIGNAHHQRH HRQRNRYGSS QAQPTDIRLF 

101 TQAVIEFPQT AEHCQRTRDQ HQERRNRQGF RRPVQHAGGR NQQTEHDEQS 

151 CLRQPSQTVH HTQNVFRRTV ALVTDNDAGK VNRQKAAAAY GIGKRKHKQP 

201 ' ARHNHQRVQT FRTHLQFPIN VIAVSRVKMF FTSESPPSSQ PETFMCXTLG 

251 CPNGFSNLPM TLLVAPFARF LLPGKLV VPV LH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 15>: 

m031 . seq (partial) 

1 ... CGCCTGAAGC ACGGTGTCGG ACTGCATTTC TATTCGGCTA TACGCCTTTT 
51 CACGCAGGCT GTAATTGAAT TTCCACAAAC CGCCGAACAC TGCCGACGGA 

101 CGCGCGACCA GCATCAGGAA CGCCGCAATC GCCAAgGCTT CCGCCGCCCT 

151 GTCCAACACG TTGGCAGGAG AAACCAGCAG CAAAGGCATT CCCAAACGTG 

201 CGGACAAAGT GGTCGAAACC ACGCTCAGAA ACAACAGTGC GCCACCCGGC 

251 AG 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 03 1>: 

m031.pep (partial) 

1 . . . RLKHGVGLHF YSAIRLFTQA VIEFPQTAEH CRRTRDQHQE RRNRQGFRRP 
51 VQHVGRRNQQ QRHSQTCGQS GRNHAQKQQC ATRQ. . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

a031 . seq 

1 ATACGCCTTT TCACGCAGGC TGTAATTGAA TTTCCACAAA CCGCCGAACA 

51 CTGCCGGCGG ACGCGCGACC AGCATCAGGA ACGCCGCAAT CGCCAAGGCT 

101 TCCGCCGCCC CGTCCAACAC GTTGGCAGGA GAAACCAGCA GCAAAGGCAT 

151 TCCCAAACGT GCGGACAAAG TGGTCGAAAC CACGCTCAGA AACAACAGTG 

201 CGCCACCCGG CAG 

This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 031.a>: 

a031.pep (partial) 

1 IRLFTQAVIE FPQTAEHCRR TRDQHQERRN RQGFRRPVQH VGRRNQQQRH 

51 SQTCGQSGRN HAQKQQCATR Q 
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m03i/a03i 100.0% identity over a 71 aa overlap 

10 20 30 40 50 60 

m031 . pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
a031 IRLFTQAV IE FPQT AEHCRRTRDQHQERRNRQG FRRPVQHVGRRNQQ 

v 10 20 30 4 0 

70 80 
m031 . pep QRHSQTCGQSGRNHAQKQQCATRQ 
I I I I I I I I I I I I I I II I I I II I I I 
a 031 QRHSQTCGQSGRNHAQKQQCATRQ 
50 60 70 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORP from TV. gonorrhoeae 

ORF 031 shows 60.0% identity over a 85 aa overlap with a predicted ORF (ORP 031.ng) 
from N. gonorrhoeae: 

m031/g031 

10 20 30 

m031 . pep RLKHGVGLH FYS AI RLFTQAVI E FPQTAEH 

I : : I = : I I I I I I I I II II I I I II 
g031 NQQRQHHHGKRHIKQQVRIGNAHHQRHHRQRNRYGSSQAQPTDIRLFTQAVIE FPQTAEH 

60 70 80 90 100 110 

40 50 60 70 80 

m031 . pep CRRTRDQHQERRNRQGFRRPVQHVGRRNQQQRHS-QTCGQSGRNHAQKQQCATRQ 
I : I I I I I I I I I I I I I I II I I I I I : I I I I I : I : I : I : : ::::[:): 
g031 CQRTRDQHQERRNRQGFRRPVQHAGGRNQQTEHDEQSCLRQPSQTVHHTQNVFRRTVALV 
120 130 140 150 160 170 



g031 



TDNDAGKVNRQKAAAAYGIGKRKHKQPARHNHQRVQT FRTHLQFPINVIAVSRVKMFFTS 
180 190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 19>: 

g032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTA TTGCGCCGCC CACGATTCGA 

51 GGCGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAAGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGACGC TGCTTGCGCC 

201 CTTTGCCGGT AACGTGTACC CACGCTTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGCTC 

301 GAACAGCGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAACAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

401 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGCGCATCAG 

4 51 CCCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCACGCC GACAGCTTGC 

501 GCGCCAGCGT CCGACCGTCC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GccgTAAAAT CGCGCCGCGA CAAGTCCTGC GGCACGCcgc ctgcaTCTTC 

601 AGACGGCATT TGTGCCAACA GTGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCAAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATacgccgC AAACccgTCC AAAACCATAA CCGTCCCACA 

7 51 CAAATATCAA AAAACCAGTG A 

This corresponds to the amino acid sequence <SEQ ID 110; ORF 032.ng>: 

g032.pep 

1 MRRNVPAVAV LRRPRFEAFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLTLLAPFAG NVYPR FVQIY IICIQAVYLA HA QTAAVHQL 
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101 EQRVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGAHQ 
151 PAFDQPGAIL PPRRQLARQR PTVQTALRQP PQRRRKIAPR QVLRHAACIF 
201 RRHLCQQCKQ FFQIAPVCRN RVLRLALAHD VFQISVKIRR KPVQNHNRPT 
251 QISKNQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

ACGTGCmTGC mGTCGCCGTT kTGCGCCGCC CATTGCGCCA 
GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 
TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 



m032 . seq 


(partial) 


1 


ATGCGGCGAA 


51 


AACGTTTTTG 


101 


AACAGGGCTT 


151 


CAGGGCTTCC 


201 


CTTTGCCGAT 


251 


TTCAAGCCGT 


301 


GAACAGGGCG 


351 


AATCCAGCAT 


401 


TCGGGCTGCT 


451 


GCCGCGCTTT 


501 


GAGCCAGCGT 



TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 
CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 
TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGTGCATCAG 
ACCAGCCAAA CGCAATACTG CCGCCAAGAC GAAAGCTTGC 
CCGTTCCCCC AAACCGCG . . . 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 032>: 

m032 .pep (partial) 

1 MRRNVXAVAV XRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFADQR HLPLXAPFAD NVYPR XVQID IICIQAVYLA HA QTAAVHQF 

101 EQGVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGVHQ 

151 AALYQPNAIL PPRRKLASQR PFPQTA. . . 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 13>: 

a032.seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTT TTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGCCGC TGCTTGCGTC 

201 CTTTGCCGGT AACGTGTACC CACGCCTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGCGCG TGATCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG TATGCAGCAG 

4 51 ACCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCAAGAC GACAGCTTGC 

501 GCGCCAGCGT CCGCGCATTC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GCCGTAAAAT CGCGCTGCGA CAAGCCCTGC GGCACGCCGC CTGCATCTTC 

601 AGACGGCATT TGTGCCAACA GCGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCCAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATGCGCCGC AAACCCGTCC AAAACCATAA CCGCCCCACA 

751 CAAATATCAA AAAAACAGTG A 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 032.a>: 

a032.pep 

1 MRRNVPAVAV LRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QG FHAFAGQR NLPLLASFAG NVYPR LVQIY IICIQAVYLA HA QTAAVHQF 

101 EQRVIAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGMQQ 

151 TAFDQPGAIL PPRRQLARQR PRIQTALRQP PQRRRKIALR QALRHAACIF 

201 RRHLCQQRKQ FFQIAPVCRH RVLRLALAHD VFQISVKMRR KPVQNHNRPT 

251 QISKKQ* 

m032/a032 88.1% identity over a 176 aa overlap 

10 20 30 40 50 60 

m032 . pep MRRN VXAVAVXRR PLRQT FL DLALAQARAV PAGKQG FAVRC RLTQRQ I V FQG FH AFADQR 

Mil! I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II 
a032 MRRNVPAVAVLRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQG FHAFAGQR 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m032 . pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGWAHRQRVAAVHGQIQH 
: I I I I I I Mill Mi II I I M M II M M M II II I I I : I II II I M M I I M I 
a032 NLPLLASFAGNVYPRLVQIYIICIQAVYLAHAQTAAVHQFEQRVIAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 150 160 170 

m032 . pep PVQPFLRQG FG YALGLLRRFDVGGRVGVHQAAL YQPN AI L P PRRKLASQR PFPQTA 

I II I II II II I M M I I II II II I I I I : :\ : \ : II : I M I II I : I I Ml III 
a032 PVQPFLRQGFGYALGLLRRFDVGGRVGMQQTAFDQPGAILPPRRQLARQRPRIQTALRQP 

130 140 150 160 170 180 

a032 PQRRRKIALRQALRHAACIFRRHLCQQRKQFFQIAPVCRHRVLRLALAHDVFQISVKMRR 

190 200 210 220 230 240 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 032 shows 86.4% identity over a 176 aa overlap with a predicted ORF (ORF 032.ng) 
from N. gonorrhoeae: 

m032/g032 

10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR 
Mill II I I III : : I I II II I M II I M I II II II I M I I M I I I II M M M II 
g032 MRRNVPAVAVLRRPRFEAFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m032.pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGVVAHRQRVAAVHGQIQH 
: I I MM I II II III II II I I II M II M I II I I : II I II II I II I II II I I M 
g032 NLTLLAPFAGNVYPRFVQIYIICIQAVYLAHAQTAAVHQLEQRVVAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 150 160 170 

m032 . pep PVQPFLRQG FGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQT A 

II I II II M I I I I I I II II I M I I I I I : M I : II M M II II : II III Ml 
g032 PVQPFLRQGFGYALGLLRRFDVGGRVGAHQPAFDQPGAILPPRRQLARQRPTVQTALRQP 

130 140 150 160 170 180 



g032 



PQRRRKIAPRQVLRHAACIFRRHLCQQCKQFFQIAPVCRNRVLRLALAHDVFQISVKIRR 
190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 15>: 

g033 . seq 

1 ATGGCGGCGG CGGACAAACT CTTGGGCGGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGAGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGTTG CCCAAATATC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAAcgg 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGagtTTGC CCAAAAAGTC 

3 01 GAACAcaaaA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 
351 GCTGTCGCTG TTTGAAAATT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAGAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 
451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 
501 ACTCGCCGAA AACGACCCcg tcaAATACCA CGCCGTCGCc aACCTGCcta 
551 AAGAAGGCGG GGCGCAAATg CCGTCTGAAA AAGAACCCAA GCCCGCCgCc 
601 aaaccgACCT ATACCCAAGT ATTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
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701 


GACTGGTGGA 


GTTTGAACAA 


751 


ATCGCCGAGC 


AGCACGCCGT 


801 


CATGAAGCCC 


GTCGTGGCGA 


851 


ACCAACTGGT 


GCACGACATC 


901 


GTCGACCGTG 


CGGGCATCGT 


951 


GTACGATTTG 


AGCTTCTTGC 


1001 


CGAGCGATGA 


AAACGAATGC 


1051 


GATGCGCCCG 


CCGCCGTCCG 


1101 


GGTTTCAGAC 


GGCATGGAAA 


1151 


GCGAAGGTGA 


GAAAACCGCC 


1201 


GCATTGGCGG 


TTGCCGAAAA 


1251 


cgtcaaacCG 


ATAGACGAAG 


1301 


accGCATCGT 


TACCCTTGAA 


1351 


GCGGTCTTGG 


AAGTGTTGGC 


1401 


TTTGGGCGTT 


GCCGATACCG 


1451 


TGGACGATTT 


GGGTTTGAGT 


1501 


TGGCTGCCGG 


ACCGTGATGC 



CGATTCCCCG ACCGCTATTT CGATGTCGGC 
tacCTTTGCC GGCGGTTTGG CGTGCGAAGG 
TTTATTCCAC CTTTTTACAA CGCGCCTACG 
GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 
CGGCGCGGAC GGTCCGACCC ATGCCGGCTT 
GCTGTGTGCC GAACATGATT GTTGCCGCGC 
CGCCTGCTGC TTTCGACCTG CTATCAGGCG 
CTATCCGCGC GGCACGGGTA CGGGCGCGCC 
CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 
TTcatTGCCT TCGGCAGTAT GGTCGCCACC 
ACTGAACGCC ACCGTCGCCG ATATGCGCTt 
AGTTGATTGT CCGCCTTGCC CGAAGCCAcg 
GAAAACGCCG AACAGGGCGG CGCAGGCGGC 
GAAACACGGC ATCTGCAAAC CCGTTTTGCT 
TAACCGAACA CGGCGATCCG AAAAAACTTT 
GCCGAAGCGG TGGAACGCCG GGTGCGCGAG 
GGCAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 116; ORF 033.ng>: 

g033 .pep 

1 MAAADKLLGG DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKEGGAQM PSEKEPKPAA 

2 01 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 
251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

3 01 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

3 51 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG I IRREGE KTA FIAFGSMVAT 

4 01 ALAVA EKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 
451 AVLEVLAKHG ICKPVLLLGV ADTVTEHGDP KKLLDDLGLS AEAVERRVRE 
501 WLPDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

m033 .seq 

1 ATGGCGGCGG CAGACAAACT CTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCaG . CGA TATGGATGTr GATTTGCTrG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGCTG CCGAAATACC TTGCCAGCAA 

2 01 CGTCGTGCGC GATATGCACG GCCTGTTGAG TACCGTCAAA GCGCAAACGG 
251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

3 01 GAACACAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 
351 GCTGTCTTTG TTTGAAAACT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 
4 51 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 
501 ACTCGCCGAA AACGACCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 
551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

6 01 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

7 01 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 
801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

8 51 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTACCCGT TTTGTTTGCC 
901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTCT 
951 GTACGATTTG AGCTTTTTGC GCTGCGTGCC GAACATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGTTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCC 

12 01 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 
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13 01 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

13 51 GCGGTGCTGG AAGTATTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 
14 51 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 
1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 118; ORF 033>: 

m03 3 .pep 

1 MAAADKLLGS DRRSVAIIGD GAMTAGQAFE ALNCAXDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI P GAME FAQ KV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

.301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGE KTA FIAFGSMVAP 

4 01 ALAVA EKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 

501 WLSDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 19>: 

a033 . seq 

1 ATGGCGGCGG CGGACAAACA GTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGTCA GGCGTTTGAA GCCTTGAACT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGTGCGTTG CCCAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACATAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 ACTGTCTTTG TTTGAAAACT TCGGCTTCCG CTATACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTCGATGTAT TGGAAGACCT GCGCGGACGC 

4 51 AAAGGCCCGC AGCTTCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGATCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 

551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 

801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTTT 

951 GTACGATTTA AGCTTTTTGC GCTGCATTCC GAATATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAATGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGTGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCT 

1201 GCATTGGCGG TCGCCGGAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCAGC 

1351 GCGGTGCTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTCTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

14 51 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 120; ORF 033.a>: 

a033 . pep 

1 MAAADKQLGS DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLEDLRGR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 
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301 VDRAGIVGAD GPTHAGLYDL SFLRCIPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGVPVSD GMETVEIGKG IIRREGEK TA FIAFGSMVAP 

4 01 ALAVAGKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGS 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 

501 WLSDRDAAN* 

m033/a033 98.4% identity over a 509 aa overlap 

10 20 30 40 50 60 

m033.pep MAAADKLLGSDRRSVAIIGDGAMTAGQAFEALNCAXDMDVDLLVVLN DNEMS ISPNVGAL 

I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I II I I I I i I I I I i I II I I II I I I I I M 
a033 MAAADKQLGSDRRSVAIIGDGAMTAGQAFEALNCAGDMDVDLLVVLNDNEMS ISPNVGAL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m033 . pep PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 
I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I M I I I I I I I II I I I I | | | | | | | | | | | 
a033 PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m033 . pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
I I I I I I I I I I I I I I I I I I I I I I I I : I I I : I I I I I I I I M I I I II I I I I I I | | | | | | | | | | 
a033 FENFGFRYTGPVDGHNVENLVDVLEDLRGRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m033 . pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVE FEQ 

I I M I I I I I I I I I I I I I I I I I I I I I I I II II I I II I I I I I I I I I I I I I I I I I I || | | | } | 
a033 NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

m033 . pep RFPDRYFDVGIAEQHAVT FAGGLACEGMKPVVAIYSTFLQRAYDQLVHDIALQNLPVLFA 

I I I I I I I I I I I I I I I I I I I I II II 1 I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | 
a033 R F PDR Y FD VG I AEQH AVT FAGG LACEGMK P VVA I Y S T FLQRAY DQLVH D I ALQN L P VL FA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m033.pep VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 
I I I I I I I I I I I I I I I I I I I I I I I I I : 1 I I I I I I I I I I i I II I I I I I I II I I I I I I I I I I I 
a033 VDRAGIVGADGPTHAGLYDLSFLRCIPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

m033 . pep GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 
I I I I I : I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I | | I I | | | | | | | | | | | | | 
a033 GTGTGVPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAGKLNATVADMRFVKP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m033 . pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I || | | | | I I | | | | 
a033 IDEELIVRLARSHDRIVTLEENAEQGGAGSAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

430 440 450 460 470 480 

490 500 510 

m033 . pep KKLLDDLGLSAEAVERRVRAWLSDRDAANX 
I I I I I I M I I I I I I I I I I I I I II I I I I I I I 
a033 KKLLDDLGLSAEAVERRVRAWLSDRDAANX 

490 500 510 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 033 shows 98.4% identity over a 509 aa overlap with a predicted ORF (ORF 033. ng) 
from N. gonorrhoeae: 



xn033/g033 






m033 


.pep 


MAAADKLLGSDRRS VAI IGDGAMTAGQAFEALNCAXDMDVDLL WLNDNEMS I S PNVGAL 

I II 1 II M MMM IMMIIIIIIMIII MM 1 IIIIIIIIIIMIIIIIMIMI! 

MAAADKLLGGDRRSVAI IGDGAMTAGQAFEALNCAGDMDVDLLWLNDNEMS I S PNVGAL 


60 


g033 




60 


m033 


.pep 


PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAEEAEHAKQSLSL 

1 III M 1 1! 1 111! IIMIMMMII 1 II Ml Ml! 1 1 II III II 1 1 1 1 M MINI II 

PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAEEAEHAKQSLSL 


120 


g033 




120 


m033 
g033 


-pep 


FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

M 1 1 1 M M 1 M M II MM IM MM 1 M 1 M Ml 1 M M M II 1 1 1 1 M M M 1 1 M 1 

FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 


180 
180 


m033 
g033 


.pep 


NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 M 1 M 1 1 1 1 1 

NLPKEGGAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 


240 
240 


m033 


.pep 


RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 

M M 1 M 1 1 M 1 M II 1 1 M MM M M M M M 1 II M II M M M M M M M 1 M M 

RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 


300 


g033 




300 


m033. 


.pep 


VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

1 M 1 1 II 1 M II M II 1 Ml MM II M 1 IM II II 1 M M II M M 1 1 II 1 1 II 1 II M 

VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 


360 


g033 




360 


m03 3. 
g033 


.pep 


GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 

1 M 1 M II 1 M IIMMIMMIMIM 1 M MMMM 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 

GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVATALAVAEKLNATVADMRFVKP 


420 
420 


m033. 


•pep 


IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

1 M 1 II M 1 IMMMMMIMIMM! M MMI IM M III II II M III M' INI 

IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTEHGDP 


480 


g033 




480 


m033. 
g033 


pep 


KKLLDDLGLSAEAVERRVRAWLSDRDAANX 510 

IIIIIIIIIIIIMIIIII II MMI 

KKLLDDLGLSAEAVERRVREWLPDRDAANX 510 





The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 12 1>: 

g034 . seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGACCA CGCCGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGcggGCG 

251 CGCCGTTTTT GCGCCACCTG ATTCTGGCGG CAGTCGAAGA ATTTCCGCAC 

3 01 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTgtgCCA 
351 ACGCTCCATC CAACTGGGCT TCTCCTCCGT GATGATGGAC GGCTCTTTGC 

4 01 TCGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACC 
4 51 CGTACCGTCG TCAACTTCTC CCACGCCTGC GGCGTGTCCG TCGAAGGCGA 
501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGAAGCAGGC GAAGAAGACG 
551 GAGTGGGCGC GGCAGGCAAA CTCTCACACG ACCAAATGCT CACCAGCGTT 
601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 
651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 
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701 GCGACGTATT GCGTATCGAC 

7 51 AATACACACA TCGTGATGCA 
801 GAAAGTCATC AACGAATACG 

8 51 CGGTTGAAGA AATCGTCGAA 
901 ATCGATACCG ACCTGCGCCT 
951 TGCCGAAAAC CCGTCCGACT 

1001 TTGAAGCGAT GAAGCAAATC 

10 51 GAAGGTCAGG CAGGCAAAAT 

1101 CCGTTATGCC AAGGGCGAAT 



CGCATCAAGG AAATCCACCA AGCCCTGCCC 
CGgctCCAGC TCCGTTCCGC AAGAatgGCT 
GCGGCAATAT CGGCGAAACC TACGGCGTGC 
GGCATCAAAC ACGGCGTGCG CAAAGTCAAC 
CGCTTCCACC GGCGCGGTAC GCCGCTACCT 
TTGATCCGCG CAAATACTTG GGCAAAACCA 
TGCCTCGACC GTTATCTTGC GTTCGGTTGC 
CAAACCTGTT TCGTTGGAAA AAATGG C AAG 
TGAACCAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 122; ORF 034.ng>: 



g034 .pep 
1 


MSRLWFFAVK 


NIIIRLIYLL 


PKETQMALVS 


MRQLLDHAAE 


NSYGLPAFNV 


51 


NNLEQMRAIM 


EAADQVNAPV 


IVQASAGARK 


YAGAPFLRHL 


ILAAVEEFPH 


101 


I P WMHQDHG 


ASPDVCQRSI 


QLGFSSVMMD 


GSLLEDGKTP 


SSYEYNVNAT 


151 


RTWNFSHAC 


GVSVEGEIGV 


LGNLETGEAG 


EEDGVGAAGK 


LSHDQMLTSV 


201 


EDAVRFVKDT 


GVDALAIAVG 


TSHGAYKFTR 


PPTGDVLRID 


RIKEIHQALP 


251 


NTHIVMHGSS 


SVPQEWLKVI 


NEYGGNIGET 


YGVPVEEIVE 


GIKHGVRKVN 


301 


IDTDLRLAST 


GAVRRYLAEN 


PSDFDPRKYL 


GKTIEAMKQI 


CLDRYLAFGC 


351 


EGQAGKI KPV 


SLEKMASRYA 


KGELNQIVK* 







The following partial DNA sequence was identified in N. meningitidis <SEQ ID 123>: 

m034 . seq (partial) 

1 ATGAGCTGTT TATGGTTTTT TGCTGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA wACAGCTACG GCyTGCCGGC GTTCAACGTC 

151 AACAACCTCG wACAGATGCG CGCCATCATG GAGGCTGCAG ACCAAGTCGA 

2 01 CGCCCCCGTC ATCGTACAGG CGAGTGCCGG TGCGCGCAAA TATGCGGGTG 
251 CGCCGTTTTT ACGCCACCTG ATTTTGGCGG CTGTCGAAGT ATTTCCACAC 

3 01 ATCCCCGTCG TCATGCACCA AGACCACGGC GCATCACCCG ACGTGTGCCA 
351 ACGCTCCATC CAACTGGGCT TCTCCTCTGT AATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACA 
4 51 CGTACCGTGG TTAACTTCTC CCACGCTTGC GGCGTATCCG TTGAAGGCGA 
501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGATGCAGGC GAAGAAGACG 
551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT GACCAGCGTC 
601 GAAGATGCCG TATGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCTAT 
651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 
701 GCGATGTATT ACGTATCGAC CGCATCAAAG AAATCCACCA AGCCCTGCCC 
751 AATACACACA TCGTGATGCA C. . . 

This corresponds to the amino acid sequence <SEQ ID 124; ORF 034>: 

m034.pep (partial) 

1 MSCLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHA AE XSYGLPAFNV 

51 NNLXQMRAIM EAADQVDAPV IVQASAGARK YAGAPFLRHL ILAAVEVFPH 

101 I PWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGDAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVCFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMH. . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 125>: 

a034 . seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGCAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGCGGGCG 

251 CGCCGTTTTT GCGCCACCTG ATTTTGGCGG CTGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TTTCCTCCGT GATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTATG AATACAACGT CAACGCCACC 
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451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



CGTACCGTGG 
AATCGGCGTA 
GTGTAGGCGC 
GAAGATGCCG 
TGCCGTCGGC 
GCGACGTGTT 
AATACACACA 
GAAAGTCATC 
CGGTTGAAGA 
ATCGATACCG 
TGCCGAAAAC 
TTGAAGCGAT 
GAAGGTCAGG 
CCGTTATGCC 



TTAATTTCTC 
TTGGGCAACC 
AGTGGGCAAA 
TGCGTTTCGT 
ACCAGCCACG 
GCGTATCGAC 
TCGTGATGCA 
AACGAATACG 
AATCGTCGAA 
ACTTGCGCCT 
CCGTCCGACT 
GAAGCAAATC 
CAGGCAAAAT 
AAGGGCGAAT 



CCACGCCTGC 
TCGAAACTGG 
CTTTCCCACG 
TAAAGATACC 
GCGCGTACAA 
CGCATCAAAG 
CGGCTCCAGC 
GCGGCAATAT 
GGCATCAAAC 
TGCTTCCACC 
TCGATCCGCG 
TGCCTCGACC 
CAAACCGGTT 
TGAACCAAAT 



GGCGTATCCG 
CGAAGCCGGC 
ACCAAATGCT 
GGCGTTGACG 
ATTCACCCGT 
AAATCCACCA 
TCCGTTCCGC 
CGGCGAAACC 
ACGGCGTGCG 
GGCGCGGTAC 
CAAATATTTG 
GCTACCTCGC 
TCCTTGGAAA 
CGTCAAATAA 



TTGAAGGCGA 
GAAGAAGACG 
CACCAGCGTC 
CATTGGCGAT 
CCGCCCACAG 
AGCCCTGCCC 
AAGAATGGCT 
TACGGCGTGC 
TAAAGTCAAC 
GCCGCTACCT 
AGCAAAACCA 
GTTCGGTTGC 
AAATGGCAAA 



This corresponds to the amino acid sequence <SEQ ID 126; ORF 034.a>: 

a034 .pep 

1 MSRLWFFAAK NIIIRLIYLL PKETQMALVS MRQLLDHA AE NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 IPVVMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTVVNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL SKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMANRYA KGELNQIVK* 

m034/a034 96.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

m034 .pep MSCLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 
| i | | | | | : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I 
a 034 MSRLWFFAAKNIIIRLIYLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 

10 20 30 4 0 50 60 



70 80 90 100 110 120 

m034 . pep EAADQVDAPVIVQASAGARKYAGAPFLRHLILAAVEVFPHIPVVMHQDHGASPDVCQRSI 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
a 034 EAADQVNAPVIVQASAGARKYAGAPFLRHLILAAVEEFPHIPWMHQDHGASPDVCQRSI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m034 . pep QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGDAG 
I | | | I i I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I 
a034 QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGEAG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m034 .pep EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 
| | | | | | | | I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a034 EEDGVGAVGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

190 200 210 220 230 240 



250 

m034 .pep RIKEIHQALPNTHIVMH 
I I I I I I I I I I ! I I I I I I 

a 034 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 

250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 034 shows 96.5% identity over a 257 aa overlap with a predicted ORF (ORF 034.ng) 
from N. gonorrhoeae: 
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m034/g034 






m034 . 


pep 


MSCLWFFAVKNI I IRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 

M 1 1 1 1 1 1 1 1 1 1 MM iMMlMMIMI II II 1 II II llllllllllll MM 

MSRLWFFAVKNI I IRLI YLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 


60 


g034 




60 


m03 4 . 


.pep 


EAADQVDAPVI VQASAGARKYAGAPFLRHLILAAVEVFPHI PWMHQDHGAS PDVCQRS I 

| M 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 II 1 1 1 1 1 MMMMMMMMM 

E AADQVNAPV I VQ AS AGARKYAGAP FLRHL I LAAVEE FPHI PWMHQDHGAS P DVCQRS I 


120 


gu J 4 




120 




• pep 


QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGDAG 

1 1 1 1 ! 1 1 1 MMhMMIMMI MMMMMMMMM MMMMMMMMM 

• QLGFSSVMMDGSLLEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGEAG 


180 


g034 




180 


m034 


.pep 


EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

MMMMMMMMMMIMI M M M 1 M M M M M M M M M M M M M M 

EEDGVGAAGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 


240 


g034 




240 


m034 


■pep 


RIKEIHQALPNTHIVMH 

M M M M M M M M 1 

RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 


257 


g034 




300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 127>: 

g036. seq 

1 ATGCTGAAGC CGTGTTTGGT ATACAGTGCC TGTGCGGCGG cgttgcCTGC 

51 GCGGACTTCG AGCAGCAGGC GTTGCGTGCC TTCGGGCAGA TGTGCGTACC 

101 AATATTCGAG CAGGGCGGAC GCAACGCCCC GTCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCTGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACAAG CGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGTAT CGAGCGCGGC CAGTGCGGCG CAGTCGGACG GTGAGGCTGG 

351 GCGGATGTTC ATGTTCGTGC CTTCCGTTCC GCCTGTTCTT TGGCAGTCAG 

401 GGCGATTTTG TTGCGGACGT AGAGCAGTTC GGCGTGTGCC GCGCCAGTTG 

4 51 CGGGATAGCC GCCGCCGAGG GCGAGCGCGA GAAAATCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGTG CGAACGCACT 

551 GCCGATGCCG TCTGAAAAGA CGTACCCCTC GGGGAGGGCA ATGTCTGCCG 

601 CCCTACCGAC TTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CACGCATAAA ACACTTCGCC CATACGCGCG TCCGCAGCGG CGAGTATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGTG GGGATGCCGA 

751 TTAAAGGCGT GTCGAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 128; ORF 036.ng>: 

g036.pep 

1 MLKPCLVYSA CAAALPARTS SSRRCVPSGR CAYQYSSRAD ATPRRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA ILPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTVSSAASAA QSDGEAGRMF MFVPSVPPVL WQSGRFCCGR RAVRRVPRQL 

151 RDSRRRGRAR ENRRRSAYRV CLRRADGFPV RTHCRCRLKR RTPRGGQCLP 

201 PYRLDNRSNG GGSACRTTHK TLRPYARPQR RVCSFAAAAA RRRHRAWGCR 

251 LKACRTALPN LAPRRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 129>: 

m036.seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCg.CGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG 

4 51 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 

501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 

651 CAT GC AT AAA ACACTTCGCC CATACGAGCG TCCGTAGCGG CAAGGATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

7 51 TTAAGGGGGT ATCAAACGGC GTTGCCAAAC CCTGAGCTAC ACCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 130; ORF 036>: 

m036 .pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPXSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GSSAYRTMHK TLRPYERP*R QGCSFAAAAA RRRHRARVRR 

251 LRGYQTALPN PELHRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 13 1>: 



a036. seq 






GTACAGTGCC 


1 


ATGCTGAAGC 


CGTGCGCCGT 


51 


ACGGACTTCG 


AGCAGCAGGC 


GTTGCGTGTC 


101 


AATATTCGAG 


CAGGGCGGAC 


GCAATTCCTT 


151 


GTGGCAATCA 


GGTGCAGTTC 


GGATTCGTCG 


201 


AAAGGCGGCA 


ATCCCGCCGT 


CTTTTTCCGC 


251 


GCGAAACCAG 


TGCGGACTCA AATTGGCGTT 


301 


CAGACGGCAT 


CGAGCGCGGC 


GAGTGCGGCG 


351 


GCGGATGTTC 


ACAGGCGCGC 


CCTCCGTTCC 


401 


GGCGATTTTG 


TTGCGGACGT 


AGAGCAGCTC 


451 


CGGGAAAACC 


GCCTTCAGCC 


GCCAGATTGA 


501 


ATATCGGGTT 


TGCCTGAGAA 


GGGCGGACGG 


551 


GCCGATGCCG 


TCTGAAAAGG 


CGCATCCTTC 


601 


CCCGACCGAC 


CTGATAATCG 


CTCAAACGGC 


651 


CATGCATAAA 


ACACTTCGCC 


CATACGTGCG 


701 


GCTTTGCGGC 


GGCGGCAGCG 


AGGCGGCGGC 


751 


TTAAAGGAGT 


ATCAAACGGC 


GTTGCCAAAC 


801 


ATACGCAGTC 


CCGTAA 





This corresponds to the amino acid sequence <SEQ ID 132; ORF 036.a>: 

a036 . pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASAA QSA*TARRMF TGAPSVPPVL WQSRRFCCGR RAARRVPQRR 

151 RENRLQPPD* GSRRRSAYRV CLRRADGFPA RTHCRCRLKR RILPAAGCLP 

201 PDRPDNRSNG GGSACRTMHK TLRPYVRPQR QGCSFAAAAA RRRHRARVRR 

251 LKEYQTALPN LAPRRCRYAV P* 

m036/a036 85.6% identity over a 270 aa overlap 



m036.pep 
a036 



10 20 30 40 50 60 

MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 

I I I I | ! I | I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I ! I I I I I I I M I 

MLKPCAVYSACAAVLPARTSSSRRCVS SGRCVNQYSSRADAI PWRRHSGAVAIRCSS DS S 
10 20 30 40 50 60 



70 80 90 100 110 120 

m036 oeo GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 

MINIMUM IIMIilMIIIIMMMMIIlll I I I I I I I I I I I I 

Q36 GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASAAQSAXTARRMF 
" 70 80 90 100 110 120 
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130 140 150 160 170 180 

m03 6 pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
| | | 1 | I I I I I I I I I I I I I I I I I : I : : I I I I I I : I I I I I I I I :: I I 
a 036 TGAPSVPPVLWQSRRFCCGRRAARRVPQRRRENRLQPPDXGSRRRSAYRVCLRRADGFPA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036 pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
II : I I I I I I I I I I I I I I I I 111111111:11 I I ! I II I I I I M I i I I I I I I I I I 
a0 36 RTHCRCRLKRRILPAAGCLPPDRPDNRSNGGGSACRTMHKTLRPYVRPQRQGCSFAAAAA 

190 200 210 220 230 240 

250 260 270 

m036 . pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 

I I I I I I I I I I I : I I I I M I- : I I I I I I 
a 0 3 6 RRRHRARVRRLKE YQT ALPNLAPRRCR YAVPX 

250 260 270 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 036 shows 74.9% identity over a 271 aa overlap with a predicted ORF (ORF 036.ng) 
from N. gonorrhoeae: 

m036/g036 

10 20 30 40 50 60 

MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 

Mill I I I II 11 : I I I I I I I I I I I I MM: I I I I I I I I I I I I I I I I I I I I I I I I I 
MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 

70 80 90 100 110 120 

GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I I I I i M I I I I II I I II I I M I I II I I I I I I I I M M IM:M I II:: I I I III 
GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 
70 80 90 100 110 120 

130 140 150 160 170 180 

TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 

: | | I 1 I I I I I I I I I I I I I I I I : : I : I I I : I I : I I I I I I I I : : I : 

MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 
130 140 150 160 170 180 

190 200 210 220 230 240 

RTRCLCRLKRRI PPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCS FAAAAA 
I | : | 1 I II I I i : : Mil l II I I I I I : I I II I I II 11 I II I : I I I I II I I 
RTHCRCRLKRRT PRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCS FAAAAA 
190 200 210 220 230 240 

250 260 270 

RRRHRAR VRRLRG YQT ALPN PELHRCR Y AVRX 

M 1 I II I i : : :\ I II I M M M II I 
RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 
250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 133>: 

m036-l . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 



m036 .pep 
g036 

m03 6 .pep 
g036 

m036 .pep 
g036 

m036 .pep 
g036 

m036.pep 
g036 
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351 GCGGATGTTC 

4 01 GGCGATTTTG 

4 51 CAGGATAACC 

501 ATATCCGGTC 

551 GCCTATGCCG 

601 CCCGCCCGAC 

651 CAT G CAT AAA 



ACGGGCGCGC 
TTGCGGACGT 
GCCCTTGGCT 
TGCCTGAGAA 
TCTGAAAAGG 
CTGATAATCG 
ACACTTCGCC 



TCTCCGTTCG 
AGAGCAAACC 
GCCAATGCGA 
CGGCGGAGCT 
CGCATCCCTC 
CTCAAACGGT 
CATACGAGCG 



GCCTGTTCTT 
GGCGTGTGCG 
GAAAGTCGGC 
TCTTCCAGCG 
CGGCAGCCGG 
GGCAGTTCAG 
TCCGTAG 



TGGCAGTCAG 
GCATGGACGG 
GGCAGTCGGC 
CGAACGCGCT 
ATGTCTGCCG 
CGTATCGAAC 



This corresponds to the amino acid sequence <SEQ ID 134; ORF 0036- 1>: 

m036-l .pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 
51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 
101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 
151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 
201 PARPDNRSNG GSSAYRTMHK TLRPYERP* 

m036-i/g036 76.8% identity in 228 aa overlap 

10 20 30 40 50 60 

m03 6-l .pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
EMM I I I 1 I I I : I I I I I I I I I I I I MM: I M M M I I M M M M I I M M M 
g036 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m036-l .pep GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
M M M M M I M M M M M M M I M M M M M M M M M M M : M i Ml 
g03 6 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 3 6-1 .pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: M M M M M M M ! M Ml : MM M M I : M I MM I : : I : 

g03 6 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 229 

m036-l .pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPX 

MM M M M I : : MM I M M M M M M M M M I i I 
g036 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 

190 200 210 220 230 240 



g036 



RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 135>: 

g038 . seq 

1 ATGACTGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTC AACGACGGCG CGTCCACGCT GCAACTGGCA 

151 AAATTCTATG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC GCGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCAATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

451 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTAAATTGT CCGCCGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GC CAGCCTGA 

551 ACGATTTGTT TATCCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GCACCTACCG CCGGCAGTAC GGCGTAGAAT AA 
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This corresponds to the amino acid sequence <SEQ ID 136; ORF 038.ng>: 

g038.pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGASTLQLA 

51 KFYAQSIIES GIRFDMLFGP A YKG 1 1 LAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDRGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GKLSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRTYRRQY GVE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 137>: 

m038.seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAGGC AGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGTCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCcGC CGGTGTCGCC 

451 ATCGCGCTCG ATCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAr AAACAATACG GkCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGC GTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 138; ORF 038>: 

m038 .pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 139>: 

a038.seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTCA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

4 01 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

4 51 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 140; ORF 038.a>: 

a038.pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 

m038/a038 100.0% identity over a 213 aa overlap 

10 20 30 40 50 60 

ra038.pep MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
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I I I I I I I I I I I I I I II I 1 1 1 1 1 1 I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I M 

a038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 

10 20 30 40 50 60 

70 80 90 100 110 120 

m038 . pep GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
I I I I I I II I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
a038 GIRFDMLFG PAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m038.pep IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I If I I I I I I I 
a 038 UDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

m038 . pep ASLNDLFILLQNNPEFGQFLE PVRAYRRQYGVEX 

I I I I I I I I I I I I I I I I I M I I I I I I I i I I I I II I 
a038 ASLNDLFILLQNNPEFGQFLE PVRAYRRQYGVEX 

190 200 210 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 038 shows 98.1% identity over a 213 aa overlap with a predicted ORF (ORF 038.ng) 
from K gonorrhoeae: 

m038/g038 

10 20 30 40 50 60 

MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
! I II I I I I II I I I I I I I I I I ! I I I II I II I II I I I I I I I I I II I I II II I I I I I I I II I 
MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGASTLQLAKFYAQSIIES 
10 20 30 40 50 60 

70 80 90 100 110 120 

GIRFDMLFG PAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 

I I I I II I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I M I I M I I 
G I RFDMLFGPAYKG 1 1 LAAATAMMLAEKGVNVP FAYNRKEAKDRGEGGVLVGAPLKGRVL 

70 80 90 100 110 120 

130 140 150 160 170 180 

IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

II II I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I 
IIDDVISAGTSVRESIKLIEAEGAT PAGVAIALDRMEKGTGKLSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 
I I I I I I I I I I I I I I I I I I I I I I I I : I I II I I I I I 
ASLNDLFILLQNNPEFGQFLEPVRTYRRQYGVEX 
190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 141>: 

g039 . seq 

1 ATGCCGTCCG AACCACCTGC CGCTTCAGAC GGCATCAAAC CGACACACAC 

51 CGAGAAAACA TCATGCCCGC CTGTTTCTGT CCGCACTGCA AAACCCGCCT 

101 CTGGGTCAAA GAAAcccagC TCAAcgtCgC ccaagGCTTC GTCGTCTgcc 

151 aaAAAtgcga agGGCTgttt aaAgccaaaG accAtctggc aaGcacGAAA 

201 gaacctatat tcaacgattg gcccgaagct gtttcgggat gTcaaaCTCG 



m038.pep 
g038 

m038 . pep 
g038 

m038 .pep 
g038 

m038 . pep 
g038 
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2 51 TCcaccgcaT cggcacgcac gccattagca aGAaacagat gtcccgcgac 

3 01 gaaatCgccg atatcctcaa cggcggtaca acCCTGCACG ATACGCCGCC 

3 51 CGCAACCGCC GCTGCCGCac CtGCCGCCGC ACCGCaggTT TCCGTACCGC 

4 01 CCGCCCGTCA GGAAGGGCTC AACTGGACTA TTGCAACCCT GTTCGCACTT 
4 51 ATCGTCCTCA TTATGCAGCT TTCCTACCTC TTCATCCTAT GA 



This corresponds to the amino acid sequence <SEQ ID 142; ORF 039.ng>: 

g03 9 .pep 

1 MPSEPPAASD GIKPTHTEKT SCPPVSVRTA KPASGSKKPS STSPKASSSA 

51 KNAKGCLKPK TIWQARKNLY STIGPKLFRD VKLVHRIGTH AISKKQMSRD 

101 EIADILNGGT TLHDTPPATA AAAPAAAPQV SVPPARQEGL NWTIATLFAL 

151 IVLIMQLSYL FIL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 143>: 

m039 . seq 

1 ATGCCGTCCG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CGnnnnnnnn nnnnnnnnnn 

151 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

2 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnCCC GAGGCTGTTT 

2 51 CGGATGTCAA ACTCGTTCAC CGTATCGGCA CGCGCGCCAT CGGCAAGAAA 

3 01 CAGATTTCCC GTGACGAAAT CGCCGGCATC CTCAACGGCG GTACAACCCA 

3 51 GCCCGATATT CCGCCCGCAA CCGCCGCCAC CCCTGCTGCC GCACCGCAGG 

4 01 TTACCGTACC GCCCGCCGCG CCCGCCCGTC AGGATGGGTT CAACTGGACG 
4 51 ATTGCAACCC TGTTTGCCCT TATCGTCCTC ATTATGCAGC TTTCCTACCT 
501 CGTCATCCTA TGA 



This corresponds to the amino acid sequence <SEQ ID 144; ORF 039>: 

m039.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP EAVSDVKLVH RIGTRAIGKK 

101 QISRDEIAGI LNGGTTQPDI PPATAATPAA APQVTVPPAA PARQDGFNWT 

151 IATLFALIVL IMQLSYLVIL * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 145>: 

a039.seq 

1 ATGCCGTCTG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CCAAGGCTTC GTCGTCTGCC 

151 AAAAATGCGA AGGAATGTTT AAAGCCAAAG ACCATCTGGC AAGCACGAAA 

201 GAACCCATAT TCAACGATT . TGCCCGAAGC TGTTTCGGAT GTCAAACTCG 

251 TTCACCGCAT CGGCACGAGC GCCATCGGCA AGAAACAGAT TTCCCGTGAC 

301 GAAATCGCCG GCATCCTCAA CGGCGGCACA ACCCAGCCCG ATATTCCGCC 

351 CGCAACCGCC GCCACCCCTG CTGCCGCACC GCAGGTTACC GTACCGCCCG 

401 CCGCGCCCGC CCGTCAGGAT GGGTTCAACT GGACGATTGC AACCCTGTTT 

451 GCCCTTATCG TCCTCATTAT GCAGCTTTCC TACCTCGTCA TCCTATGA 

This corresponds to the amino acid sequence <SEQ ID 146; ORF 039. a>: 

a039.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPKASSSA 

51 KNAKECLKPK TIWQARKMPY STIXPEAVSD VKLVHRIGTS AIGKKQISRD 

101 EIAGILNGGT TQPDIPPATA ATPAAAPQVT VPPAAPARQD GFNWTIATLF 

151 ALIVLIMQLS YLVI L* 



m039/a039 79.4% identity over a 170 aa overlap 

10 20 30 40 50 60 

m039.pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 
I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

a039 MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPKASSSAKNAKECLKPK 
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10 20 30 ■ 40 50 60 

70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 
: : I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I | | | 

a039 TIWQARKNPYSTIX PEAVSDVKLVHRIGTSAIGKKQISRDEIAGILNGGTTQPDI 

70 80 90 100 110 



130 140 150 160 170 

m039.pep PPATAAT PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 

I I t I I I II II I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a03 9 PPATAAT PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 

120 130 140 150 160 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 039 shows 60.8% identity over a 171 aa overlap with a predicted ORF (ORF 039.ng) 
from N. gonorrhoeae: 

m039/g039 



10 20 30 40 50 60 

m03 9 .pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

II I III Mill II h llllh lllhlllllhl II 

g03 9 M P S E P PAASDG I KP THTE KTS C P PVS VRTAKPAS GS KKP S S TS P KAS S S AKNAKGCLKPK 

10 20 30 40 50 60 



70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 

h III I I ! I I I I M ! H I hi I I I M IMIIII I 
g039 T I WQARKNL YS TIG PKLFRDVKLVHRIGTHAISKKQMSRDEIADILNGGTTLHDT 

70 80 90 100 110 



130 140 150 160 170 

m03 9.pep PPATAAT- PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 

Illllh Mill Ihl II I Ihhl Mill MM I II II I MM III 

g039 PPATAAAAPAAAPQVSVPPA- - -RQEGLNWTIATLFALIVLIMQLSYLFILX 

120 130 140 150 160 



The following partial DNA sequence 



g040 . seq 






1 


ATGAACGCGC 


CCGACAGCTT 


51 


CATCCGCCAA 


ATGCGCGGCA 


101 


TGCTCGAAGG 


CGGCACCTTA 


151 


TCGCAACTGG 


GCATCCGACT 


201 


CGAccgCCTC 


GCCGCCGCGC 


251 


tGCGCGTTAC 


CGACGaAACc 


301 


AccgTCCGCA 


GCCGTTTTGA 


351 


cgcgCGCGCG 


CCTTCCGTCC 


401 


GTCcgatggg 


cgtgattgac 


451 


cgcaaaaccg 


ACACCGCCGC 


501 


CGTCTGGATG 


CCGCCGCTCG 


551 


TCGATATGGT 


GCAGGCCGCC 


601 


AAACTCGTTT 


ACCTGACCCT 


651 


GCTCGCCGAA 


ACCCTCTCGG 


701 


CCGCCAGCGA 


AACCCGACGA 


751 


GGCGGCGTGC 


ATCGCGTCCA 


801 


GCTGCAAGAA 


CTCTTCACCC 


851 


AAGCCTTCGT 


CTCCATCCGG 


901 


GCCGCCCTCA 


TCCGCCCGCT 


951 


CCGCGAATAC 


CTCGAAAACC 



: identified in N. gonorrhoeae <SEQ ID 147>: 

TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 
CGACACTGGT CGCCGGCATA GAcggCCGCC 
AATAAGCTCG CCGCCGACAT CGGGCTGTTG 
CGTCCTCATC CACGGCGCGT ACCACTTCCT 
AAGgccGCAC GCCGCATTAT TGCCGgggtt 
tcGctcgGAC AGGCGCAGCA GtttGCCGGC 
agcCGCATTG tgcggcagCG tttcaggatt 
CGCTCGTAtc gggcaacttc ctgacCGCCC 
ggaACCGata tggaatacgc gggggttatc 
CCTCCGTTTC CAACTCGACG CGGGCAATAT 
GGCATTCCTA CGGCGGCAAA ACCTTCAATC 
GCTTCCGTCG CCGTCTCGCT TCAGGCCGAA 
TTCAGACGGC ATTTCCCGCC CCGACGGCAC 
CACAGGAAGC GCAATCGCTG GCGGAACACG 
CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 
AATCCTCAAC GGGGCCGCCG ACGGCAGCCT 
GCAACGGCAT CGGCACGTCC ATTGCCAAAG 
CAGGCGCACA GCGGCGACAT CCCGCACATC 
GGAAGAACAG GGCGTCCTAT TGCACCGCAG 
ACATTTCCGA ATTTTCCATC CTCGAACACG 
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1001 ACGGCGACCT GTACGGCTGT GCCGCACTCA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGg 

1101 CtACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGCTGCCCG AAACGCGGCG 

12 51 CAAAGACTAC CGCAGCAACG GACGAAACCC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 148; ORF 040.ng>: 

g040 -pep 

1 MNAPDSFVAH FREAAPYIRQ MRGTTLVAGI DGRLLEGGTL NKL AADIGLL 

51 SQLGIRLVLI HGAYHFLDRL AAAQGRTPHY CRGLRVTDET SLGQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPMGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWM PPLGHSYGGK TFNLDMVQAA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAASETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GVLLHRSREY LENHISEFSI LEHDGDLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNPHIL VRRLHR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 149>: 

m040 . seq 

1 ATGAGCGCGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGTCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCTC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAgCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCTATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGCCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGGCA AACGCGACGG CTGATTTCGT CCGCCGAACT CTTCACCCGC 

751 AACGG CAT CG GCACGTCCAT TGCCAAAGAA GCCTTCGTCT CCATCCGGCA 

801 rGCGCAywgG G.CGACATCC CGCACATCGC CGCCCTCATC CGCCCGCTGG 

851 AAGAACAGGG CATCCTGCTG CACCGCAs . c GCGAATACCT CGAAAACCAC 

901 ATTTCCGAAT TTTCCATCCT CGAACACGAC GGCAACCTGT ACGGTTGCGC 

951 CGCCCTGAAA ACCTTTGCCG AAGCCGATTG CGGCGAAATC GCCTGCCTTG 

1001 CCGTCTCGCC GCag . cACAG GACGGCGGCT ACGGCGAACG CnTGCTTGCC 

1051 CACATTATCG ATAAGGCGCG CGGCATAGGC ATAAGCAGGC TGTTCGCACT 

1101 GTCCACAAAT ACCGGCGAAT GGTTTGCCGA ACGCGGCTTT CAGACGGCAT 

1151 CGGAAGACGA GTTGCCCGAA ACGCGGCGCA AAGACTACCG CAGCAACGGA 

1201 CGGAACTCGC ATATTCTGGT ACGTCGCCTG CACCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 150; ORF 040>: 

m040 . pep 

1 MSAPDLFVAH FREAVPYIRQ MRGKTLVAGI DDRLLEGDTL NKL AADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFYLDMLQTA ASAAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAGGQTRR LISSAELFTR 

251 NGIGTSIAKE AFVSIRQAHX XDIPHIAALI RPLEEQGILL HRXREYLENH 

301 ISEFSILEHD GNLYGCAALK TFAEADCGEI ACLAVSPQXQ DGGYGERXLA 

351 HIIDKARGIG ISRLFALSTN TGEWFAERGF QTASEDELPE TRRKDYRSNG 

4 01 RNSHILVRRL HR* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 151>: 



a040 . seq 
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1 


ATGATCGTGC 


CCGACCTCTT 


51 


CATCCGCCAA 


ATGCGCGGCA 


101 


TGCTCGAAGG 


TGATACCTTA 


151 


TCGCAACTGG 


GCATCAGGCT 


201 


CGACCGCCAC 


GCCGCCGCGC 


251 


TGCGCGTTAC 


CGACGAAACC 


301 


ACCGTCCGCA 


GCCGTTTTGA 


351 


CGCGCGCGCG 


CCTTCCGTCC 


401 


GTCCGATAGG 


TGTGATTGAC 


451 


CGCAAAACCG 


ACACCGCCGC 


501 


CGTCTGGCTG 


CCGCCGCTCG 


551 


TCGATATGCT 


TCAAACCGCC 


601 


AAACTCGTTT 


ACCTGACCCT 


651 


GCTCGCCGTA 


ACCCTCTCGG 


701 


CCGGCGGCGA 


AACGCGACGG 


751 


GGCGGCGTGC 


ATCGCGTCCA 


801 


GCTGCAAGAA 


CTCTTCACCC 


851 


AAGCCTTCGT 


CTCCATCCGG 


901 


GCCGCCCTCA 


TCCGCCCGCT 


951 


CCGCGAATAC 


CTCGAAAACC 


1001 


ACGGCAACCT 


GTACGGTTGC 


1051 


TGCGGCGAAA 


TCGCCTGCCT 


1101 


CTACGGCGAA 


CGCCTGCTTG 


1151 


GCATAAGCAG 


GCTGTTCGCA 


1201 


GAACGCGGCT 


TTCAGACGGC 


1251 


CAAAGACTAC 


CGCAGCAACG 


1301 


TGCACCGCTG 


A 



TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 
AAACGCTGGT CGCCGGCATA GACGACCGCC 
AACAAGTTCG CCGCCGACAT CGGGCTTTTG 
CGTCCTCATC CACGGCGCGC GCCACTTCCT 
AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 
TCGCTCGAAC AGGCGCAGCA GTTTGCCGGC 
AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 
CGCTCGTATC GGGCAACTTC CTGACCGCCC 
GGAACCGATA TGGAATACGC GGGCGTTATC 
CCTCCGTTTC CAACTCGACG CGGGCAATAT 
GACATTCCTA CAGCGGCAAG ACCTTCCATC 
GCCTCCGTCG CCGTCTCGCT TCAGGCCGAA 
TTCAGACGGC ATTTCCCGCC CCGACGGCAC 
CACAGGAAGC GCAATCGCTG GCGGAACACG 
CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 
AATCCTCAAC GGAGCCGCCG ACGGCAGCCT 
GCAACGGCAT CGGCACGTCC ATTGCCAAAG 
CAGGCGCACA GCGGCGACAT CCCGCACATT 
GGAAGAACAG GGCATCCTGC TGCACCGCAG 
ACATTTCCGA ATTTTCCATC CTCGAACACG 
GCCGCCCTGA AAACCTTTGC CGAAGCCGAT 
TGCCGTCTCG CCGCAGGCAC AGGACGGCGG 
CCCACATTAT CGATAAGGCG CGCGGCATAG 
CTGTCCACAA ATACCGGCGA ATGGTTTGCC 
ATCGGAAGAC GAGTTGCCCG AAACGCGGCG 
GACGGAACTC GCATATTCTG GTGCGTCGCC 



This corresponds to the amino acid sequence <SEQ ID 152; ORF 040.a>: 

a04 0 . pep 

1 MIVPDLFVAH FREAAPYIRQ MRGKTLVAGI DDRLLEGDTL NKF AADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFHLDMLQTA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAV TLSAQEAQSL AEHAGGETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GILLHRSREY LENHISEFSI LEHDGNLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNSHIL VRRLHR* 



m040/a040 91.5% identity in 436 aa overlap 



10 20 30 40 50 60 

m04 0.pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 
I : I I I 1 I i M I i I : I I I I I M I I I I I I I I I I I I I I I I I I I i : I I I I ! I I I I I I I I I I I I 
a040 MIVPDLFVAH FREAAPYIRQMRGKTLVAGIDDRLLEGDTLNKFAADIGLLSQLGIRLVLI 

10 * 20 30 40 50 60 



70 80 90 100 110 120 

m04 0.pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 
I I I I I I M I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I ! ! I I I I I I I I I I I I I I I M ! 
a04 0 HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 

70 80 90 100 110 120 



130 140 150 160 170 180 

m04 0.pep PSVPLVSGNFLTARPIGVIDGTDMEYAGV I RKTDTAALRFQLDAGN I VWL PPLGHSYSGK 

I ! I I t I I I I I I I I I I I I I I M I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I II I I i I t I I I 
a040 PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 

130 140 150 160 170 180 



190 200 210 220 230 240 

m04 0.pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 

I I : I I I I I I I I I : I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I : I I I 

a04 0 TFHLDMLQTAASVAVSLQAEKLVYLTLSDGISRPDGTLAVTLSAQEAQSLAEHAGGETRR 
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250 260 270 

m040.pep LISSA ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I | M | | 

a04 0 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 

250 260 270 280 290 300 

280 290 300 310 320 330 

m04 0 . pep AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKT FAEADCGEIACLAVS 

I I II I I I I I I I I II I I I I I I I I I I II II I I I I I I II I I I I I I I I I I I I I I i I I | I | | | | 
a04 0 AALIRPLEEQGILLHRSREYLENHISEFSILEHDGNLYGCAALKT FAEADCGEIACLAVS 

310 320 330 340 350 360 

340 350 360 370 380 390 

m04 0.pep PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

II I I II I II I I I I I I I I II II I I I I I I I I I I I II I II I I I I (I I I I I I I I 1 I I I I I | | 
a04 0 PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

370 380 390 400 410 420 

400 410 
m04 0 . pep RSNGRNSHILVRRLHRX 
II I I I I I I I I 1 I I I I I I 
a04 0 RSNGRNSHILVRRLHRX 

430 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 040 shows 88.3% identity over a 436 aa overlap with a predicted ORF (ORF 040.ng) 
from N. gonorrhoeae: 

m04O/g04O 



m04 0.pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 60 

I : I I I I I I I I I I I : I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I | M 
g04 0 MNAPDSFVAHFREAAPYIRQMRGTTLVAGIDGRLLEGGTLNKLAADIGLLSQLGIRLVLI 60 

m04 0.pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 120 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I 
g040 HGAYHFLDRLAAAQGRTPHYCRGLRVTDETSLGQAQQFAGTVRSRFEAALCGSVSGFARA 120 

mO 4 0 . pep PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHS YSGK 180 

I I II I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I : I I I I I I I : I | 

g04 0 PSVPLVSGNFLTARPMGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWMPPLGHSYGGK 180 

m04 0 . pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 24 0 

II M I : I : I I I : I I I I I I I I i I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I ::: | | | 

g04 0 TFNLDMVQAAASVAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAASETRR 24 0 

m040.pep LISSA ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI 27 6 

I I I I I 1 I I I I I I I II I I I I I I I I I I I I I I Mill 

g04 0 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 300 

m04 0 . pep AAL IRPLEEQG I LLHRXRE YLENHI SE FS I LEHDGNLYGCAALKT FAEADCGE IACLAVS 33 6 

I I I I I II I I I I : I I I I I I I I I I I ! I I I I I I I I I I : I I I I I I I I I I I I I | | | | | | | | | | | 

g04 0 AALIRPLEEQGVLLHRSREYLENHISEFSILEHDGDLYGCAALKTFAEADCGE IACLAVS 3 60 

m040.pep PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 396 

II I I I I I I I I I i I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II | | | | | | | 
g040 PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 4 20 

m04 0.pep RSNGRNSHILVRRLHRX 413 

I I I I I I I I I I I I I I II 
g04 0 RSNGRN PH I LVRRLHRX 4 37 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 153>: 

g04 1 . seq 

1 ATGAGTTCGC CCAAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGCCT 

51 GATTACCGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGTGCGCTGG 

101 TGTGCGAAGT ACCGCTGACC GATATGATCC GTTATCCGCT GCTGTCCGGC 

151 GGTTCAAGTT GGACGGACGA ATACGGCAAT CCGCAGAAAT ACGAAGCCTG 

201 CAAACGCCGG CTGGGCGAAT TGTCGCCGTA TCACAATCTT T C AG AC GGC A 

251 TCGATTATCC GCCCGCACTC ATTACCACCA GCCTCAGCGA CGACCGCGTC 

301 CATCCCGCCC ACGCGCTCAA ATTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC AAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

4 51 GAATTTTTGG GATAA 

This corresponds to the amino acid sequence <SEQ ID 154; ORF 04Lng>: 

g04 1 . pep 

1 MSSPKHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWTDEYGN PQKYEACKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQRESAD KLACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 155>: 

m041 . seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCCGC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC GAACTCGCCT GCGTCTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 156; ORF 041>: 

m04 1 . pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWTDEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSAQSW LYSPDGGGHT GNGTQRESAD ELACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified inN. meningitidis <SEQ ID 157>: 

a041 . seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATA GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CGCAGCGCGA AGCCGCCGAC GAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 158; ORF 041. a>: 

a041 .pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWTDEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQREAAD ELACVLLFLK 
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151 EFLG* 

m04i/a04i 98.7% identity over a 154 aa overlap 

10 20 30 40 50 60 

m04 1 . pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || | | | | | | | | | | 
a04l ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 1 . pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I I t I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I | | | || | I | | | | | | | | IN 
a041 PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 

130 140 150 

m041 .pep LYSPDGGGHTGNGTQRESADELACVLLFLKEFLGX 
I I I I I I I I I I I I I I I I I : I I I I I I I I I II I I I I I I 
a 0 4 1 LYS PDGGGHTGNGTQREAADELACVLLFLKE FLGX 

130 140 150 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 041 shows 96.8% identity over a 154 aa overlap with a predicted ORF (ORF 041. ng) 
from N. gonorrhoeae: 

m041/g041 

10 20 30 40 50 60 

m041 .pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
: I I I : I I I ! I I I I I I I I I I I I i I I II I I f i I I I I I I I I I I I I I I I I I I I | I | | | | | I I i I 
g041 MSSPKHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 1 . pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I M M: I I I IMMIMIII I JMIIIMM IMIII MMMMMI II Ml M I III 
g041 PQKYEACKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 

130 140 150 

m041 .pep LYSPDGGGHTGNGTQRESADELACVLLFLKE FLGX 

I I II I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I 
g04 1 LYSPDGGGHTGNGTQRESADKLACVLLFLKEFLGX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 159>: 

g041-l.seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCG CGCGCACTTT CAGACGGCAT TTTGAATCAA 

151 ATGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAGAATG CGGAATATCC GAAGGGCGTG TACCGCATGT 

251 GTACGGCGGC GACCTACCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGATGA GTTGCTCGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAGC AGCCCAACCG CGCGCTGCTG ACTTTGAACA 

4 01 AATCGGGCGG CGATACGGCG TATACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAATTGGTAG AGGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAAG TGTGGCTGGT GGAACGCGGC 
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601 AAGAGTTTCG AGGAAAGCCT GCCGGCGTAC CAAATCGATA AAGGCGCGAT 
651 GATGGTAAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 
7 01 TGATTGAAGC GTCGGACGGT TTTTACACCA AGACGTATTT GCAGGTGTCG 
7 51 TCCGAAGGCG GGGCGAAACC GTTGAACCTG CCTAATGATT GCGATGTGGT 
801 CGGCTATCTG GCGGGACATC TTTTGCTGAC GCTGCGCAAG GACTGGCACC 
851 GCGCGAACCA AAGCTATCCG AGTGGCGCGT TGGTGGCGGT GAAACTGAAT 
901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 
951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCAAGCCTGC 
1001 TGGAGAATGT ACAAGGCCGT CTGAAAGCGT GGCGGTTTGC CGACAGCAAA 
1051 TGGCAGGAAG CCGAGTTGCC GCACCTGCCC TCGGGCGCGT TGGAAATGAC 
1101 CGACCAACCG TGGGGCGGCG ACGTGGTTTA TCTTGCCGCC AGCGATTTCA 
1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 
1201 GTCATGCGCC TCCAGCCGCA GCAGTTTGTT TCAGACGGCA TCGAAGTGCG 
1251 GCAGTTTTGG GCGGTGTCGT CCGACGGCGA ACGCATTCCT TATTTCCACG 
1301 TCGGCAAAAA CGCCGCGCCC GACACGCCGA CCTTAGTCTA TGCTTACGGA 

1351 GGTTTCGGCA TTCCTGAATT GCCGCATTAT CTGGGCAGCG TCGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCAAACATC CGCGGCGGCG 

14 51 GAGAATT CGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAC 

1501 AAAAGCGTTG ATGATTTGTT GGCAGTCGTG CGTGATTTGT CCGAACGCGG 

1551 CATGAGTTCG CCCAAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGCC 

1601 TGATTACCGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGTGCGCTG 

1651 GTGTGCGAAG TACCGCTGAC CGATATGATC CGTTATCCGC TGCTGTCCGC 

17 01 CGGTTCAAGT TGGACGGACG AATACGGCAA TCCGCAGAAA TACGAAGCCT 

17 51 GCAAACGCCG GCTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCACT CATTACCACC AGCCTCAGCG ACGACCGCGT 

1851 CCATCCCGCC CACGCGCTCA AATTCTACGC CAAACTGCGC GAAACCTCGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CAAACTCGCC TGCGTGTTGC TGTTTTTGAA 

2001 AGAATTTTTG GGATAA 

This corresponds to the amino acid sequence <SEQ ID 160; ORF 041-l.ng>: 

g041-l . pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILNQ 

51 MQDTRQIPFC QEHRARMYHF HQNAEYPKGV YRMCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLNKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPAY QIDKGAMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 SEGGAKPLNL PNDCDWGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADSK 

351 WQEAELPHLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRLQPQQFV SDGIEVRQFW AVSSDGERIP YFHVGKNAAP DTPTLVYAYG 

4 51 GFGIPELPHY LGSVGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV RDLSERGMSS PKHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEACKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQRESADKLA CVLLFLKEFL G* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 161>: 

m041-l.seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAGAAAA CGACAAGGCG CGCGCGCTTT CAGACGG CAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAGGACG CGGAGTATCC GAAGGGCGTG TACCGCGTGT 

251 GTACCGCGGC GACGTATCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTTGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

4 01 AATTGGGCAG CGATACGGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAGTTGGTCG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG AACGAACGCC 

551 AGTTGACCCA ATCGGGCTAT CCGCGCGAAG TATGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCTGTGTAT CAAATCGGCG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ATCTCGATCC GCAGGGTTCG CCGATTGATT 

701 TGATTGAAGC GTCGGACGGT TTTTACACCA AAACCTATTT GCGGGTCTCA 
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7 51 GCCGAAGGCG AGGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTGGT 

801 CGGCTATCTG GCGGGGCATC TTTTGCTGAC GCTGCGCAAG GACTGGAACC 

851 GCGCGAACCA AAGCTATCCG AGCGGCGCGC TGGTGGCGGT GAAGCTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCGAGCCTGT 

1001 TGGAGAACGT ACAAGGCCGT CTGAAAGCAT GGCGGTTTGC CGACGGCAAA 

1051 TGGCAGGAAG TCGAATTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCT TGGGGCGGCG ACGTGGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

1401 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 

1451 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG CGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

1751 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCCG 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CGAACTCGCC TGCGTCTTGC TGTTTTTGAA 

2001 AGAGTTTTTG GGCTAA 



This corresponds to the amino acid sequence <SEQ ID 162; ORF 041-1>: 

m041-l .pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLENDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKLGSDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW NERQLTQSGY PREVWLVERG 

201 KSFEESLPVY QIGEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLRVS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWNRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADGK 

351 WQEVELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV RDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSAQSWLYS PDGGGHTGNG 

651 TQRESADELA CVLLFLKEFL G* 



m041-l/g041-l 94.6% identity in 671 aa overlap 

10 20 30 40 50 60 

m041-l .pep MKSYPDPYRH FENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 
MMMIMMMIIIIIIIIIMIIIIIIIIIIhllllMIIIIII l:!illl||H 
g041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILNQMQDTRQIPFC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 1-1 . pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

m 1 1 r [ i f 1 1 1 r : 1 1 e 1 1 1 1 1 1 : i r ( 1 1 1 1 1 r 1 1 1 r i i 1 1 1 1 1 r i i j i [ r 1 1 1 1 1 1 1 1 f i 

g041-l QEHRARMYHFHQNAEYPKGVYRMCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m041-l.pep LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
111111111111:1 I : I I I M I M I M I M I I I I I I I I I I 1 I | I M II I 1 I M 1 I M I I 
g041-l LVEQPNRALLTLNKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m041-l.pep NERQLTQSGY PREVWLVERGKS FEE SLPVYQIGEDGMMVNAWRYLDPQGS PIDLIEASDG 
: I I I I I M I I I I I I I I | | | | | | | | | | | | : | | | : : I I | I | || I I I I j I I I ! I I I | | | | | 
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g041-l DERQLTESGYPREVWLVERGKSFEESLPAYQIDKGAMMVNAWRYLDPQGSPIDLIEASDG 

190 200 210 220 230 240 

250 260 270 280 290 300 

m041-l .pep FYTKTYLRVSAEGEAKPLNLPNDCDWGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 
I I I i I I I : I I : i I I > I I I I I i I M I I M I I I I I I I I I I I i I : I I I I I I 1 I | | | | | | | | | 
g041-l FYTKTYLQVSSEGGAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m04 1-1 . pep RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 
I I I I M I I I I I I I I I I I I I I I I I I t I I I I M I I i I I I I I I I I i I I I I I : I | | I : | | | : | ! 
g041-l RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADSKWQEAELPHLP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m041~l .pep SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I 1 I I I I I 1111:1:111 
g041-l SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRLQPQQFVSDGIEVRQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

m04 1-1 . pep TTSADGERI PYFHVGKNAAPDMPTLVYAYGGFGI PELPHYLGS IGKYWLEEGNAFVLANI 
I : I I I I I I I I M I I I I I I I I I ( I I I I M I I I M I I I I I I f : I I I I I I I I ! I I I I I I I 
■ g04 1-1 AVSSDGERIPYFHVGKNAAPDTPTLVYAYGGFGI PELPHYLGS VGKYWLEEGNAFVLANI 

430 440 450 460 470 480 

490 500 510 520 530 540 

mO 4 1-1 .pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M II : I I I : I I I I I I I I I | | | | | | | | | 
g041-l RGGGE FG PRWHQAAQGI SKHKS VDDLLAVVRDLSERGMS S PKH I GLQGGSNGGLITAAAF 

490 500 510 520 530 540 

550 560 570 580 590 600 

ra041-l .pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I ! I I I I I I I I I I M I I I I II I I I I I I I I ! I I I I I I I I I I I I I : I | | || | | | | | | | | | | | | 
g041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEACKRRLGELSPYHNLSDG 

550 560 570 580 590 600 

610 620 630 640 650 660 

m04 1-1 . pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I i I I I I : I I 
g041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQRESADKLA 

610 620 630 640 650 660 



670 

mO 4 1- 1 . pep CVLLFLKEFLGX 
I I I I I I I I I I I I 
g 0 4 1 - 1 C VLLFLKE FLGX 

670 



m041-l/P55577 

sp|P55577|Y4NA_RHISN PROBABLE PEPTIDASE Y4NA >gi 12182536 (AE000086) Y4nA [Rhizobium sp. 
NGR234] Length « 72 6 
Score = 370 bits (940), Expect = e-101 

Identities - 217/682 (31%), Positives - 331/682 (47%), Gaps = 22/682 (3%) 

Query: 2 KSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFCQ 61 

K DP + +D + + N T + ++ + L LQ T +1 

Sbjct: 42 KDASDPRAYLNEIDGDKAMTWVEAHNLSTVDKLSKDPRYSEYQADALTILQATDRIASPS 101 

Query : 62 EHRARMY-HFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 120 

R M +F QD + +G++R T +YRSG P+W+ + V + G G 
Sbjct: 102 FARDGMIDNFWQDGTHVQGLWRRTTWESYRSGNPQWRTILDVDALSKAEGKTWVFEGGDC 161 

Query: 121 LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 180 
L N L+ LS G D E D+ GE V+ GF P GK V+W DEN+++V W 
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Sbjct: 162 LPPTSNLCLIRLSDGGKDADVVREFDIAKGEFVKEGFVLPEGKQSVTWVDENTIYVTREW 221 

Query: 181 NERQLTQSGYPREVWLVERGKSFEESLPVYQ IGEDGMM--VNAWRYLDPQGSPI 232 

++T SGY +V+RG+S ++++ +++ E G++ ++ +D + 

Sbjct: 222 TPGEVTSSGYAYVTKVVKRGQSLDQAVEIFRGQKKDVSAERGVLRDIDGKYVMDTSYRGL 281 

Query: 233 DLIEASDGFYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQS-YPS 291 

D FY + + L LP GY G + L+ DW A + + + 

Sbjct: 282 DFFNTELAFYPNGH PDTRKVVLPLPTTAVFSGYYKGQAI YWLKSDWTSAKGTVFHN 337 

Query: 292 GALVAVKLNRGELGAAQL LFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFA 347 

GA++A L A++ LF P+E Q++ TK +V S+L NV ++++ F 

Sbjct: 338 GAIIAFDLKAALADPARVEPLVLFMPNEHQSVAGTTQTKNRLVLSILSNVTSEVRSFDFG 397 

Query: 34 8 DGKWQEVELPRLPSGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQ 4 07 

G W +L + L +T D +++ + F P TLF D ++ + p 

Sbjct: 398 KGGWSSFKLALPENSTLSLTSSDDESDQLFVFSEGFLEPSTLFCADAATGQVEKITSTPA 4 57 

Query: 408 QFDSDGINVQQFWTTSADGERIPYFHVGKNAAP DMPTLVYAYGGFGI PELPHYLGS I 464 

+FD+ G+ QQFW TS.DG ++PYF V + PT++YAYGGF IP p Y + 

Sbjct: 458 RFDAGGLQAQQFWATSKDGTKVPYFLVARKDVKLDGTNPTILYAYGGFQIPMQPSYSAVL 517 

Query: 465 GKYWLEEGNAFVLANIRGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHI 524 

GK WLE+G A+ LANIRGGGEFGP+WH A ++ + DD AV +DL + ++S H+ 
Sbjct: 518 GKLWLEKGGAYALANIRGGGEFGPKWHDAGLKTNRQRVYDDFQAVAQDLIAKKVTSTPHL 577 

Query: 525 GLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVC 584 

G+ GGSNGGL+ ++ P A+V +VPL DM+ + +SAG+SW EYG+P V 

Sbjct: 578 GIMGGSNGGLLMGVQMIQRPDLWNAVVIQVPLLDMVNFTRMSAGASWQAEYGSPDD-PVE 636 

Query: 585 KRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGG 644 

L +SPYHN+ G+ YP TS DDRV PHAKA + +Y G 

Sbjct: 637 GAFLRSISPYHNVKAGVAYPEPFFETSTKDDRVGPVHARKMAALFEDMGLPFYYYENIEG 696 

Query: 64 5 GHTGNGTQRESADELACVLLFL 666 

GH +E A A +++ 

Sbjct: 697 GH AAAANLQEHARRYALEY I YM 718 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 163>: 

a041-l.seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCA CGCGCATTGT CTGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAAAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAAGATG CGGAATATCC GAAAGGCGTG TACCGCGTGT 

251 GTACCGCGGC GACTTACCGT TCGGGCTATC CTGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTCGGT GACGATGTAT ATCTAGGCGG 

351 CGTGTCGCAC CTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

4 01 AATCGGGCGG CGATACCGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAGTTGGTAG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAGG TGTGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCGGTGTAC CAAATTGCTG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

7 01 TGATTGAAGC GTCTGACGGT TTTTACACCA AAACCTATTT GCAGGTCTCA 

7 51 GCCGAAGGCG AAGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTAGT 

801 CGGCTATCTG GCCGGACATC TTTTGCTGAC CTTGCGTAAA GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCAT TGGTAGCAGT AAAATTAAAC 

901 CGCGGCGAAT TGGGCGCGGC GCAGCTTTTG TTTGCGCCCA ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTCGTG GCGAGCCTGC 

1001 TGGAAAACGT ACAGGGTCGT CTGAAAGCGT GGCGTTTTAC TGATGGCAAA 

1051 TGGCAGGAAA CCGAGTTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCG TGGGGGGGCG ACGTAGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCATTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 
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14 51 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG AGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT AGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

17 01 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

1751 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACGCAGCGCG AAGCCGCCGA CGAACTCGCC TGCGTGTTGC TGTTTTTGAA 

2001 AGAGTTTTTG GGCTAA 



This corresponds to the amino acid sequence <SEQ ID 164; ORF 041-l.a>: 

a041-l.pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPVY QIAEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPNETQALE SVETTKRFVV ASLLENVQGR LKAWRFTDGK 

351 WQETELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV SDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQREAADELA CVLLFLKEFL G* 



a041-l/m041-l 97.9% identity in 671 aa overlap 

10 20 30 40 50 60 

a041-l.pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILAQLQDTRQIPFC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I : I II I I I I I I I I I I | | | | M I I I I I 
m041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 

10 20 30 40 50 60 



70 80 90 100 110 120 

a041-l.pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | i | I I I I I I I I I I 
m041-l QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

a041-l .pep LVEQPNRALLTLSKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I I I I I I I I I I I I I I I : I I I I I I I | | | | I | | | | | | | | | | | 1 | | | | | | | | | | | | | | | | | | | 
m041-l LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

a041-l.pep DERQLTESGYPREVWLVERGKSFEESLPVYQIAEDGMMVNAWRYLDPQGSPIDLIEASDG 
• I I It I :! Ml M M I I I I I I I Ml I I I ! I I |: I M | | | || | | | | | M I I I I I i I I I II I 
m041-l NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGSPIDLIEASDG 

190 200 210 220 230 240 

250 260 270 280 290 300 

a041-l.pep FYTKTYLQVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 
M I I I I I : I M I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I 
Iti041-1 FYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 

250 260 270 280 290 300 

310 320 330 340 350 360 

a04 1-1 . pep RGELGAAQLLFAPNETQALESVETTKRFVVASLLENVQGRLKAWRFTDGKWQETELPRLP 
M I I I I I I I I M I : I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II : I I I I I I : | | 1 M I 
m041-l RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 

310 320 330 340 350 360 
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370 380 390 400 410 420 

a041-l.pep SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I I I I I 1. 1 I I I I I I [ I I I I I I I I I i I I I I I I I I I II I I I I I i I i I I I I I I I I I I I I I M I 

m041-l SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 

370 380 390 400 410 420 



430 440 450 460 470 480 

a041-l.pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
I I I ! I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 1 1 I I I I 
m041-l TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 

430 440 450 460 470 480 



490 500 510 520 530 540 

a041-l.pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVSDLSERGISSPEHIGLQGGSNGGLITAAAF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 
m041-l RGGGEFGPRWHQAAQG I SKHKSVDDLLAVVRDLSERGI S S PEHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 



550 560 570 580 590 600 

a041-l .pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I 1 [ I I I I 
m041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 

550 560 570 580 590 600 



610 620 630 640 650 660 

a041-l .pep I DYPPALITTSLSDDRVHPAHALKFYAKLRETS PQSWLYS PDGGGHTGNGTQREAADELA 
I I I M I M I I I I I I I I I I I I I I I I I I I I I I I M I I i i I I I I i I I I I I I I I I I ( : I I I I I 
m041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 

610 620 630 640 650 660 

670 

aO 4 1-1 .pep CVLLFLKEFLGX 
I I I I I I I I I I I I 
m041-l CVLLFLKEFLGX 

670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 165>: 

g042 . seq 

1 AT G AC GAT G A TTTGCTTGCG CTTCCAagcG TTCGTGCCGC ATACCAGCGC 

51 GTTATCCAAC ACTTCCACGG CAGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TGCGGTCGAT GATGAAAATC CAGCCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACGG GCTGCCCGTG CCCTTCGTTG CGTAAAGATT CGTCCACGGG 

201 CGGCAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GATTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCTGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG cggCTTCGCG CTTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 ATTCAATTCC GCCACGCGCG CTTCCTTACC GAAAATCCGC GACAGGGTCT 

401 CCATCTGCTT CTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAAA 

451 TCTATGgtgG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCACCCGG 

501 CCCGCCGGTA ATGACAAACT GCGGATTGTG GCGGTGCAGG GATTCGCAAT 

551 CGGGCTCAAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAG 

This corresponds to the amino acid sequence <SEQ ID 166; ORF 042.ng>: 

g042 .pep 

1 MTMICLRFQA FVPHTSALSN TSTAAGPSCP MAAVRSMMKI QPGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRFWA NSASICAFNS ATRASLPKIR DR VSICFSPL VRILPLSTV K 

151 SMVVAFFANC SYASAPGPPV MTNCGLWRCR DSQSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 167>: 
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m042.seq 

1 AT G AC GAT G A TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAmT ACTTCGACAG CCGcCGGCCy TTCyTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 

551 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 168; ORF 042>: 

m042 .pep 

1 MTMICLRFQA FVPRTSALSX TSTAAGXSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 

151 SMVVAFFANC SYASAPGPPV MTSXGLXRCR ASXSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 169>: 

a042 . seq 

1 ATGACGATGA TTTGCTTGCG 

51 GTTATCCAAT ACTTCGACAG 

101 TACGGTCGAT GATGAAAATC 

151 AAGGAAACAG GCTGCCCGTG 

201 CGGTAGGCCG ATGTCGCCGT 

251 CGAAGGCGGA CACCTTGTTG 

301 TTGCCTTTGG CGGCTTCGCG 

3 51 CTTCAATTCC GCCGCGCGCG 

4 01 CCATCTGCTT TTCGCCGCTG 
4 51 TCTATGGTGG TCGCGTTTTT 
501 CCCGCCGGTA ATGACAAGCT 
551 CGGGCTCGAA CAGCGTCCCC 
601 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 170; ORF 042.a>: 

a042 .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 

151 SMVVAFFANC SYASAPGPPV MTS*GL*RCR AS*SGSNSVP TVAALSNAGC 

201 K* 

m042/a042 99.0% identity over a 201 aa overlap 

10 20 30 40 50 60 

m04 2 .pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I M I I I I I I I I I I I I ! I ! I MINI I I I I I I I I I || j I | I | | | | | | | | | | | j | | | J | | 
a04 2 MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I i I I I I I I I I I M II I I I I I I I I I I I I I I I M I 1 I | | | | | | | | | | | | | | | | | j | | | n | | 
a042 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 



CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 
CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 
CAATCGGGGT TTTTCTCTTT GATGTATTCG 
CCCCTCGTTG CGTAAAGATT CGTCTACAGG 
GTATCCAACT TGCCAACCGC GACTGCGTGC 
CCCGTAACCG ACAGCACCAG CCCGCGTCCT 
CGTTTGGGCG AACAGCGCGT CAATCTGCGC 
CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 
GTGCGGATAT TGCCGTTGTC CACCGTCAGA 
CGCCAACTGT TCATACGCTT CCGCGCCCGG 
GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 
ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 



m042 . pep 



130 140 150 160 170 180 

AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 
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M I I I I M I I I I II I I I I I I I I j | | | | | | | | | | I I I ! ! I I i I I I I M I I I M I 

a04 2 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 

130 140 150 160 . 170 180 

190 200 
m04 2.pep ASXSGSNSVPTVAALSNAGCKX 

I I I I II I I M i I I I I I I I I I I I 

a04 2 ASXSGSNSVPTVAALSNAGCKX 

190 200 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 042 shows 93.0% identity over a 201 aa overlap with a predicted ORF (ORF 042.ng) 
from N. gonorrhoeae: 

m042/g042 

10 20 30 40 50 60 

m042.pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

II I I I I I I I I I I I : I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I f I 
g04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 " 60 

70 80 90 100 110 120 

m04 2.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
i I I I I I I 1 I I I I I I I I I I I I I I I I I I 1 I I I M I I [ I I I I I I I I I I II II I I I I I I I I I i 
g04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m042.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSXGLXRCR 
1:11111111 : I I M I I I I I I I I I I I | | : i I i I I I I I I I I I I I I I I I I I It : II III 
904 2 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMVVAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

190 200 
m04 2.pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I I I I I I I I I I I I I I 
g042 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 171>: 

m042-l.seq 

1 ATGACGATGA TTTGCTTGCG 

51 GTTATCCAAT ACTTCGACAG 

101 TACGGTCGAT GATGAAAATC 

151 AAGGAAACAG GCTGCCCGTG 

201 CGGTAGGCCG ATGTCGCCGT 

251 CGAAGGCGGA CACCTTGTTG 

301 TTGCCTTTGG CGGCTTCGCG 

351 CTTCAATTCC GCCGCGCGCG 

4 01 CCATCTGCTT TTCGCCGCTG 

4 51 TCTATGGTGG TCGCGTTTTT 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 172; ORF 042-1 >: 

m04 2-l . pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 
151 SMVVAFFANC SYASAPGPPV MTS* — 



CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 
CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 
CAATCGGGGT TTTTCTCTTT GATGTATTCG 
CCCCTCGTTG CGTAAAGATT CGTCTACAGG 
GTATCCAACT TGCCAACCGC GACTGCGTGC 
CCCGTAACCG ACAGCACCAG CCCGCGTCCT 
CGTTTGGGCG AACAGCGCGT CAATCTGCGC 
CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 
GTGCGGATAT TGCCGTTGTC CACCGTCAGA 
CGCTAACTGT TCATACGCTT CCGCGCCCGG 



m042-l/g042 95.4% identity in 173 aa overlap 
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m042-l . pep 
g042 



m04 2-l . pep 
g042 



m04 2-l .pep 
g042 



g042 



10 20 30 40 50 60 

MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I M Ml I I I f: I ( MM Mil I I I II I II I I I I I i I I I I I M I I I I II M I M M I 
MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 ' 60 

70 80 90 100 110 120 

RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I M I II II 
RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX 
I : I I I I I I I I : M I 1 I I I i I I I I ] I I M : I | I I j | | f | | | | M I M [ I I f I : 
ATRASLPKIRDRVSICFSPLVRILPLSTVKSMVVAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

DSQSGSNS VPTVAALSNAGCKX 
190 200 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 173>: 

a042-l.seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC 

4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT 

501 CCCGCCGGTA A 



GTACCAGCGC 
ATGGCGGCGG 
GATGTATTCG 
CGTCTACAGG 
GACTGCGTGC 
CCCGCGTCCT 
CAATCTGCGC 
GCCAAGGTCT 
CACCGTCAGA 
CCGCGCCCGG 



This corresponds to the amino acid sequence <SEQ ID 174; ORF 042-1. a>: 

a042-l.pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR 
151 SMVVAFFANC SYASAPGPPV MTS* 



m042-l/a042-l 100.0% identity in 173 aa overlap 

10 20 30 40 50 60 

m042-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
a042-l . MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m042-l.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I M I II I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | , 
a042-l RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m042-l .pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSX 
I I M I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
a042-l AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX 

130 140 150 160 170 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 175>: 
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g04 3 . seq 

1 ATGGTTGTTT CAAATCAAAA 

51 TCACATCCGA AGGCAAAAAT 

101 CGTCCCGCGT ggcggtagcc 

151 GCCCGATTCG ATGAGGGcga 

201 GTCCGGCGAC GGTTTCGCCG 

251 CATTCGTTCA GCCCGACGCG 

301 GGCGAATTCG CTGTTCAAAA 

351 GGTAGCCGTT GTCGTGGCGG 



TATCTATGCC GTCGGCCCAT CAGCACTTTT 
CCGTAATGCC GCCTGAACGC TTCgttgaAC 
gcAAAAGTGC ATcGCGGCTT GGATGGTGCT 
gcGCGTGTTC CAGCCGCAGG CGGCGCAGGC 
GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 
GCGGGCGATT TCGGCGATGG TCAGCGGGCG 
TATCGGCGGC TTCGTCTATG CGCCGGCGGC 
CGGAAGGTGA AGCGTAA 



This corresponds to the amino acid sequence <SEQ ID 176; ORF 043. ng>: 

g043 .pep 

1 MVVSNQNIYA VGPSALFHIR RQKSVMPPER FVEPSRVAVA AKVHRGLDGA 

51 ARFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQPDA AGDFGDGQRA 

101 GEFAVQN IGG FVYAPAAVAV VVAA EGEA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 177>: 

m043 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 

151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAgGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 178; ORF 043>: 

m04 3 . pep 

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQDVGG FVYAPTAVTV VVAAEGEAQ* 

Computer analysis of the amino acid sequences gave the following results: 

Homology with a predicted ORF from N. meningitidis menA with menB 

ORF 043 shows 89.8% identity over a 128 aa overlap with a predicted ORF (ORF043.a) from 

N. gonorrhoeae: 

m043/g043 



10 20 30 40 50 60 

m04 3 . pep MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I I I I M I I I I : I I I i I : I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g04 3 MVVSNQNIYAVGPSALFHIRRQKSVMPPERFVEPSRVAVAAKVHRGLDGAARFDEGERVF 

10 20 30 40 50 60 



70 80 ~- 90 100 110 120 

m04 3 . pep Q PQAAQAS GDGFAGLRFE I A FQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I E i I I I ! I : I I I : : I : : I I I I ! ! [ : | | : { 
g043 Q PQAAQAS G DG FAG LR FE I AFQ VA FVQ P D AAG DFGDGQRAGE FA VQN I GG FVYAPAAVAV 

70 80 90 100 110 120 

130 

mO 4 3 . pep VVAAEGEAQX 

MINIM 
g04 3 VVAAEGEAXX 

130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 179>: 

a043.seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 
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151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 



This corresponds to the amino acid sequence <SEQ ID 180; ORF 043. a>: 

a043.pep 

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQD VGG FVYAPTAVTV VVAA EGEAQ* 



m043/a043 100.0% identity in 129 aa overlap 



10 20 30 40 50 60 

m04 3 . pep MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
a04 3 MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 3 . pep QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 
M M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I | | M I I I I 
a04 3 QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 

70 80 90 100 110 120 



130 

mO 4 3 . pep VVAAEGEAQX 
I I I I I I I I I I 
a04 3 VVAAEGEAQX 

130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 81>: 

g044 . seq 

1 ATGCTGCCCG ACCAGAGCGT CGAGTTCTTG CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGCCC TTCGATAACG GCGGTCAGCT 

2 01 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 CGGCTGCCGT AGCGCATTAA 



This corresponds to the amino acid sequence <SEQ ID 182; ORF 044.ng>: 

g044 .pep 

1 MLPDQSVEFL PQ VWFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQP FDNGGQLHAV VGGLRFAAEK FFFAAAVAH* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 183>: 

m044 . seq 

1 ATGCCGTCCG ACTAGAGCGT CGAGTTCTTT CCAGAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGAGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCATTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCAGTCAGTT 

2 01 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTACCGT AGCGCAyTAa 



This corresponds to the amino acid sequence <SEQ ID 184; ORF 044>: 

m044 .pep 

1 MPSDXSVEFF PE VWFDGLF GGGFPAVAL P TVYPVFHAIF DVLRVGADDD 
51 GAAAFERFQS FDDGSQFHAV VGGLRFAAEK FFFVATVAH* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 185>: 

a044 . seq 

1 GTGCCGTCCG ACCAGCGCGT CGAGTTCTTT CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCGGTCAGTT 

201 CCATACGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 186; ORF 044.a>: 

a044 .pep 

1 VPSDQRVEFF PQ VVVFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQS FDDGGQFHTV VGGLRFAAEK FFFVAAVAH* 

m044/a044 91.0% identity over a 89 aa overlap 

10 20 30 40 50 60 

m04 4 . pep MPSDXSVEFFPEVVVFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 
•III I I II I : I I I I I I I I I I I I I I I I I I ! | | | | | | j : | | | | | | | | | | J | | | | | j | j | | 
a04 4 VPSDQRVEFFPQVVVFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQS 

10 20 30 40 50 60 

70 80 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
I I I I : I I I : I I I I II I I I I I I I I II : I I I | 
a04 4 FDDGGQFHTVVGGLRFAAEKFFFVAAVAHX 

70 80 90 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 044 shows 86.5% identity over a 89 aa overlap with a predicted ORF (ORF 044.ng) 
from N, gonorrhoeae: 

m044/g044 

10 20 30 40 50 60 

m044 . pep MPSDXSVEFFPEVWFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 

I I M I h h 1 1 1 1 i 1 1 II 1 1 M I II 1 1 1 1 1 1 II I h 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

9044 MLPDQSVEFLPQVWFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQP 

10 20 30 40 50 60 

70 80 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
I h hhl II I I I II M I I I I !h|:| I ! I 
904 4 FDNGGQLHAWGGLRFAAEKFFFAAAVAHX 

70 80 90 



The following partial DNA sequence 



g046 . seq 






1 


ATGTCGGCAA 


TGCTGCGTCC 


51 


GATGACCATC 


CGCACGCGGT 


101 


CCGGGCAGTC 


TATCAGGCCG 


151 


CTGATGGTTT 


CGGTTATGCC 


201 


TTCGAGCTTG 


GGGCTGAGGT 


251 


CGATGCGGCC 


GGGGATGTTG 


301 


ATGTTGGTTT 


CGTCGCTGCG 


351 


CgcgccggcG 


Cgttataatg 


401 


TTTGGacatc 


gaggcggctg 


451 


TCGATAAcgg 


TTACGTCGTT 



; identified in N. gonorrhoeae <SEQ ID 187>: 

GACAAGCAGC CCGCCGCgcc gCGCCTGTAT 
CGTCTGCAAA ACGTAAAACC TGCAATGCGC 
GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 
gaATATGGAA AGGCTGCCGt TTTcGTTGTT 
ATTCGAGGTA TtcgctGGAA CGGACGCGCG 
AACAGGTCGG CGGCAACTTT GCAGGCGACG 
GGagaGCGCG AGcagcaagt cggcatcttC 
tgAAGGGGGA TGCGccgttg ccgaAAACGG 
CCTGTTTCCT GCAATGCTTT TTCGTCGATG 
GTTGGTGATG GCGGCAAGGT TTTGCGCGAC 
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5 01 GGTAGAACCT ACCTGCCCGT TGCCTAAAAT GAGGATTTTC ACGGTATGGG 
5 51 TCGCCGGGTG A 

This corresponds to the amino acid sequence <SEQ ID 188; ORF 046.ng>: 
g04 6.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 
51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 
101 MLVSSLRESA SSKSASSAPA RYNVKGDAPL PKTVWTSRRL PVSCNAFSSM 
151 S ITVTSLLVM AARFCATV EP TCPLPKMRIF TVWVAG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 189>: 

m04 6.seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAsT CCGC.r.sGC gCGcCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

2 01 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 
251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

3 01 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 
351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

4 01 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

4 51 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

5 01 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 
551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 190; ORF 046>: 

m04 6 .pep 

1 MSAMLRPTSX PXXRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 S ITVTSLLGM AARFCATV EP TCPLPKMRIF TVWVAE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 191>: 

a04 6 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGT CCGCCGCGCC GCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

401 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

4 51 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 192; ORF 046.a>: 

a04 6. pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 

m046/a046 98.4% identity over a 186 aa overlap 

10 20 30 40 50 60 

m04 6.pep MSAMLRPTSX PXXRACMMTIRTRSSAKRKTCNAPGQS I RPASCSVTSCSGLMVSVMPNME 

MMIMM i I I I | | | | | MINI I M I I I I I I I I I I I I I 1 I I I I I I I | | | | 

a04 6 MSAMLRPTSS PPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
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10 



20 



30 



40 



50 



60 



70 80 90 100 110 120 

m04 6.pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 
I I I I I M I I I I I I I i I I I I I I I I I I I I ! I I I I 1 I M I I I I I I M I I I I I | | | | | | | | | | | 

a04 6 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 6 . pep RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II | | | | || M I I I I I I I I I I I | 

a04 6 RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6. pep 
a046 



TVWVAEX 
I I I I I I I 
TVWVAEX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 046 shows 97.3% identity over a 185 aa overlap with a predicted ORF (ORF 046.ng) 
from N. gonorrhoeae: 

m046/g046 

10 20 30 40 50 60 

m04 6 .pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

minimi i r i r 1 1 r 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 j i j i j 1 1 1 1 1 f j f 1 1 1 1 1 1 r i 

g04 6 MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 6 . pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

1 1 1 M M 1 1 II M M I M 1 1 M 1 1 M 1 1 1 II 1 1 1 1 1 M II M 1 1 1 M 1 1 1 II 1 1 II M 1 1 

9046 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSAS SAPA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 6 .pep RSNVKGDAPLPKTWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

I M I M 1 1 M 1 1 M M M I M M M I M M II M M I 1 1 II 1 1 1 M 1 1 1 1 1 1 II II 1 1 

g04 6 RYNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLVMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6 .pep 
g046 



TVWVAEX 

Mill 
TVWVAGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 193>: 

g047 . aeq 

1 ATGGTCATCA TACAGGCGcg gcGCGGCGGG CTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT GTTTGCCGCC GCCGCCGAAA ACATCGGGGC 

201 GGTCATACCC gaATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATc tgctACCGCC TCGCCAAGCA GCTCGAACAC 
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3 01 GCATAcaacG TCAAAATCAT CGAATGCCGG CCGCGCcgtg ccgaATGGAT 

3 51 AGCCGAAAAC ctcgAcaaCA CCCTCGTCCT GCAAGGTTCG Gcaaccgacg 

4 01 aAaccctgct cgAcaacgaa tacatcgacg aaatcgaCGT ATTCTGCGCC 
4 51 CTGACCAACG ACGACGAAAG CAACATTAtg tCCGCCCTTT TGGCGAAAAA 
501 CCTcggcgCG AAGCgcgtca tcggCATCGT CAACCGCTCA AGCTACGTCG 
551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 
601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 
651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCGCACG 
701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 
751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 
8 01 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGTGACCACA 
851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAGAAACTC 
901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 194; ORF 047.ng>: 

g047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 
51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI CYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK I DIWSPHLI 

2 01 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 
251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHI IFFVSR RRILNELEKL 

3 01 IQVKMGFFG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 195>: 

m047.seq 

1 ATGGTCATCA TACAGgCGcG C . . syGCGGA sTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA AGATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CCAAAGAAAC CAGCcCmgmm 

251 GcATCATGAT TkCCGGCGGC GGCAACATCG GCTACCGTCT CGCCAAGCAG 

301 CTCGAACACG CATACAACGT yAAAATCATC GAATGCCGGC CGCGCCGTGC 

351 CGAATGGATA GCCGAAAACC TCGACAACAC CCTCGTCyTG CAAGGTT CGG 

4 01 CAACCGACGA AACCCTGCTC GACAACGAAT ACATCGACGA AATCGACGTA 
451 TTCTGCGCCC TGACCAACGA CGACGAAAGC AACATTATGT CCGCCCTTTT 
501 GGCGAaAAAC CTCGGCGCGA AGCGCGTCAT CGGCATCGTC AACCGCTCAA 
551 GCTACGTCGA TTTGCTCGAA GGCAACAAAA TCGACATCGT CGTCTCCCCC 
601 CACCTCATCA CCATCGGCTC GATACTCGCC CACATCCGGC GCGGCGACAT 
651 CGTTGCCGTC CACCCCATCC GGCGCGGCAC GGCGGAAGCC ATCGAAGTCG 
701 TCGCACACGG CGACAAAAAA ACTTCCGCCA TCATCGGCAG GCGCATCAGC 
751 GGCATCAAAT GGCCCGAAGG CTGCCACATT GCCGCCGTCG TCCGCGCCGG 
801 AACCGGCGAA ACCATTATGG GACACCATAC CGAAACCGTC ATCCAAGACG 
851 GCGACCACAT CATCTTTTTC GTCTCGCGCC GGCGCATCCT GAACGAACTG 
901 GAAAAACTCA TCCAGGTCAA AATGGGCTTT TTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 196; ORF 047>: 

m047 .pep 

1 MVIIQARXXG XLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETQRN QPXXIMIXGG GNIGYRLAKQ 

101 LEHAYNVKI I ECRPRRAEWI AENLDNTLVL QGSATDETLL DNEYIDEIDV 

151 FCALTNDDES NIMSALLAKN LGAKRVIGIV NRSSYVDLLE GNKID IWSP 

201 HLITIGSILA HI RRGDIVAV HPIRRGTAEA IEWAHGDKK TSAIIGRRIS 

251 GIKWPEGCHI AAWRAGTGE TIMGHHTETV IQDGDHIIFF VSRRRILNEL 

3 01 EKLIQVKMGF FG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 197>: 

a047 . seq 

1 ATGGTCATCA TACAGGCGCG GCGCGGCGGA CTGCTTGTCG GACGCAGCAT 
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51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATC GGCTACCGTC TCGCCAAGCA GCTCGAACAC 

301 GCATACAACG TCAAAATCAT CGAATGCCGG CCGCGCCGTG CCGAATGGAT 

351 AGCCGAAAAC CTCGACAACA CCCTCGTCCT GCAAGGTTCG GCAACCGACG 

4 01 AAACCCTGCT CGACAACGAA TACATCGACG AAATCGACGT ATTCTGCGCC 

4 51 CTGACCAACG ACGACGAAAG CAACATTATG TCCGCCCTTT TGGCGAAAAA 

501 CCTCGGCGCG AAGCGCGTCA TCGGCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCACACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGCGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAAAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 198; ORF 047. a>: 

a047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LI VPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI GYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IVVSPHLI 

201 TIGSILAHIR RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 

m047/a047 96.5% identity over a 312 aa overlap 

10 20 30 40 50 60 

MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I ! I I I t I I I I I I I I I I I | | | | | | | | | | | 

MVIIQARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AAENIGAVI PELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKI IECRPRRAEWI 
I I I I I M i I I I I I I I I I : : III I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | j 

AAENIGAVI PELRPKETSTRR IMIAGGGNIGYRLAKQLEHAYNVKI IECRPRRAEWI 

70 80 90 100 110 

130 140 150 160 170 180 

AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | I | 
AENLDNTLVLQGSATDETLLDNEYI DE I DVFCALTN DDE SN IMSALLAKNLGAKRVIG I V 
120 130 140 150 160 170 

190 200 210 220 230 240 

NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK 

I I I I I I I I I I I I M I I I I I i I I I I I I I I I I I I ! I I I I I I I f I I I I I I I I I I I I I I I I I I | 

NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK 
180 190 200 210 220 230 

250 260 270 280 290 300 

TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 
M 1 I I I I I I I I I I M I I I I I I I I I I I I i I I I I I i I I I I I I ! I I I I 1 I I I I I I I I I I I i i I 
TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 
240 250 260 270 280 290 

310 

EKL IQVKMG FFGX 
I I I I I I I I I I I I I 



m047 . pep 
a047 

m04 7 . pep 
a047 

m047 . pep 
a047 

m04 7 . pep 
a047 

m047 .pep 
a047 

m047 . pep 
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a04 7 EKLIQVKMGFFGX 
300 310 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 047 shows 96.2% identity over a 312 aa overlap with a predicted ORF (ORF 047.ng) 
from N. gonorrhoeae: 

m047/g045 

m047 .pep MVI I QARXXGXLVGRS IADI AQDLPDGADCQI CAVYRNNRLI VPAPQTVI IEGDE I LFAA 60 

lllllll I 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 M 

g04 7 MVI I QARRGGLLVGRS I AD I AQDLPDGADCQI CAVYRNNRLI VPAPQTVI IEGDE I LFAA 60 

m047.pep AAENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 120 

llllllllllllllllh ^ III Mill MUM I III 1 1 1 ; 1 1 M I Ml MM 

g04 7 AAENIGAVI PELRPKETSTRR IMI AGGGNI CYRLAKQLEHAYNVKI I ECRPRRAEWI 117 

m047 . pep AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 180 

Ml i M IMM II III MMMIIMI 1 1 IMMMIM MMI 1 1 II I M I II I M Mi 

g04 7 AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 17 7 

m047.pep NRSS YVDLLEGNKIDI WS PHLITIGS ILAHI RRGDIVAVHPIRRGTAEAI EWAHGDKK 240 

I MMMI Ml II I MIMMMMMMMMMMM I MMMI 1 1 Mill I M Ml 

g04 7 NRSS YVDLLEGNKIDIVVSPHL I TIGS ILAHI RRGDIVAVHPIRRGTAEAI EWAHGDKK 23 7 

m047 .pep TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 300 

M M M 1 1 1 M M 1 1 1 II Ml MM II II M M M Ml M 1 1 1 M M 1 1 M 1 1 M I M h . 

g047 TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 297 

m04 7.pep EKLIQVKMGFFGX 313 

lllllllllllll 
g0 4 7 EKLIQVKMGFFGX 310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 199>: 

g048 . seq 

1 ATGCTCGACA AAGGCGAGGA GTTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTc ggcCCcgTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCCG 

101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTTACCCG CCAAATGCTC 

151 AAACAAACCG GCCTCTTGGG CATGATCGGC AAATCCGagc gcgGcgcggc 

201 cacctGCGAA GCcatCGCCG ACAACAAGGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC ATACCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

3 01 GCGTTCCCCG AATTGGGTAT GGAAGCCGTT TACGAATTTG AAGTCAAAGA 
351 TATGCCCGTA ACCGTCGCCG TGGACAGCAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAGTCT 
451 TGA 

This corresponds to the amino acid sequence <SEQ ID 200; ORF 048. ng>: 
g04 8.pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 KQTGLLGMIG KSERGAATCE AIADNK AVYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAV YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 20 1>: 

m04 8 . seq 

1 ATGCTCAACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 
51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCGG 
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101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGTGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

3 01 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 
351 CATGCCCGTA ACCGTCGCCG TAGATAGCAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAATCT 
451 TGA 

This corresponds to the amino acid sequence <SEQ ID 202; ORF 048>: 

m04 8 - pep 

1 MLNKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 EQTDLLGMIG KSERGVATCE AIADNK AVYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 203>: 

a04 8 . seq 

1 ATGCTCGACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGACGAAATC GTCGGCCCAG 

101 CAGGTCCGAC CACCGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGCGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

2 51 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

351 CATGCCCGTA ACCGTCGCCG TAGACAGCAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCC CCAATGGCAG GCGAAAATCG GCATCATCCC CGTCAAATCT 

4 51 TGA 

This corresponds to the amino acid sequence <SEQ ID 204; ORF 048.a>: 

a048.pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEI VGPAGPTTAT RMDKFTRQML 
51 EQTDLLGMIG KSERGAATCE AIADNKA VYL MAVGGAAYLV AKAI KSSKVL 
101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPPQWQ AKIGIIPVKS 
151 * 

m048/a048 96.0% identity over a 1 50 aa overlap 

10 20 30 40 50 60 

m04 8.pep MLNKGEELPVDFTNRLIYYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

11:11111 IIMMIIM :IM! ! I I ! I i I I I I I I I I I I I I I I 

a04 8 MLDKGEELPVDFTNRLIYYVG PVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 8.pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKA I KSSKVLAFPELGMEAI YEFEVKDMPV 

M I I I: I M I I I I I I I i II II II I I I II I M I I Ml I I I I I ! I I I | | I ! | | | | | | | | m I 
a04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

70 80 90 100 110 120 

130 140 150 

m04 8 . pep TVAVDSKGES IHATAPRKWQAKIGIIPVESX 

I I I I I I I I I I I I I I I I : I I I I I I I I I I : I I 
a04 8 TVAVDSKGES I HATAPPQWQAKIG 1 1 PVKSX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 048 shows 96.4% identity over a 150 aa overlap with a predicted ORF (ORF 048.ng) 
from N. gonorrhoeae: 
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m048/g048 



m048 .pep 




10 20 30 40 50 60 

MLNKGEELPVDFTNRLIYWGPVDPVGDEWGPAGPTTATRMDKFTRQMLEQTDLLGMIG 



g048 



10 20 30 40 50 



m04 8 -pep 



70 80 90 100 110 120 

KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPV 



g048 




70 80 90 100 110 120 



m04 8 -pep 



130 140 150 

TVAVDS KGES IHATAPRKWQAKI G 1 1 P VESX 



g048 



I M 1 1 1 II 1 1 II I Mill 1 1 II II ! II II 1 1 

TVAVDS KGES I HATAPRKWQAKI G 1 1 P VE SX 



130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 205>: 



1 ATGCGGGCGC AGGCGTTTGA TCAACCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG GATATTGATT 

101 TGGACGGGCA TCAACGCCTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCGG ATTCTGCCGC ATCGGCGTTT TCCCCGCCCT 

201 CAATCTGTGC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCGAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAAccggca tTTGCAGGGA 

3 01 AGCCTgcgcg TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCgc tGTTGTGTTC TTCCTGCCAT 

401 TTCTTCAGAT ACGCCTTTAA 



This corresponds to the amino acid sequence <SEQ ID 206; ORE 049.ng>: 

g04 9.pep w 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRL FRTAFAVFRN 

51 PVCRRTGFCR IGVFPALNLC GFKFGTVFFG IEPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGAWF FLPFL QIRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 207>: 

m04 9.seq (partial) 



1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG GATATTGATT 

101 TGGACGGGCA TCAACGTTTC TTCCGCATCG TTTTCCCCGT TTTCCGAAAC 

151 CGCCGGCTCA TTCGTGCCGG ATTCTGCCTC GTCGGCGTTT TCCCCGCTTT 

2 01 CAATCTGTCC GGTTTCAAAT TCGACACTGT CTTTTTTGGT ATCAAACCGG 

2 51 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

4 01 TTTTTCAGAT ACGCCTT. . . 



This corresponds to the amino acid sequence <SEQ ID 208; ORF 049>: 

m049.pep (partial) 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRF FRIVFPVFRN 

51 RRLIRAGFCL VGVFPAFNLS GFKFDTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGGIVF LLPFF QIRL . 



g049 . seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 209>: 

a04 9.seq 
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1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG AATATTGATT 

101 TGGACGGGCA TCAACGCTTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCCG ATTCTGCCGC ATCGGCGTTT TCCCCGCCTT 

201 CAATCTGTCC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

4 01 TTTTTCAGAT ACGCCTT 



This corresponds to the amino acid sequence <SEQ ID 210; ORF 049.a>: 

a049 -pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ NIDLDGHQRF FRTAFAVFRN 

51 PVCRRTRFCR IGVFPAFNLS GFKFGTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVE PVFLK DDHRVGFD FL AAIGNGGIVF LLPFF QIRL 

m049/a049 90.6% identity over a 139 aa overlap 



!0 20 30 40 50 60 

m04 9.pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 
! I M M I M I I I I M I I I I I I I I I I I | | | | : | | | | | | | | | | | :| M| , ,. ,, 

a04 9 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQNIDLDGHQRFFRTAFAVFRN PVCRRTRFCR 

10 20 30 40 50 60 

7 0 80 90 100 110 120 

m04 9 . pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 
= 1111111111111 IN II 111 II II I I ! I! II M II I I I I I I I I M | | M III I I II I 
a04 9 IGVFPAFNLSGFKFGTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

70 80 90 100 110 120 

130 139 
mO 4 9 . pep AAIGNGGIVFLLPFFQIRL 
I I I I I I I I I I I I I I I I I I I 
a04 9 AAIGNGGIVFLLPFFQIRL 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 049 shows 86.3% identity over a 139 aa overlap with a predicted ORF (ORF 049.ng) 
from N. gonorrhoeae: 

m049/g049 



!0 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

I i ! Ml Mi I II 1 1 1 1 III i 1 1 1 1 |M M 1 1 , 1 II 1 1! |: ! I I 1 1 1 1 |:||| 

g04 9 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRLFRTAFAVFRNPVCRRTGFCR 



10 



20 



30 



40 



50 



60 



70 80 90 100 110 120 

m049 . pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

: 1 1 1 1 1 • 1 1 MM MMMMMMMMMMMMMMMMMMMMMMI 

9° 49 IGVFPALNLCGFKFGTVFFGIEPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 



70 



80 



90 



100 



110 



120 



130 139 
m04 9.pep AAIGNGGIVFLLPFFQIRL 

M M I h M I M M M 1 1 1 

g04 9 AAIGNGAWFFLPFLQIRLX 

130 140 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 21 1>: 

g050 . seq 

1 atgggcgCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGg 

51 cacgcccGAA AAAGccgtgt TGATGGcaaA AGAATCCCTG ATGAGCCACA 

101 TCGAcatCca aGaATTGCAG GAAAAAGCCG CGTccggggc ggaattgtcc 

151 accaccgaAG ccCTGCGCCT cGAACTCTTT GAAAAGGTCA ACGCGCTGGG 

2 01 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

2 51 TCCTCGATTA CCCGACCCAT GCCGCCTCCA AACCGATTGC CATGATTCCC 

301 AACTGTGCcg ccacCCGcca cgtcgAATTT GAATTGgACG GCTCAGGtcc 

351 TGTCGAactc acgccGCcgc gtgtCGAAGA CTGA 



This corresponds to the amino acid sequence <SEQ ID 212; ORF 050.ng>: 

gOSO.pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVED* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 213>: 

m050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGTATCG GCATCGGCGG 

51 C.agCCgAA AAAGCCGTGC TGATGGCAAA AGAGTCCCTG ATGAGCCACA 

101 TCGACATTCA AGAATTGCAG GAAAAGGCCG CGTCCGGCGC GgAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTGGG 

201 CATCGGCGCA CAAGGCTTGG GCGGACTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA TCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

3 01 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA TGGCCCGATT TGA 

This corresponds to the amino acid sequence <SEQ ID 214; ORF 050>; 

m050 .pep 

1 MGAGWCPPGI LGIGIGGXAE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDGPI * 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 215>: 

a050.seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGG 

51 TACGCCCGAA AAAGCCGTGT TGATGGCGAA AGAATCCCTG ATGAGCCACA 

101 TCGACATCCA AGAATTGCAG GAAAAAGCCG CGTCCGGCGC GGAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTAGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA CTGGCCC 

This corresponds to the amino acid sequence <SEQ ID 216; ORF 050.a>: 

a050.pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDWP 

m050/a050 97.7% identity over a 129 aa overlap 

10 20 30 40 50 60 

m050 . pep MGAGWCPPGI LGIGIGGXAE KAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

I I I M M I I I I I I M I I: MINIMI MIMIMMMM M Ml MMMII I I I) | 
a050 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m050 . pep EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 
I I I I I I I M I I I I I I I I I 1 I I I I I I i I I I I I I I I II I I I I I I I I I ! I I I M | || | | M | | 

a050 EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

70 80 90 100 110 120 



130 

m050 . pep TPPRVEDGPIX 

I I I I I I I i 
a050 TPPRVEDWP 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 050 shows 98.4% identity over a 127 aa overlap with a predicted ORF (ORF 050.ng) 
from N. gonorrhoeae: 

m050/g050 

10 20 30 40 50 60 

MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

IMMIIMIMIIIIh MINI MIIMIMIIUI I; IM Ml i I II, I Mi Ml 

MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
10 20 30 40 50 60 

70 80 90 100 110 120 

EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

1 1 M 1 1 M I M I M M I M I II I M II M M 1 1 1 M I M I M I M M I M I M 1 1 1 M M 

E KVNALG I GAQGLGGLTTVLDVKI LD YPTHAAS KP I AMI PNCAATRHVEFELDGSG PVEL 
70 80 90 100 110 120 



m050 -pep 
g050 

m050 .pep 
g050 



130 

m050 .pep TPPRVEDGPIX 

MINN 
g050 TPPRVEDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 217>: 

g050-l.seq 

1 ATGACCGTTA TCAAGCAAGA AGACTTTATT CAAAGTATCT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCAAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCGGA CATGAGCGTG GAAAAGATGG TTAACGAAGG CGTACGCCGC 

301 GCCTACACTT GGGAAGGCAA CACCCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCACA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCT ATGCTCAACC CTTCCGACAA 

50 i CATCGTCGAT TGGGTATTGA AAACCATCCC GACGATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGCAcgcC CGAAAAAGCC 

601 GTGTTGATGG cgaAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAG GTCAACGCGC TGGGCATCGG CGCGCAAGGC 

7 51 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCATGCCGCC TCCAAACCGA TTGCCATGAT TCCCAACTGT GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GTCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGACC CG ATCTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGATAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCGCACAAAC GCCTCGTCAA TATGCTCGAC AAAGGCGAGG AGTTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCC GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTTACCC GCCAAATGCT CAAACAAACC GGCCTCTTGG GCATGATCGG 
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1251 CAAATCCGAG CGCGGCGCGG 

1301 CCGTGTACCT CATGGCAGTC 

1351 ATCAAATCTT CCAAAGTCTT 

1401 TTACGAATTT GAAGTCAAAG 

14 51 AAGGCGAATC CATCCACGCC 

1501 GGCATCATCC CCGTCGAGTC 



CCACCTGCGA AGCCATCGCC GACAACAAGG 
GGCGGCGCGG CATACCTCGT GGCAAAAGCC 
GGCGTTCCCC GAATTGGGTA TGGAAGCCGT 
ATATGCCCGT AACCGTCGCC GTGGACAGCA 
ACCGCCCCGC GCAAATGGCA GGCGAAAATC 
TTGA 



This corresponds to the amino acid sequence <SEQ ID 218; ORF 050-1. ng>: 

g050-l -pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EKMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVED*PDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVNMLD KGEELPVDFT NRLI YYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLKQT GLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

4 51 IKSSKVLAFP ELGMEAVYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

g050-l/pl4407 

sp|Pl4 407|FUMB_ECOLI FUMARATE HYDRATASE CLASS I, ANAEROBIC (FUMARASE) 
>gi 1 280063 1 pir | | B44511 fumarate hydratase (EC 4.2.1.2) fumB, iron-dependent - Escherichia coli 
>gi 1 146048 (M27058) anaerobic class I fumarase (EC 4.2.1.2) [Escherichia coli] Length = 548 

Score - 172 bits (432), Expect = 4e-42 

Identities = 138/488 (28%), Positives = 216/488 (43%), Gaps = 22/488 (4%) 

Query: 11 QSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAENNRPICQDTG 70 

Q+ DA + H K L+ E + K Q L NS + A+ P CQDTG 
Sbjct: 53 QAFHDASFMLRPAHQKQVAAILHDPEASEND KYVALQFLRNSEIAAKGVLPTCQDTG 109 

Query: 71 IATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGKRQNTKDNTPA 130 

A + KG V W E+ +++GV YEN + A K NT N PA 

Sbjct: 110 TAII VGKKGQRV-WTGGGD-EETLSKGVYNTYI-EDNLRYSQNAALDMYKEVNTGTNLPA 166 

Query: 131 VIHMSIVPGGKVEVTCAAKGGGSENKSKL AMLNPSDNI VDWVLKTI PTMGAGWCP 185 

I + V G + + C AKGGGS NK+ L A+L P + ++++ + + T+G CP 

Sbjct: 167 QIDLYAVDGDEYKFLCVAKGGGSANKTYLYQETKALLTPG-KLKNFLVEKMRTLGTAACP 225 

Query: 186 PXXXXXXXXXTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEKVNXXX 245 

P T + L + +H EL + + L EL E+ 

Sbjct: 226 PYHIAFVTGGTSAETNLKTVKLASAHY-YDELPTEGNEHGQAFRDVQLEQELLEEAQKLG 284 

Query: 246 XXXXXXXXXTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSG PVELTPP 301 

D++++ P H AS P+ M +C+A R+++ ++ + G +E P 

Sbjct: 285 LGAQFGGKYFAH-DIRVIRLPRHGASCPVGMGVSCSADRNIKAKINREGIWIEKLEHNPG 343 

uery: 302 RVEDXPDLTYSPDNGKRVDVDKLTKE EVAS WKTGDVLLLNGKI LTGRDAAHKRLVNM 358 

+ +VD+++ KE +++ + L L G 1+ GRD AH +L + 

Sbjct: 344 QYIPQELRQAGEGEAVKVDLNRPMKEILAQLSQYPVSTRLSLTGTIIVGRDIAHAKLKEL 403 

Query: 359 LDKGEELPVDFTNRLIYYXXXXXXXXXXXXXXXXXTTATRMDKFTRQMLKQTGLLGMIGK 418 

+ D G+ELP + IYY TTA RMD + + G + M+ K 

Sbjct: 404 IDAGKELPQYIKDHPIYYAGPAKTPAGYPSGSLGPTTAGRMDSYVDLLQSHGGSMIMLAK 463 

Query: 419 SERGAATCEAIADNKAVYLMAVGG-AAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 477 

R +A + YL ++GG AA L ++IK + +A+PELGMEA+++ EV+D P 

Sbjct: 464 GNRSQQVTDACHKHGGFYLGSIGGPAAVLAQQSIKHLECVAYPELGMEAIWKIEVEDFPA 523 

Query: 478 TVAVDSKG 485 

+ VD KG 
Sbjct: 524 FILVDDKG 531 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21 9>: 

m050-l. seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTATC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCTGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCGGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG AACGTCCAAT 

251 GGGATGCGGA CATGAGCGTG GAAGAGATGG TTAACGAAGG CGTACGCCGC 
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301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

4 51 GGCGGCTCTG AAAACAAATC CAAACTCGCC ATGCTCAATC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGT ATCGGCATCG GCGGCACGCC CGAAAAAGCC 

601 GTGCTGATGG CAAAAGAGTC CCTGATGAGC CACATCGACA TTCAAGAATT 

651 GCAGGAAAAG GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TGGGCATCGG CGCACAAGGC 

751 TTGGGCGGAC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTATCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCAAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT' CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCG GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGTGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGATAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 220; ORF 050- 1>: 

m050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM NVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLN KGEELPVDFT NRLIYYVGPV DPVGDEWGP AGPTTATRMD 

401 KFTRQMLEQT DLLGMIGKSE RGVATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

m050-l/g050-l 98.2% identity in 507 aa overlap 

10 20 30 40 50 60 

mOSO-l.pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
I I I I t I I I I I I I I I I t I I I I I I I ( I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
g050-l MTVIKQEDFI QSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050-l . pep NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I : I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
g050-l NNRPICQDTGIATVFLKVGMDVQWDADMSVEK^4VNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

m050-l.pep RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
g050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m050-l . pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

g050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
190 200 210 220 230 240 

250 260 270 280 290 300 

m050-l .pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I 11 I I I 
g050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP ' 

250 260 270 280 290 300 
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310 320 330 340 350 360 

m050-l .pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 
llll! I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I! : I I : 
g050-l PRVEDXPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVNMLD 
310 320 330 340 350 360 

370 380 390 400 410 420 

m050-l .pep KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I : I I I I I I I I I I I 
g050-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLKQTGLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

m050-l.pep RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA 
I I : I I I I I I f I I 1 I 1 M I I I I I I I I II I I I I I f I I I I I I I I I I I I I : I I I I I I I I I I I I I 
g050-l RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
m050-l.pep VDSKGESIHATAPRKWQAKIGIIPVESX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g050-l VDSKGESIHATAPRKWQAKIGIIPVESX 
490 500 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 22 1>: 

a050-l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AACCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGCAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGATACCGGT ATCGCGACCG TGTTTTTGAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCAGA CATGAGCGTC GAAGAGATGG TTAACGAAGG CGTGCGCCGC 

301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTTC TCGCCGACCC 

351 CGCCGGCAAA CGCCAAAATA CCAAAGACAA CACGCCCGCC GTCATCCATA 

4 01 TGAGCATCGT GCCGGGCG AC AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGTTCTG AAAACAAATC CAAACTCGCC ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATTCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGTACGCC CGAAAAAGCC 

601 GTGTTGATGG CGAAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TAGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCGAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGACGAAAT CGTCGGCCCA GCAGGTCCGA CCACCGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGACAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC CCCAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCAAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 222; ORF 050-1. a>: 

a050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGD KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLD KGEELPVDFT NRLIYYVGPV DPVGDEIVGP AGPTTATRMD 

4 01 KFTRQMLEQT DLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPPQWQAKI 

501 GIIPVKS* 
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a050-l/m050-l 98.4% identity in 507 aa overlap 

10 20 30 40 50 60 

a050-l .pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
I 1 I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I | | | | ) | 
m050-l MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
10 20 30 40 50 60 

70 80 90 100 110 120 

a050-l .pep NNRPICQDTGIATVFLKVGMDVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I I I I I I I I I I I I I I I I I : I 1 I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I 
m050-l NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
70 80 90 100 110 120 

130 140 150 160 170 180 

a050-l .pep RQNTKDNTPAVIHMSIVPGDKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I I I I I II I I I I II I I I I I I I I I I II I I I I I I II I I I I I I I I I I I | I | | | | | | | | | J | | | 
m050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTI PTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

a050-l .pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
I I I I M II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I | | I 
m050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

a050-l .pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
I I I I M I I I I I I I I I I I II I I I I I I I I I II I I I I I II I i I I I I M I | I | || | | | | | | | I I 
m050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 

250 260 270 280 290 300 

310 320 330 340 350 360 

a050-l . pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLD 

I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I II I I Illill Hill: 

m050-l PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 
310 320 330 340 350 360 

370 380 390 400 410 420 

a050-l . pep KGEELPVDFTNRLI YYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 

I I M I I I I I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I ! I I I I I 

m050-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

a050-l .pep RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA 

I I: I I I I I I I I I I I I I I I I I I I I II I II i I ! i I I ! II I I I I I I I I M I I I I I I | I 

m050-l RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 

430 '440 450 460 470 480 

490 500 
a050-l.pep VDSKGESIHATAPPQWQAKIGIIPVKSX 
I I I I I I I I I I I I I : I I I I I I I I I I : I I 
m050-l VDSKGESIHATAPRKWQAKIGIIPVESX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 223>: 

g052.seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGG CGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

3 01 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

3 51 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 224; ORF 052.ng>: 

g052.pep 
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1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 225>: 

m052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

2 51 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 
301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

3 51 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 226; ORF 052>: 

m052.pep 

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 227>: 

a052.seq 

1 ATGGCTTTGG TCGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGAGAGCCG ACAGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCTCCC 

151 AAGGGATTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCGTTTA TATCAGTCGG CGACACGTGA CTCACTTCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAATCAC ATGGTCGCCC GCCTGCAAAA AGGTGAAAAA 

351 CGCCGCCTGA 



This corresponds to the amino acid sequence <SEQ ID 228; ORF 052.a>: 

a052 .pep 

1 MALVAEETEI SAPCFKG*EP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DT* LTSMPNLVTM LLIKPTWPN 

101 RLRLEITWSP ACKKVKNAA* 



m052/a052 95.8% identity over a 1 19 aa overlap 

10 20 30 40 50 60 

m052 . pep MALVAEETEI SAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

I I I I I I I I I ! I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I M M I I I I I I I 
a052 MALVAEETEI SAPCFKGXEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m052 . pep SLVLALTAAFHSFISVGDTRLTPMPNLVTMLLIKPTVVPNRLRLETTWSPACRKVKNAAX 
I I I I I I I I I I I ! I I I I I I I II I I I I I I I I I I I I I I I I I ! I I I I I I I M I : I I I I I I I 
a052 SLVLALTAAFHSFISVGDTXLTSMPNLVTMLLIKPTVVPNRLRLEITWSPACKKVKNAAX 

70 80 90 100 110 120 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 052 shows 95.8% identity over a 1 19 aa overlap with a predicted ORF (ORF 052.ng) 
from N. gonorrhoeae: 

m052/g052 

10 20 30 40 50 60 

m052 . pep MALVAEETEI SAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

1 1 1 MM I IMMMIMIMIIMIII Mill IN MM Ml I M I II I MM Ml II I 
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g052 MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m052 . pep S LVLALTAAFHS F I S VGDTWLTSMPNLATMLL I KPTWPNRLRLE I TWS PACKKVKNAAX 

I Ml II III II III Hill II II I hi III III I M II I I III IIMIhillMIl 
g 0 5 2 S LVLALTAAFHS F I S VGDTRLTPMPNLVTMLL I KPTWPNRLRLETTWS P ACRKVKNAAX 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 229>: 

g073 . seq 

1 ATGTGTATGC CATACGCAAT AAGGGTTTCA GACGGCATCT GCCGCATTTT 

51 TCCGCCGATG CCGTCTGAAA CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AGTCCGGGGC GGatacCGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

2 01 GTGCGTGGTT GTCCACGGAT TGGTGATGGT CGAGCGCACG TCGCCGAGGT 
251 TGGCGGTACG GGAAAAGAGT TCCACGACTT TCCACGCGGC TGCTTGGTCG 

3 01 GCGACTTCAA AACCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 
351 AAGCTCCGCC TGCGGATGGT CGGGCAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 230; ORF 073. ng>: 

g073 .pep 

1 MCMPYAIRVS DGICRIFPPM PSETRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 SPGRIPAASF SSGCILPCW VHGLVMV ERT SPRLAVREKS STTFHAAAWS . 

101 ATSKPMTMPP PFCCLRISSA CGWSGNPV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23 1>: 

m073 . seq 

1 ATGTGTATGC CATATAAGAT AAGGGTTTCA GACGGCATCT GCTGTCCAAT 

51 GCCGTCTGAA ACACGCAATC AGCGTGCGAG TGCCTGTTTC AAATCGTCAA 

101 TCAAATCGCC AACATATTCC AAACCGACCG ACAGGCGCAC CAATCCGGGG 

151 CGGATGTTGG CGGCGAGTTT TTCTTCGGGC TGCATCCTGC CGTGCGTGGT 

2 01 TGTCCACGGG TGGGTAATGG TCGAGCGCAC GTCACCGAGG TTGGCGGTGC 

2 51 GGGAAAAGAG TTCCACGCCG TCCACAACTT TCCACGCCGC TTCTTGATCG 
301 GCAACTTCAA AGCCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

3 51 AAGCGCCGCC TGAGGATGGT CGGACAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 232; ORF 073>: 

m073 .pep 

1 MCMPYKIRVS DGICCPMPSE TRNQRASACF KSSIKSPTYS KPTDRRTNPG 

51 RMLAASF SSG CILPCWVHG WVMV ERTSPR LAVREKSSTP STTFHAASXS 

101 ATSKPMTMPP PFCCLRISAA XGWSDNPV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 233>: 

a073 . seq 

1 ACGTGTATGT CATATAAGAT AAGGGTTTCA GACGGCATTT GCGGTGTTTT 

51 TCCGCCGATG CCGTCTGAA. CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AATCCGGGGC GGATGTTGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT GGGTAATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAGAAAAGT TCGACGCCGT CCACGACTTT CCACGCGGCT 

301 GCTTGGTCGG CGACTTCAAA GCCGATGACG ATGCCGCCGC CGTTTTGCTG 

351 TTTGCGGATA AGCTCCGCCT GAGGATGGTC GGGTAATCCG GTGTAA 

This corresponds to the amino acid sequence <SEQ ID 234; ORF 073. a>: 

a073 .pep 

1 TCMSYKIRVS DGICGVFPPM PSEXRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 NPGRMLAASF SSGCILPCW VHGWVMV ERT SPRLAVREKS STPSTTFHAA 
101 AWSATSKPMT MPPPFCCLRI SSA*GWSGNP V* 

m073/a073 92.3% identity over a 130 aa overlap 
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m073 . pep 
a073 

m07 3.pep 
a073 



10 20 30 40 50 

MCMPYKIRVSDGICC PMPSETRNQRASAC FKSSIKSPTYSKPTDRRTNPGRMLAASF 
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TCMSYKIRVSDGICGVFPPMPSEXRNQRASAC FKSSIKSPTYSKPTDRRTNPGRMLAASF 
10 20 30 40 50 60 

60 70 80 90 100 110 

SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 

I | | I I I I I I I I I I II II II MINIMI! I: IIMMIIM1 II 

SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAAAWSATSKPMTMPPPFCCLRI 
70 80 90 100 110 120 



m073.pep 
a073 



120 129 
SAAXGWSDNPVX 
I : I I I I I I I I I 
SSAXGWSGNPVX 
130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 073 shows 87.0% identity over a 131 aa overlap with a predicted ORF (ORF 073.ng) 

from N. gonorrhoeae: 
m073/g073 



m073 .pep 
g073 



10 20 30 40 50 

MCMPYKIRVSDGICC PMPSETRNQRASAC FKSSIKSPTYSKPTDRRTNPGRMLAASF 

MM! MMMM I M M M M M M M M M M M M M M M h M h MM 

MCMPYAIRVSDGICRIFPPMPSETRNQRASACFKSSIKSPTYSKPTDRRTSPGRIPAASF 
10 20 30 40 50 60 



m073 .pep 
g073 



60 70 80 90 100 110 

SSGCILPCWVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 

MMMMMMI MMMMMMMMM 1 1 1 1 1 : MMMMMMMMM 

SSGCILPCVWHGLVMVERTSPRLAVREKSST- - -TFHAAAWSATSKPMTMPPPFCCLRI 
70 80 90 100 110 



m073 .pep 
g073 



120 129 
SAAXGWSDNPVX 

hi Mi MM 

SSACGWSGNPVX 
120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 235>: 

g075 . seq 

1 ATGCCGCCTT ACTTCATCAC CCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTTTCGG GCAACGCTGC GTTTGCCTGT 

151 GCCGCCAAAG CCAGCGGGGC GGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTT ACGAAATTTT 

251 TAAAAAAATG TGTTTGCGGG CTTTGTGAAG GTTTTAGAGA CCGCCTGCCG 

3 01 GGCCTCTTAA ACTTAATCTT CTTTTTCGTA GAATCCGAAA ATTACAAATT 
351 CCCCGCCTAT CTCTTCCAAT GCCGAGCTAA AAGCGTCTTC ATAGCTGTCA 

4 01 TATTTACCGG CTGA 

This corresponds to the amino acid sequence <SEQ ID 236; ORF 075.ng>: 

g07 5 .pep 

1 MPPYFITLLT MENTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNAAFAC 

51 AAKASGAAVT TASFAPYLRQ VLINFMIFSF TKFLKK CVCG LCEGFRDRLP 

101 GLLNLIFFFV ESENYKFPAY LFQCRAKSVF IAVIFTG* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 237>: 

m07 5 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAATG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

3 01 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 238; ORF 075>: 

m075.pep 

1 MPSYFITLLT MENTKSAAKM PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVMDGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 075 shows 65.7% identity over a 137 aa overlap with a predicted ORF (ORF 075.ng) 
from N. gonorrhoeae: 

m075/g075 

10 20 30 40 50 60 

MPSYFITLLTMENTKSAAKMPTTIQPAS I PS AFAASKAFFAVSGNVAFAC AAKARGAAVT 

|| 1 1 1 ) 1 1 1 1 } 1 1 1 1 1 1 1 MINIMUM IIIIMIIMI I Ml II MM Mill 

MPPYFITLLTMENTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNAAFACAAKASGAAVT 
10 20 30 40 50 60 

70 80 90 100 110 

TASFAPYLRQVLINFMIFSF KKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVAD 

I M M 1 1 1 1 M 1 1 M 1 1 1 1 1 II h « =1 i h= •• 5 1 I ::: : I 

TASFAPYLRQVLINFMIFSFTKFLKKCVCGLCEGFRDRLPGLLNLIFFFVESENYKFPAY 
70 80 90 100 110 120 



m075 .pep 
g075 

m075 .pep 
g075 



120 130 
m075 .pep FFQTCVNRFFEWEI IGIGDX 

Ml - 1=1 = 1 
g 0 7 5 LFQCRAKSVFI AV I FTGX 

130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 239>: 

a075 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAAGA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCTGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

4 01 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 240; ORF 075. a>: 

a075 .pep 

1 MPSYFITLLT MEKTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVM DGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 
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m075/a075 98.5% identity over a 136 aa overlap 

10 20 30 40 50 60 

MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 
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MPSYFITLLTMEKTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 

10 20 30 40 50 60 

70 80 90 100 110 120 

TAS FAPYLRQVLINFMI FS FKKCLAVMDGAFFRRPPN IRKSVFQKSEYDKFVLVADFFQT 
I I I I | | | I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I M I 

TAS FAPYLRQVLINFMI FSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 
70 80 90 100 110 120 

130 
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C VN R FFE V VE 1 1 G I G DX 
130 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 241 >: 

g080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGGCATTGGG CAGTTTGGCG AAAGAATACA T C C ATGGGAA 

2 01 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAAGCCTAC CGCCGGTATC 
251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA TACGGTTGAG 

3 01 GTCGTCCTGA CCGAGCGCAA GCCGGTTGCA CGTTGGGGCG ACCATGCCTT 
351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 
451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 
501 GATGACCTAT ACGGCACGTT CGGCGTGGAA TGTCGTTTTG GACAACGGCA 
551 TCACCGTCAG GCTCGGACGG GAAAAcgaGA TGAAACGCCT CCgGCTTTTT 
601 ACcgAAGCGT GGCAGCATCT gttgcGTAAG AATAAAAATC GGTTATCCTA 
651 TGTGGATATG Aggtataagg acggatttTC agtcccccat gctCCCGACG 
701 GTTTACCCGA AAAAGAATcc gAAGAATatt gggaacaggt ttgggacata 
751 ttacggcctg gcgtcggaaa cggttcgacg caaatttcaa tcagttatAA 
801 GGGCAGacga acaatggaac AGcagtaa 

This corresponds to the amino acid sequence <SEQ ID 242; ORF 080.ng>: 

g080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLAA SGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWNWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVPH APDGLPEKES EEYWEQVWDI 

251 LRPGVGNGST QISISYKGRR TMEQQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 243>: 

m080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGACATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

3 51 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 
4 51 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 
501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 
551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 
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601 ACCGAAGCGT GGCAGCATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 
651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTTCCGACG 
701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 2441 ; ORF 080>: 

m080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKTLGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY ASDGLPEKES EE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 080 shows 97.9% identity over a 242 aa overlap with a predicted ORF (ORF 080.ng) 
from TV. gonorrhoeae: 

m080/g080 
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mO 8 0 . pep MWDNAFAMERLTRWLLVMMAML 
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I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
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m080.pep EEX 
II 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 245>: 

a080. seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTAGTTTAT 

151 TCCGATAAGA AAGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGTTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 
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551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 

601 ACCGAAGCGT GGCAACATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 

651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTCCCGACG 

7 01 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 246; ORF 080.a>: 

a080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 VVLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIVVL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY APDGLPEKES EE* 

m080/a08o 99.2% identity over a 242 aa overlap 
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a 0 8 0 EGNVFEARLDRPGMPVFRGAEGTS AEMLRR YDE FST VLAKQGLG IKEMT YT ARS AW I VVL 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 247>: 

g081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AACAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGATA 

101 TTCGGGAAGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGACGCG 

151 CATGATTTTG TTGGAGGCGT ATTGTCTGCG GGCGCGGCGG CGGTTGTGGT 

2 01 TTCGCGCGAA GATTGCGCGG CTTTGGGCGG CGCGTTGAAA GTCGATGACA 
251 CGCTTGCCGC GTTGCAAACG TTGGCGAAGG CGTGGCGCGA TAATGTGAAC 

3 01 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 
351 GATGCTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATGAT GCCGTTTCGG . 

4 01 CGACGGCAGG CAACTTCAAC AACCACAtcg gaTTGCCGCT GACTTTATTG 
451 AAATtaaAcg aAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 
501 TTTTGGcgaa ctggcggtTt taacgcaaaT CGCCAAACCC GATGCCGCTT 
551 TGGtcaACAA CGCCCTGCGC GCCCATGTCG GATGCGGTTt cgacggagtg 
601 GGCGATATTG CCAAAGcgaa aagcGAGATT TatgcagGct tATGTTCAGA 
651 CGGCATGGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 
701 CGGCAACGTT TAATTTGAAT ACGTGCACTT TCGGCGTCGA TAGCGGCGAT 
751 GTCCGCGCGG AAAATATCGT GCTGAAACCT TTGTCGTGCG AATTTGATTT 
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801 GGTGTGCGGC GACGAGCGCA CTGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCCGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG CAAGGCTTCA GCAACATCAA 

951 AGGCCGTCTG AACGTCAAAG CCGGCATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAATCCC GACAGTATGA AAGCCGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG CATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAGGAc gaAGCCGCCG CCATGCACGC CGAagtcgGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

12 01 GCGGcggaAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

12 51 GTTGATTCAA GTGTTGAGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

13 01 TGAAAGGTTC GCGCTTTATG CAGAtggAAG AAGTGGTCGA GGCATTGGAG 
1351 GATAAGTga 

This corresponds to the amino acid sequence <SEQ ID 248; ORF 081.ng>: 

g081.pep 

1 MKPLDLNFIC QALKLPMPSE NKPVSRIVTD SRDIREGDVF FALAGGRFDA 

51 H DFVGGVLSA GAAAVWS RE DCAALGGALK VDDTLAALQT LAKAWRDNVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVSATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNALR AHVGCGFDGV 

201 GDIAKAKSEI YAGLCSDGMA . LIPQEDANMA VFKTATFNLN TCTFGVDSGD 

251 VRAENIVLKP LSCEFDLVCG DERTAWLPV PGRHNVHNAA AAAALALAAG 

3 01 LSLNDVAEGL QGFSNIKGRL NVKAGIKGAT LIDDTYNANP DSMKAAVDVL 

3 51 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

4 01 AAEKFGADGL WFAAKDPLIQ VLSHDLPERA TVLVKGSRFM QMEEWEALE 
451 DK* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 249>: 

m081.seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGAGCG GTTTGACGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCT GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGTGCTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

2 51 CGCTTGCCGC ATTGCAAACG CTGGCAAAGG CGTGGCGTGA AAATGTGAAT 

3 01 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

3 51 AATGCTGGCT GCGGTATTGC GCCgCCGTTT CGGCGATGAT GCCGTGTTGG 

4 01 CGACGGCAGG CAACTTCAAC AACCATATCG GATTGCCGCT GACTTTGTTG 
4 51 AAGTTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 
501 TTTCGGCGAA CTGGCGGTTT TAACGCAmAT CGCCAAACCA AATGCCGCAT 
551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 
601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TACCAAGGTT TATGTTCAGA 
651 CGGCATTGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 
701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 
751 GTTCACGCGG AAAATATTGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 
801 GGTGTGCGGC GATGAGCGCG CCGCCGTGGT GCTGCCTGTT CCCGGCCGCC 
851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCGGGT 
901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 
951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGAT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAACTG GGCGAGGACG AAGCCGCCGC TATGCACGCC GAAGTCGGCG 

1151 CGTATGCCCG CGACCAAGGC ATCGAAGCGG CTTATTTTGT CGGCGACAAC 

1201 AGCGTCGAAG CGGCGGAAAA ATTTGGCGCG GACGGTTTGT GGTTCGCCGC 

1251 CAAAGACCCG TTGATTCAAG TGTTGCGCCA CGATTTGCCC GAACGCGCCA 

1301 CCGTGTTGGT GAAAGGTTCG CGCTTTATGC AGATGGAAGA AGTGGTCGAG 

1351 GCATTGGAGG ATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 250; ORF 08 1>: 

m081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGERFDA 

51 H DFVEDVLAA GAAAVWS RE DCAAMDGALK VDDTLAALQT LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTXIAKP NAALVNNAMR AHVGCGFDGV 
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201 
251 
301 
351 
401 
451 



GDIAKAKSEI 
VHAENIVLKP 
LSLNDVAEGL 
ARMPAPRIFV 
SVEAAEKFGA 
ALEDK* 



YQGLCSDGIA 
LSCEFDLVCG 
KGFSNIKGRL 
MGDMGELGEL 
DGLWFAAKDP 



LIPQEDANMA 
DERAAWLPV 
NVKSGIKGAT 
GEDEAAAMHA 
LIQVLRHDLP 



VFKTATLNLN 
PGRHNVHNAA 
LIDDTYNANP 
EVGAYARDQG 
ERATVLVKGS 



TRTFGIDSGD 
AAAALALAAG 
DSMKAAIDVL 
IEAAYFVGDN 
RFMQMEEWE 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 08 1 shows 94. 1 % identity over a 455 aa overlap with a predicted ORF (ORF 08 1 .ng) 
from N. gonorrhoeae: 

m081/g081 



m081 .pep 
g081 

m081 .pep 
g081 

m081 .pep 
g081 

m081 .pep 
g081 

m081 .pep 
g081 

m081 .pep 
g081 

m081 .pep 
g081 



10 20 30 40 50 60 

MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 

I M 1 1 1 1 M Mill IM llhlllllll II Mill MINIMI MINIM MM 

MKPLDLNFICQALKLPMPSENKPVSRIVTDSRDIREGDVFFALAGGRFDAHDFVGGVLSA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GAAAVWSREDCAAMDGALKVDDTLAALQTI^KAWRENWPFVFGITGSGGKTTVKEMLA 

llllll lllllllh MINIMI III IMIMI MIIIIMI I MINI II MMMI 

GAAAVWS REDC AALGGALKVDDTLAALQTLAKAWRDNW PFVFGI TGS GG KTTVKEMLA 
70 80 90 100 110 120 

130 140 150 160 170 180 

AVLRRRFGDDAVIATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 

MMMMMM MMMMMMMMMMMMMMMMMMMMM MM 

AVLRRR FGDD AVS AT AGNFNNH I GL PLTLL KLNE KHRY AV I EMGMNH FGE L AVLTQ I AKP 
130 140 150 160 170 180 

190 200 210 220 230 240 

NAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANMAVFKTATLNLN 

M M M M M M M M M M M M M M M I M M M M M M M M M M M M M M 

DAALVNNALRAHVGCGFDGVGD I AKAKS E I YAGLCSDGMALI PQEDANMAVFKTATFNLN 
190 200 210 220 230 240 

250 260 270 280 290 300 

TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAVVLPVPGRHNVHNAAAAAALALAAG 

I I Ihlll Ihlllll MMIIIMII llllhl I MIIMI IMIIII IIIMIIM! 

TCTFGVDSGDVRAENIVLKPLSCEFDLVCGDERTAWLPVPGRHOTHNAAAAAALALAAG 
250 260 270 280 290 300 

310 320 330 340 350 360 

LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVIARMPAPRIFV 

1 1 1 1 Ml 1 1 IMM IM IMMIMIN Ml INI Ml M I M M;M II N Mil I II I 

LSLNDVAEGLQGFSNIKGRLNVKAGIKGATLIDDTYNANPDSMKAAVDVLARMPAPRIFV 
310 320 330 340 350 360 

370 380 390 400 410 420 

MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

MINIMI M M M M M 1 1 M M M M M M M M M M M 1 1 M M M I II M I 

MGDMGELGE DEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

370 380 390 400 410 



430 440 450 

m0 8 1 . pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

MM! MMMMMMMMMMMMMMM 

g 0 8 1 L I QVLS HDL P ERATVLVKGS RFMQMEE WE ALEDKX 

420 430 440 450 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 25 1>: 

a081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGATGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCG GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGTTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

2 51 CGCTTACCGC GTTGCAAATG TTGGCGAAGG CGTGGCGCGA GAATGTGAAC 

301 CCGTTTGTGT TCGGTATTAC CGGCTCGGGC GGCAAGACGA CGGTGAAGGA 

351 AATGTTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATAAT GCCGTTTTGG 

4 01 CGACGGCAGG CAACTTCAAC AACC AC AT CG GATTGCCGTT GACTTTGTTG 

4 51 AAATTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GTATGAACCA 

501 TTTTGGCGAA CTGGCGGTTT TGACACAAAT CGCCAAACCC GATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TATCAAGGCT TATGTTCAGA 

651 CGGCATGGCG CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAG CGGCG AT 

7 51 GTCCACGCGG AAAATATCGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC AACGAGTGCG CAGCCGTGGT TCTGCCCGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCCGCCGCCG CGCTGTCTTT GGCTGCAGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGTGAGGAC GAAGCCGCCG CCATGCACGC CGAAGTCGGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

1201 GCGGCGGAAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

1251 GTTGATTCAA GTGTTGCGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

1301 TGAAAGGTTC GCGCTTTATG CAGATGGAAG AAGTGGTCGA GGCATTGGAG 

1351 GATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 252; ORF 081.a>: 

a081.pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGGRFDA 

51 HDFVEDVLAA GAAAWVSRE DCVAMDGALK VDDTLTALQM LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDN AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNAMR AHVGCGFDGV 

201 GDIAKAKSEI YQGLCSDGMA LIPQEDANMA VFKTATLNLN TRTFGIDSGD 

251 VHAENIVLKP LSCEFDLVCG NECAAWLPV PGRHNVHNAA AAAALSLAAG 

301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARM PA PR I FV MGDMGELGED EAAAMHAEVG AYARDQGIEA AY FVGDNSVE 

401 AAEKFGADGL WFAAKDPLIQ VLRHDLPERA TVLVKGSRFM QMEEVVEALE 

451 DK* 

mosi/aOBi 96.7% identity over a 455 aa overlap 

10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 

t I I I I I I I I I I I 1 I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I i 1 1 I I I i I I I I I I I ! I I I 
a081 MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGGRFDAHDFVEDVLAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m081 pep GAAAVVVSREDCAAMDGALKVDDTLAALQTLAKAWRENVNPFVFGITGSGGKTTVKEMLA 
I | | I I I I I I I I I : I I I I i I I I I I II : I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
a081 GAAAWVSRE DCVAMDGALKVDDTLTALQMLAKAWRENVNPFVFGITGSGGKTTVKEMLA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m081 pep AVLRRRFGDDAVLATAGN FNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 

| | | | | | I II : I I I I II I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I MM 
a081 AVLRRRFGDNAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQ1AKP 
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130 140 150 160 170 180 

190 200 210 220 230 240 

m081 . pep N7VA.LVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANMAVFKTATLNLN 
: I I I I I I I I I I I I I I I I I II I I I I I I I I II I I II I I I I : I I II M I I I I I I I I I I I I I I I 
a081 DT^ALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGMALIPQEDANMAVFKTATLNLN 

190 200 210 220 230 240 



250 260 270 280 290 300 

m081 .pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAVVLPVPGRHNVHNAAAAAALALAAG 
I I I I I I I I II I I I I I I I 1 I I II I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I : I I I I 
a081 TRTFGIDSGDVHAENIVLKPLSCEFDLVCGNECAAVVLPVPGRHNVHNAAAAAALSLAAG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m081.pep LSLN DVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I II I I I I I I 
a081 LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAVDVLARMPAPRIFV 

310 320 330 340 350 360 



370 380 390 400 410 420 

m081 .pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 
I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

a081 MGDMGELGE DEAAAMHAEVGAYARDQGIEAAY FVGDNSVEAAEKFGADGLWFAAKDP 

370 380 390 400 410 



430 440 450 

m081 . pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

I I I I I II I I I I I I I I II I I I II I I I II I I I I I I I I I 
a081 L I QVLRH DLPE RAT VLVKGS R FMQMEE WEALE DKX 

420 430 440 450 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 253>: 

g082 . seq 

1 aTGTGGTTGT TGAAGTTGCC TGCCGTCGCC GAAACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATCTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGGT TCACATTATC GCGCCACGCC 

151 TTCGCCAACG TTTGCAACGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 

201 GCCCAAAGCC GCGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCGCCCG 

251 CAGACAATAC GCCTCCAACA AAATCATGCG CGTCAAACCG CCCGCCCGCC 

3 01 AATGCGAAAA ACACATCGCC TTCCCGAATA TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGTTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTGCTTTCGT TAATATTCGG 

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGCCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGACT GTCTGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCGGCAC ATCGGGGACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 254; ORF 082.ng>: 

g082.pep 

1 MWLLKLPAVA ETASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTLSRHA 

51 FANVCNAASV SSTFNAPPKA AQSSRETTTA AAPADNTPPT KSCASNRPPA 

101 NAKNTSPSRI SRLSVTMRDT GLFSDGIGSL RAWQMKFRSS GFIFAFVNIR 

151 A ADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGLSVD KGKVIAFARH IGDIPPKIIA VIGQLVGFDT RPTAESA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 255>: 

m082 . seq 

1 ATGnnGTTGT TGAAGTTGCC TGCCGTCGCC AACACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATTTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGAT TCACATTTTC ACGCCACGCC 

151 TTTGCCAGCG TTTGCAATGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 
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201 ATCCATTGCA GCACAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCAG 

2 51 CAGCCAATAC GTCTTCAACA AAATCATGCG CGTCAAACCG CTCGCCCGCC 

301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

4 01 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG 

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGsATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 256; ORF 082>: 

m082.pep 

1 MXLLKLPAVA NTASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTFSRHA 

51 FASVCNAASV SSTFNAPSIA AQSSRETTTA AAPAANTSST KSCASNRSPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSXF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 082 shows 92.7% identity over a 247 aa overlap with a predicted ORF (ORF 082.ng) 
from N. gonorrhoeae: 

m082/g082 

10 20 30 40 50 60 

MXLLKLPAVANTASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTFSRHAFASVCNAASV 

I MM Ml Ml II IMMMMIMIMM IMMMII Ml I M Ml MM MMMM 

MWLLKLPAVAETASSPKRRRNTAASISFTWLPPEPVMPNTNGFTLSRHAFANVCNAASV 
10 20 30 40 50 60 

70 80 90 100 110 120 

SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 

IIIMII lllllllllllllll II IMIIIII II ! 1 1 1 M M M II M M M 1 1 

SSTFNAPPKAAQSSRETTTAAAPADNTPPTKSCASNRPPANAKNTSPSRISRLSVTMRDT 
70 80 90 100 110 120 

130 140 150 160 170 180 

GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSXF 

Ihlll III 1 1 III II I II IMIhlill I IIIMII III III III 1 1 IIIMII Ml I 

GLFSDGIGSLRAWQMKFRSSGFIFAFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 
130 140 150 160 170 180 

190 200 210 220 230 240 

FLYVS FFRRI FSRFAFSRI PRRGWGQSVDKGKVI AFALHIGNI PPKI I AVI GQLVGFDT 

1 1 1 M M I M 1 1 1 1 1 1 M I M M I M 1 1 M I M I M I II 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

FLYVS FFRRI FSRFAFSRI PRRGWGLSVD KG KVIAFARH I GDI PPKI I AVI GQLVGFDT 
190 200 210 220 230 240 



RPTAESAX 

IMIIIII 
RPTAESAX 



The following partial DNA sequence was identified inN meningitidis <SEQ ID 257>: 

a082.seq 

1 ATGTGGTTGT TGAAGTTGCC TGCCGTCGCC AAAACGGCAT TATCGCCGAA 
51 ACGGCGGCGC AATACCGCAG CCAACATTTC CTTCACCGTC GTCTTGCCGC 



m082 .pep 
g082 

m082 .pep 
g082 

m082 .pep 
g082 

m082 .pep 
g082 

m082 .pep 
g082 
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101 CCGAGCCGGT AATACCGAAC ACAAACGGGT TCACATTCTC GCGCCACGCC 

151 TTCGCCAACA TTTGCAACGC GGTAAGCGTG TCATCGACTT TCAACGCGCC 

201 ATCCATTGCA ACGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCCG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CATCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG 

4 51 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 



This corresponds to the amino acid sequence <SEQ ID 258; ORF 082.a>: 

a082.pep 

1 MWLLKLPAVA KTALSPKRRR NTAANISFTV VLPPEPVIPN TNGFTFSRHA 

51 FANICNAVSV SSTFNAPSIA TQSSRETTTA AAPAANTSST KSCASNRPPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFT FVNIR 

151 AADTSVAADF FIACFAVV KH RLFSHSHSAF FLYVSFFRR1 FSRFAFSRIP 

201 RRGVVGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

m082/a082 95.5% identity over a 247 aa overlap 

10 20 30 40 50 60 

m082 . pep MXLLKLPAVANTAS S PKRRRNTAAS I S FT VVLPPE PVMPNTNGFTFSRHAFAS VCNAAS V 

I 11111111:11 I I I I I I I I I ! : I I I I II I I I I I I : I I I I I I I I I I I I I I : : I I I : I I 
a082 MWLLKLPAVAKTALSPKRRRNTAANISFTVVLPPEPVIPNTNGFTFSRHAFANICNAVSV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m082 . pep S STFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRS PANAKNTS PARMSRLSVTMRDT 

I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I II I I I I i 
a082 SSTFNAPSIATQSSRETTTAAAPAANTSSTKSCASNRPPANAKNTS PARMSRLSVTMRDT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m082.pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSXF 
I I I I I 1 I I I I I M I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 
a082 GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m082.pep FLYVSFFRRIFSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT 
I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I i I | I I | | | | || 
a082 FLYVSFFRRIFSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT 

190 200 210 220 230 240 



m082.pep 
a082 



RPTAESAX 
I I I I I I I I 
RPTAESAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 259>: 

g084 . seq 

1 ATGAAacaAT CCGcccgaat aAAAAATATG GATCAGACAT TAAAAAATAc 

51 attgggcatt tGCGCGcttt tagcctTTTG TTTTggcgcG gccaTCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGC 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GCTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCTT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

3 51 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 
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4 01 TTTGGAAATA TTGTGTATCT GTGGGGGTAT TTGCTGACGT AAAAAACTAT 

451 AAACGTCGCA GCAAAATATG GCTGACCATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGAGAAAA TCGccggcga taaAGATTGG CGAGaacctg 

551 atgccggcct gttgttgaat ATTTTcgacc tgtattaCga cttggctttc 

601 cgcgccggca cAATATGCCG CCAAGCGCGC CCAcattttg gaagCagcaa 

651 aaaaagcgtC AACATGGCAt atccgccaac ttgcgcccaa gTAtaa 

This corresponds to the amino acid sequence <SEQ ID 260; ORF 084.ng>: 

g084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTVW KYCVS VGVFADVKNY 

151 KRRS KIWLTI LLTLILSCAV ME KIAGDKDW REPDAGLLLN IFDLYYDLAF 

2 01 RAGTICRQAR PHFGSSKKSV NMAYPPTCAQ V* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 26 1>: 

m084 . seq 

1 ATGAAACAAT CCGCCcGAAT AAAa.ATATG AATCAGACAT TACTTTATAC 

51 ATTGGGCATT TGCGCGCTTT TAACCTTTnn nnnnnnnnnn nnnnnnnnnn 

101 nnnnnTATCA CCCnGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

4 01 TTTGGAAATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCT.TC 

601 CGCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 262; ORF 084>: 

m084.pep 

1 MKQSARIKXM NQTLLYTLGI CALLTFXXXX XXXXX YHPEY EYGY RYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTVW KYCVS GGVFADVKNY 

151 KRRS KIWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN IFDLYYDLAX 

201 RAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 084 shows 90.5% identity over a 231 aa overlap with a predicted ORF (ORF 084.ng) 
from N. gonorrhoeae: 

m084/g084 

10 20 30 40 50 

m084 . pep MKQSARI KXMNQTLLYTLGI CALLTF YHPEYEYGYRYSAVGALASWFLLL 

Mill I II hill Ml I III hi II I M 1 I I I I II I I I I I I I I I 1 I I 

g084 MKQS7VRIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASWFLLL 

10 20 30 40 50 60 

60 70 80 90 100 110 

m084 .pep LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

1 1 1 1 1 1 1 II I II II 1 1 II I M I II II I II 1 1 M I II 1 1 1 1 1 II I M II M 1 1 II II II II - 

g084 LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 

120 130 140 150 160 170 

m084.pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

I I I II I I I I I I I I II I II I I I II I I II I II I II I II I I I II I II II I I I h I I h I I I 
g084 YFVQALFFIFGLTVWKYCVSVGVFADVKNYKRRSKIWLTILLTLILSCAVMEKIAGDKDW 

130 140 150 160 170 180 
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180 190 200 210 220 

m084 .pep REPDAGLLLNIFDLYYDLAXRAGTICRQARPHFGSSKKSVNMAYPSCCAQVX 

I i| II I II I Ml III! II I 11)1 II 1 1 1 1 1 MM 1 1 II I !! II I Mill 

g084 REPDAGLLLNIFDLYYDLAFRAGTICRQARPHFGSSKKSVNMAYPPTCAQVX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 263>: 

a084 .seq 

1 ATGAAACAAT CCGCCCGAAT AAAAAATATG GATCAGACAT TAAAAAATAC 

51 ATTGGGCATT TGCGCGCTTT TAGCCTTTTG TTTTGGCGCG GCCATCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

4 01 TTTGGAGATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

•4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCTTCC 

601 . GCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 264; ORF 084.a>: 

a084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 ALASVVFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WRYCVS GGVFADVKNY 

151 KRRSK IWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN IFDLYYDLAS 

201 XAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

mO84/a084 92.2% identity over a 231 aa overlap 

10 20 30 40 50 60 

MKQSARIKXMNQTLLYTLGICALLTFXXXXXXXXXYHPEYEYGYRYSAVGALASVVFLLL 
I I I [ I I I I 1:111 I I I I I I I I : I II I I I I I I I I I I I i I I I I I I I I I I 

MKQSARIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASVVFLLL 
10 20 30 40 50 60 

70 80 90 100 110 120 

LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I 
LARGFPRVS SVVLLI YVGTTALYLPVGWLY GAPSYQIVGS I LESNPAEARE FVGNLPGSL 
70 80 90 100 110 120 

130 140 150 160 170 180 

YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 
I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I 
YFVQALFFIFGLTVWRYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 
130 140 150 160 170 180 

190 200 210 220 230 

REPDAGLLLN I FDLYYDLAXRAGT I CRQARPHFGSSKKSVNMAYPSCCAQVX 

I I E I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

REPDAGLLLNIFDLYYDLASXAGTICRQARPHFGSSKKSVNMAYPSCCAQVX 
190 200 210 220 230 



m084 .pep 
a084 

m084 .pep 
a084 

m084 . pep 
a084 

m084 .pep 
a084 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 265>: 

g085.seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGT TGAAAGATAA 
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51 GGCAAAAGGC GTGTTCCTGA TCGGCGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AACCTGACCG ACTGCGTCAC TTTGGAAGAG 

151 GCGGTTCAGA CGGCATACGC CCAAGCCGAA GCGGGCGATA TTGTCTTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT tatCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 266; ORF 085. ng>: 

g085 .pep 

1 MGKGQDFTPL RDALKDKAKG VFLIGVDAPQ IRRDLDGCGL NLTDCVTLEE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKA1»* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 267>: 

m085 . aeq 

1 ATGGGTAAAG GGCAGGACTT CACGCCCCTG CGCGATGCAC TGGTAGGCAA 

51 GGCAAAAGGC GTGTTCTTGA TTGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AATATGACCG ACTGCGCCAC TTTGG GAGAA 

151 GCCGTTCAGA CGGCATATGC CCAAGCCGAA GCAGGCGATA TTGTGTTGCT 

201 CAGCCCCGCC TGCGCGAGCT TTGATATGTT CAAAGGCTAC GCGCACCGTT 

2 51 CGGAAGTGTT TATCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 268; ORF 085>: 

m085 -pep 

1 MGKGQDFTPL RDALVGKAKG VFLIGVDAPQ IRRDLDGCGL NMTDCATLGE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 085 shows 94.7% identity over a 94 aa overlap with a predicted ORF (ORF 085. ng) 
from N. gonorrhoeae: 

m085/g085 



m085 .pep 
g085 



m085 .pep. 
g085 

70 80 90 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 269>: 

a085.seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGC TTGCCGGCAA 

51 GGCAAAAGGC GTGTTCCTGA TCGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGATCTG AATATGACCG ACTGCGCCAC TTTGGAAGAA 

151 GCGGTTCAGA AGGCATATGC CCAAGCCGAA GCGGGCGATA TCGTGCTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGGGGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 270; ORF 085. a>: 

a085 .pep 

1 MGKGQDFTPL RDALAGKAKG VFLIGVDAPQ IRRDLDGCDL NMTDCATLEE 
51 AVQKAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIGA FKAL* 

m085/a085 94.7% identity over a 94 aa overlap 



10 20 30 40 50 60 

MGKGQD FT PLRDALVGKAKGVFL I GVDAPQ I RRDLDGCGLNMTDCATLGE AVQTAYAQAE 

1 1 1 1 1 1 1 1 1 M 1 1 ! II 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 II 1 1 hi ! I : II 1 1 1 ! 1 1 1 1 1 1 1 

MGKGQDFTPLRDALKDKAKGVFLI GVDAPQ I RRDLDGCGLNLTDCVTLEEAVQTAYAQ/yS 
10 20 30 40 50 60 

70 80 90 

AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 

II I II 1 1 1 1 III 1 1 II I II II MIIIIIIMIIil 

AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 



10 20 30 40 50 60 

m085 . pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE 
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I I I I I I I I M I I I I : I I I I I i ! I I I I I I I I I I I I I I I I I I I I I I I I I III! I I I 1 I I 
a085 MGKGQDFTPLRDALAGKAKGVFLIGVDAPQIRRDLDGCDLNMTDCATLEEAVQKAYAQAE 

10 20 30 40 50 60 

70 80 90 

AG DI VLLS PACAS FDMFKG YAHRS EVFIEAFKALX 
I I j I II I I I I I I I I I I I I I I I I I I I II I I I II I I 
AGDIVLLS PACAS FDMFKG YAHRSEVFIGAFKALX 
70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 27 1>: 

g086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCGGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCGAAGG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GGCAGGCGGG GTTCGTCGTT GCCGGCCTTA TAGCGAGCGG TTTTTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CTTATCCGGC CTGTTGCTGG TAGCCGTATT GATTG CCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCAGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

3 51 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAACCTG ATTATGTCCG CCACCAATCC GCAGGCACGT 

451 CGTGAAACAT TAGAAATGTA CGgcCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTACCG TTGGAATGCT GTTTCTGGCA 

601 GGATTGCCGT GGAAATATTT TTTTGTCCTG GTAGGCAGCG TCTTGGGTGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCTGAA GAATTCGGCT TCTTCGGGAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGtttgA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGctgCcg tTGATGTCCT ATGGcggTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATCGATT 

1151 ATGAAAACCG CCAGAAAATG CGCGGTTACC GGGTGGAGTA AA 

This corresponds to the amino acid sequence <SEQ ID 272; ORF 086.ng>: 

g086 -pep 

1 MWLMTAFGL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGFLW 

51 FLCRMRTWRR L VPWIFALSG LLLVAVLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF RA IILPIMLV AFGLVLIMV Q PDFGS FWIT VITVGMLFLA 

2 01 GLPWKY FFVL VGSVLGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYQLT 
251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 
301 I FCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG 

3 51 ALPTKGLTL P LMSYG GSSVF FMLISMMLLL RI DYENRQKM RGYRVE * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 273>: 

m086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

3 01 GAGCTGTTCA AGCtGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 
351 CCGTGAAGAA GTGTTGcGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGrCACGT 
4 51 CGTGAaACAT TAGAAATGTA CGGCCGTwTC CGGGCGATCA TCCTGCCGAT 
501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 



m085 . pep 
a085 
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551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

6 01 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGkrTCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA mCAAAgGyCT GACGCyGCCG Tg.AtGTCCw ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTkG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTATC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 274; ORF 086>: 

m0 86 .pep 

1 MWLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGLLW 

51 FLCRMRTWRR L VPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQXR 

151 RETLEMYGRX RA IILPIMLV AFGLVLIMV Q PDFGS FWIT VIAVGMLFLA 

201 GLPW KYFFVL VGSVLGGMVL MIT AAPYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

301 IFCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGXQ 5F FNIGVNIG 

351 ALPXKGLTX P XMSX GGSSVF FMLISMMLLX RI DYENRRKM RGYRVE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 086 shows 96.7% identity over a 396 aa overlap with a predicted ORF (ORF 086.ng) 
from N. gonorrhoeae: 

m086/g086 

10 20 30 40 50 60 

MWLMTAFS LLM I YS ASVYLAS KEGGDQFF YLTRQAGFWAGL I ASGLLWFLCRMRTWRR 

II I M I M M II I i I MM Ml Ml 1 1 1 II II I Ml 1 1 M M I M MM I Ml M II 1 1 1 

MWLMTAFGLLM I YS AS VY LAS KEGGDQFF YLTRQAGFWAGL I ASGFLWFLCRMRTWRR 
10 20 30 40 50 60 

70 80 90 100 110 120 

LVPWI FALSGLLLWVLI AGRE INGATRWI PLGPLNFQPTELFKLAVI LYLASLFTRREE 

i I M M M M I M I M I M M M I M I II I MM IMM II M 1 1 1 1 1 1 1 M I M II I M 

LVPWI FALSGLLLVAVLI AGRE INGATRWI PLGPLNFQPTELFKLAVILYLASLFTRREE 
70 80 90 100 110 120 

130 140 150 160 170 180 

VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFGLVLIMVQ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 M II 1 1 M M 1 1 1 1 1 1 II M 1 1 

VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 
130 140 150 160 170 180 

190 200 210 220 230 240 

PDFGSFWITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

1 1 1 1 1 1 1 1 1 II h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

PDFGSFWITVITVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 
190 200 210 220 230 240 

250 260 270 280 290 300 

DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFI FAI I AEEFGFFGMCVL 

M M M M M M I i II 1 1 li II I II I M I II M II II M M M I M 1 1 1 1 1 M 1 1 1 M I 

DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFI FAI I AEEFGFFGMCVL 
250 260 270 280 290 300 

310 320 330 340 350 360 



m08 6 -pep 
g086 

m0 8 6 .pep 
g086 

m086 .pep 
g086 

m086 .pep 
g086 

m0 86 .pep 
g086 
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m086 . pep IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 

III I II I III MM III I III I Ml II 1 1 III I III II MINI 1 1 ! ! 1 1 If : ! It I I 

g086 I FCYGWLWRAFS IGKQSRDLGLTFNAYI ASGI GI W I GI QS FFNI GVNIGALPTKGLTLP 

310 320 330 340 350 360 

370 380 390 

m086 .pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX 

II II Ml II Mill III Ml Ml hi Mill I II 

g086 LMSYGGSSVFFMLISMMLLLRIDYENRQKMRGYRVEX 

370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 275>: 

a086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGGCACGT 

4 51 CGTGAAACAT TAGAAATGTA CGGCCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGCTGCCG TTGATGTCCT ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTACC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 276; ORF 086.a>: 

a086 .pep 

1 M VVLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQA GFW AGLIASGLLW 

51 FLCRMRTWRR L VPWIFALSG LLLVVVLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF RA IILPIMLV AFGLVLIMV Q PDFGS FVVIT VIAVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MIT AAPYRVQ RVVAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

301 IFCYGWLVV R AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG 

351 ALPTKGLTLP LMSYG GSSVF FMLISMMLLL RI DYENRRKM RGYRVE* 



m086/a086 98.0% identity over a 396 aa overlap 



10 20 30 40 50 60 

m08 6 . pep MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR 
I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I ! I I I I I I I I I I I I I I I I I I I I I I ! I I 
a08 6 MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFWAGLIASGLLWFLCRMRTWRR 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 6.pep LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLN FQPTELFKLAVILYLASLFTRREE 

I I I I I I I i I I I i I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I 
a08 6 LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLN FQPTELFKLAVILYLASLFTRREE 
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70 80 90 100 110 120 

130 140 150 160 170 180 

VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFGLVLIMVQ 

| | | | | | | | I | I I M I I t I I I I I I I I I I I I I i I I I I I I I I 1 I I I I I I I I I ! I ! I I M 1 I 
VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK 

| | | | | | M | | | | I I I I I I I I I II I I I I I I I 1 I t I I I I I I I I I I I 

PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 
190 200 210 220 230 240 

250 260 270 280 290 300 

DPQGAGYQLTHSLMAIGRGEW FGMGLGASLSKRGFLPEAHTDFI FAIIAEEFGFFGMCVL 

i I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I II I II I I I I I I I I 

DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFI FAIIAEEFGFFGMCVL 
250 260 270 280 290 300 

310 320 330 340 350 360 

IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 

| | | I | | I | | | 1 1 | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I 1 
IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 

310 320 330 340 350 360 

370 380 390 

XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX 

! I | I I I I II I I I I I I I I I I I I I I I I i I I ! I I I I I 
LMSYGGSSVFFMLISMMLLLRIDYENRRKMRGYRVEX 
370 380 390 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 277>: 

g087. seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACACAT 

51 TTTCCCAGCT CTGGCTGTGG CGGATTCATT GCGCGTGCGC GGTCATCATG 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGAATAC GCGGCAACGG 

201. CATCAAACGC AAGCTGATGC TTCCGTTTAC TCTGTACAAA ACCGTCCGCG 

251 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

3 01 GGCGGTTTTG TTACCTTTCC CGGCGGTCTG GCGGCGAAAC TCTTGGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGCTTG TCCAACCGCC 

401 AccTGTCGCg ctGGGCGAAA CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

451 AGCCACGAAG GCGGTTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGCGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGTGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCGT TGGCACTGCT GCCTGAAGAG GTGCGCCCGC AGATGTACCA 

651 CCAGTCGGGG CGTAACAAGC TGGGCAATCT TCAGGCGGAT TATGACGCGT 

701 TGGGCGTGAA AGCGGAATGC GTGGAATTTA TTACCGACAT GGTGTCCGCC 

751 TACCGTGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 

801 CGAGTTGACG GCGGCGGGGC TGGGCGCGTT GTTAGTGCCG TATCCTCACG 

851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTCATGGT GCAGGCAGAA 

901 GCGGGGCTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 

951 AATCCTCGGC AGCCTCAACC GCGAAAAATG CCTCAAATGG GCGGAAAACG 

1001 CCCGTACGTT GGCATTGCCG CACAGCGCGG ATGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAAA 

This corresponds to the amino acid sequence <SEQ ID 278; ORF 087.ng>: 

g087 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRVR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGIRGNGIKR KLMLPFTLYK TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 



m086 .pep 
a086 

m086 . pep 
a086 

m08 6.pep 
a086 

m08 6.pep 
a086 

m086.pep 
a086 
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201 PQALALLPEE VRPQMYHQSG RNKLGNLQAD YDALGVKAEC VEFITDMVSA 

2 51 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 
301 AGLLLPQTQL TAEKLAEILG SLNREKCLKW AENARTLALP HSADDVAEAA 

3 51 IACAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 279>: 

m087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGAwkkCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 

101 TGATTTGGCT GGG CAGCAAG GATTCGATGG AAGAGCGTAT CGTGCCGCAA 

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

2 01 CATCAAACGC AAACTGATGC TGCCGGTTAC TTTGTATCAA ACCGTCCGCG 

2 51 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

3 01 GGCGGCTTCG TTACCTTCCC CGGCGGTTTG GCGGCGAAGC TATTArGCGT 

3 51 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 
451 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 
501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

5 51 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAAC CGTA 
601 CCGCATGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC ATATGTACCA 

6 51 CCAATCGGGA CGGGGCAAGC TGGGCATCTT GCAGGCGnnn nnnnnnnnnn 

7 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
751 nnnGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG CGGTTGACGA 
801 TCACCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG GCGGGATTGC 

8 51 TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA GATTCTCGGC 
901 GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG CCCGTACGTT . 
951 GGCACTGCCG CACAGTGCGG ACGACGTGGC GGAAGCCGCG ATTGCGTGTG 

10 01 CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 280; ORF 087>: 

m087 .pep 

1 MGGKTFMLXX GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGVRGNGIKR KLMLPVTLYQ TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLXVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PHALALLPDN ARPHMYHQSG RGKLGILQAX XXXXXXXXXX XXXXXXXXXX 

251 X AGLGALLV P YPHAVDDHQT ANARFMVQAE AGLLLPQTQL TAEKLAEILG 

3 01 GLNREKCLKW AENARTLALP HSADDVAEAA IACAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 087 shows 83.9% identity over a 355 aa overlap with a predicted ORF (ORF 087.ng) 
from N. gonorrhoeae: 

m087/g087 

10 20 30 40 50 60 

m08 7 .pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 

I M I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g0 8 7 MGGKTFMLMAGGTGGHIFPALAVADSLRVRGHHVIWLGSKDSMEERIVPQYGIRLETLAI 

10 20 30 40 50 60 

70 80 90 100 110 120 

m087.pep KGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV 

M : II I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I 
g087 KGIRGNGIKRKLMLPFTLYKTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV 

70 80 90 100 110 120 

130 140 150 160 170 180 

m087 . pep IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 

I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g 0 8 7 IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADI SNLPVPAERFQGREG 

130 140 150 160 170 180 
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190 200 210 220 229 
m087 pep RLKI LWGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQA 

uMM M MM M MMM MMIM i-M MMM MM MM ! Ill 

g08 7 RLKILWGGSLGADVLNKTVPQALALLPEEVRPQMYHQSGRNKLGNLQADYDALGVKAEC 

190 200 210 220 230 240 

230 240 250 

m087 pep -- -- AGLG ALLV P Y PHAVDDHQT AN AR FMVQ AE 

MINI I III III IMI II II MM III I 

g087 VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE 

250 260 270 280 290 300 

260 270 280 290 300 310 

mO 8 7 pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

I M 1 1 M I M 1 1 M 1 1 IM MMM 1 1 1 1 M MMM I M M M M M 1 1 1 1 1 M I 

g 0 8 7 AGLLLPQTQLTAEKLAEILGSLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 28 1>: 

a087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 

151 TACGACATCC TGCTCGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

201 CATCAAACGC AAGCTGATGC TGCCGTTTAC TTTGTATCAA ACTGTCCGCG 

251 AAGCGCAGCA GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGCTTCG TTACCTTTCC CGGCGGTTTG GCGGCGAAGT TATTAGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

4 51 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC AGATGTACCA 

651 CCAATCGGGA CGGGGCAAGC TGGGCAGCTT GCAGGCGGAT TACGACGCGC 

701 TGGGCGTGCA AGCGGAATGC GTGGAATTTA TTACCGATAT GGTGTCCGCC 

751 T AC CGCGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 

801 CGAGTTGACG GCGGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG 

851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG 

901 GCGGGATTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 

951 GATTCTCGGC GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG 

1001 CCCGTACGTT GGCACTGCCG CACAGTGCGG ACGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 282; ORF 087.a>: 

a087 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YDILLETLAI KGVRGNGIKR KLMLPFTLYQ TVREAQQIIR KHRVECVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILVVGGS LGADVLNKTV 

201 PQALALLPDN ARPQMYHQSG RGKLGSLQAD YDALGVQAEC VEFITDMVSA 

251 YRDADLVICR AG ALTIAELT AAGLGALLV P Y PHAVDDHQT ANARFMVQAE 

301 AGLLLPQTQL TAEKLAEILG GLNREKCLKW AENARTLALP HSADDVAEAA 

351 IACAA* 

m087/a087 85.4% identity over a 355 aa overlap 

10 20 30 40 50 60 

m087 . pep MGGKT FMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 1 I I .! I I I I I 
a087 MGGKTFMLMAGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYDILLETLAI 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m087 . pep KGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV 
I I I I I I I I I 1 I I I I I I I I 1 i I I I I I : I I I I i I I I I I I 1 I I I I I I I I I I I ! I I I I I I I I 
a Q87 KGVRGNGIKRKLMLPFTLYQTVREAQQIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV 

70 - 80 90 100 110 120 

130 140 150 160 170 180 

m087 pep IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 

I | | I I I I I II I I I I I I I I I I I I I I I I I I I I I I i I I I I I I 1 I I I I I I I I I I I I I f I 

a 087 IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m087 . pep RLKILVVGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQAXXXXXXXXXXX 

Mill! I I I I I I I I M : I I I I I I I M II : I I I M M I I I I III 

a 087 RLKILVVGGSLGADVLNKTVPQALALLPDNARPQMYHQSGRGKLGSLQADYDALGVQAEC 

190 200 210 220 230 240 

250 260 270 280 

m087 . pep XX XXXXXXXXXAGLGALLVPYPHAVDDHQTANARFMVQAE 

: : I I I I I I I I I I I I I II I I I I I M I I II I I I 

a 087 VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE 

250 260 270 280 290 300 

290 300 310 320 330 

m087 . pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 
| | | | | | I I I ! I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I i I I I I I 
a 087 AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 283>: 

g088.seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTTAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGC CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCGA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAC TGCTTGCCAC CGGTGCGCTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTT TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 

401 TGTGGCAGTC AAGCGTTGCC GTTatcgcCG GTttggcaTT GTTTTACctt 

451 gCcgcCAATT CCGCCAACAA TATTTTGATT GTCCCGtttT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTttt gGtgttgTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAACGCC GTCAACCTCA CcgaCGGCTT GGACGGCCTT 

601 GCCGCcttcc cgttcgtcct cgttgccgcC GGGCTCGCCA ttttcgccTA 

651 CGTCAGCGGA CACTACCAAT TTTCCCAATA CCTCCAGCTT CCCTATGTCG 

701 CCGGCGCGAA CGAAGTCGCT ATATTCTGCA CCGCCATGTG CGGCGCGTGC 

751 CTCGGATTTT TGTGGTTCAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGCGCG CTGGCATTGG GTGCCGCGCT CGGTaccGtt gCCGTcaTcg 

851 tCCGCCAAGA ATTTGTcctc gtcattaTGG GCGGTCTGTT cgtcgtagaa 

901 gccgtgTCCG TTATGCTTCa tgtcggCTGG TACAAGAAAA Ccaaaaaacg 

951 CATCTTcCTg acgGcaccga ttcatcacca ttaCCaactt cgatgCTGGa 

1001 aagaaacgca agtcgtcgtc CGTTtCTGGA TTAtTAccat cgtcgtggtt 

1051 tTgataggtt tGagtacccT caAAattcgc ggaaactatg ccgTCCGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 284; ORF 088.ng>: 

g088 .pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSANNILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 
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201 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSVM LHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIVW 

351 LIGLSTLKIR GNYAVRTPFR RHLNAQ* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 285>: 

m088 . seq 
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niiiiiiiiiiiiiiiiii 


501 


nnnnnnnnnn 


nnnGGCGTGG 


TCGGCTTTTT 


GGTGTTGTCT 


TACCTGACCA 


551 


TCGTCGGCAC 


ATCCAATGCC 


GTCAACCTCA 


CCGACGGCTT 


GGACGGCCTT 


601 


GCGACCTTCC 


CCGTCGTCCT 


CGTTGCCGCC 


GGCCTCGCCA 


TCTTCGCCTA 


651 


TGCCAGCGGC 


CACTCACAAT 


TTGCCCAATA 


CCTGCAATTA 


CCTTACGTTG 


701 


CCGGCGCAAA 


CGAAGTGGTG 


ATTTTCTGTA 


CCGCCATGTG 


CGGCGCGTGC 


751 


CTCGGTTTCT 


TGTGGTTTAA 


CGCCTATCCC 


GCGCAAGTCT 


TTATGGGCGA 


801 


TGTCGGTGCA 


TTGGCATTGG 


GTGCCGCGCT 


CGGTACCGTC 


GCCGTTATCG 


851 


TCCGCCAAGA 


GTTTGTCCTC 


GTCATTATGG 


GCGGATTATT 


TGTCGTAGAA 


901 


GCCGTATCCG 


TTATGCTTCA 


GGTTGGCTGG 


TATAAGAAAA 


CCAAAAAACG 


951 


CATCTTCCTG 


ATGGCGCCCA 


TCCATCACCA 


CTACGAACAA 


AAAGGCTGGA 


1001 


AAGAAACCCA 


AGTCGTCGTC 


CGCTTTTGGA 


TTATTACCAT 


CGTCTTGGTG 


1051 


TTGATCGGTT 


TGAGTACCCT 


CAAAATCCGC 


TGAACCTATG 


CCGTCTGAAC 


1101 


ATCTTTCAGA 


CGGCATTTGA 


ACGCGCAATA 


A 





1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAXt 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANP YIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSAN NILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

2 01 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSV MLHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIVW 

351 LIGLSTLKIR GNYAVRTPFR RHLNAQ* 



This corresponds to the amino acid sequence <SEQ ID 286; ORF 088>: 

m0 88 .pep 

1 MFLWLAHFSX WLTGLNXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

101 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPWLVAA GLAI FA YASG HSQFAQYLQL PYVAGANE W I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

3 01 AVSVM LQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ VW RFWIITIVLV 

351 LIGL STLKIR XTYAVXTSFR RHLNAQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from ^.gonorrhoeae 

ORF 088 shows 91 .7% identity over a 205 aa overlap with a predicted ORF (ORF 088.ng) 
from N. gonorrhoeae: 

m088/g088 

10 20 30 

m0 8 8 . pep GWGFLVLS YLTI VGTSNAVNLTDGLDGLA 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
g0 8 8 I AG LALFYLAANS ANN I LI VP FFKQ I ALPLGWGFLVLS YLTI VGTSNAVNLTDGLDGLA 

150 160 170 180 190 200 
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40 50 60 70 80 90 

m0 8 8 . pep TFP WLVAAGLAI FAY AS GHS Q FAQ YLQLP YVAGANE W I FCTAMCGACLGFLWFNAYPA 

:|| II I Ml III I I hi II IhMMMIIIMIIhllllllllllllllllllill 
gO 8 8 AFPFVLVAAGLAI FAYVSGHYQFSQYLQLPYVAGANEVAI FCTAMCGACLGFLWFNAYPA 

210 220 230 240 250 260 



100 110 120 130 140 150 

m0 88 . pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLQVGWYKKTKKRIFLM 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I 
gO 8 8 QVFMGDVGALALGAALGTVAVI VRQEFVLVIMGGLFWEAVSVMLHVGWYKKTKKRI FLT 

270 280 290 300 310 320 



160 170 180 190 200 

m088 . pep APIHHHYEQKGWKETQVWRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 

III III h : I MM 1 1 III MM I hill 1 1 II I III MM I Ml II MM 

gO 8 8 AP I HHHYQLRC WKETQVWRFW 1 1 T I VWLI GLSTLKI RGNYAVRTPFRRHLNAQX 

330 340 350 360 370 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 287>: 

a088 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTCAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGT CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCAA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAT TGCTCGCCAC GGGCGCACTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTC TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 

4 01 TGTGGCAGTC AAGCGTTGCC ATTATCGCCG GTTTGGCATT GTTTTACCTT 

451 GCCGCCAATT CCGCCAACAA TATTTTGATT GTCCCGTTCT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 

601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 

651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 

701 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 

751 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTCATCG 

851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 

901 GCCGTATCCG TTATGCTTCA GGTCGGCTGG TATAAGAAAA CCAAAAAACG 

951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA TTATTACCAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 288; ORF 088.a>: 

a088 .pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKVV YKDPNGVSAK FKMVWQSS VA IIAGLALFYL 

151 AANSANNILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPVVLVAA GLAIFA YASG HSQFAQYLQL PYVAGANE VV IFCTAMCGAC 

251 LGFLWFNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSV MLQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ VVV RFWIITIVLV 

351 LJGLSTLKIR *TYAV*TPFR RHLNAQ* 

m088/aO88 99.5% identity over a 205 aa overlap 



150 160 170 180 190 200 

m088 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA 

I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
a088 IAGLALFYLAANSANNILIVPFFKQIALPLGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA 
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150 160 170 180 190 200 

210 220 230 240 250 260 

m088 . pep TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEVVIFCTAMCGACLGFLWFNAYPA 
I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a088 TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEVVIFCTAMCGACLGFLWFNAYPA 

210 220 .230 240 250 260 

270 280 290 300 310 320 

m088 . pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
a088 QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM 

270 280 290 300 310 320 

330 340 350 360 370 

m088 . pep APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 
I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I i I I 

a088 APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTPFRRHLNAQX 
330 340 350 360 370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 289>: 

g089 . seq 

1 ATGCCGCCCA AAATCACGAA GAGCGGGTTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATG AATACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACACGGT GCGGCAAGCC TTGGAAGGTT 

151 TTGACGTGTT CCAGCAATGC TTCGCGCGGC AAACCGACGG CCTCGCACAA 

201 AGCCACGGCA GCCATAACGT TGGCGGCGTT GTGCAAACCT TGCAGCGGGA 

251 TGTCTTGCGT AG AAAT C AAA TCTTCATTGC CTTGTTTTAA ACAGCCCGTC 

3 01 CCGCGTTCCA ACCAAAAATC GGCTTCGTGT TCCAAGGAAA ACCGTTTCAC 
351 TTCACGCCCT GCCCGTTTCA TGGCGCGGCA GAACACGTCG TCCGCATTCA 

4 01 AAACCTGCAC TCCATCGCCA CGGAAAATCT CGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 290; ORF 089.ng>: 

g089 -pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV 

51 LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV 

101 PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29 1>: 

m08 9 . seq 

1 ATGCCGCCCA AAATCACkAw GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA AACACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGGAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG CCTCACACAA 

201 AGCCACkGCA GCCATGACGT TAGCGGCGTT GTGCAkACCT TGCAACGGwA 

251 TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG GCGGCCTGTC 

301 TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA ACCATTTTAC 

351 CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG TCCGCATTCA 

401 AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCATAG 

This corresponds to the amino acid sequence <SEQ ID 292; ORF 089>: 

m089 .pep 

1 MPPKITXSGF CKP AIAAAVA PTFVPLLSSI NTTPFFSPIF STRCGRPWKV 

51 LTCSSNASRD KPMASHKATA AMTLAALCXP CNGMSCVTIK SSLPCFRRPV 

101 SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 089 shows 88.6% identity over a 149 aa overlap with a predicted ORF (ORF 089.ng) 
from N. gonorrhoeae: 

m089/g089 
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10 20 30 40 50 60 

m089 .pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

I I I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I t I I I I : I I I I I I I I | | M I 
g0 8 9 MPPKITKSGFCKPAI AAAVAPTFVPLLSSMNTTPFFSPI FSTRCGKPWKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m0 8 9 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 

II Mlllllhllllll I hi I II I I II llllh: I I | M II III! hil Ml III 
g089 KPTASHKATAAITLAALCKPCSGMSCVEIKSSLPCFKQPVPRSNQKSASCSKENRFTSRP 

70 80 90 100 110 120 



130 140 150 

m08 9 .pep ARFIARQNASSAFKTCTPSPRKILALVCAX 

1 1 hi ii hi 1 1 1 1 1 1 1 1 H 1 1 1 mill 

g 0 8 9 ARFMARQNTS S AFKTCTPS PRKI SALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 293>: 

a089 . seq 

1 ATGCCGCCTA AAATCACGAA GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCA CCGACGTTCG TGCCTTTGCT GTCGTCGATG AACACCACGC 

101 CATTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGAAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGGC AAACCGACGG CTTCGCACAA 

201 GGCAACGGCA GCCATCACGT TAGTGGCGTT GTGCAAGCCT TGCAGCGGAA 

251 TATCTTGCGT GGCAATCAAA TCTT'CATTGC CTTGTTTCAG GCGACCTGTC 

301 TCACGTTCCA ACCAAAAATC GGCTTCGTAT TCCAACGAAA ACCATTTCAC 

351 CTCGCGCCCG GCGCGCTTCA TCGCACGACA GAACGCATCG TCCGCATTCA 

4 01 AAACCTGCAC ACCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 294; ORP 089.a>: 

a089.pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGRP*KV 

51 LTCSSNASRG KPTASHKATA AITLVALCKP CSGISCVAIK SSLPCFRRPV 

101 SRSNQKSASY SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 



m089/a089 91 .9% identity over a 149 aa overlap 

10 20 30 40 50 60 

11108 9. pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

JIM!! I I I I I I I ! I I I I I I I 1 I I I I I ! : I I I I I I I I 1 I ! ! I M I I I I I I I I I I I I I 
a08 9 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGRPXKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m089.pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 
II I I I I I I I I : I I : I I I I I : I : I I I : I I i I I I I I I I II I I I I I I I I I I I I I I I I I I I 
a08 9 KPTASHKATAAITLVALCKPCSGISCVAIKSSLPCFRRPVSRSNQKSASYSNENHFTSRP 

70 80 90 100 110 120 



130 140 150 

mO 8 9 . pep ARFIARQNASSAFKTCTPSPRKILALVCAX 
I I I I I II I I I II I I I I I II I I I I II I I I I I 
a 0 8 9 AR FI ARQN AS S AFKTCT PS PRK I LALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 295>: 

g090 . seq 

1 ATGCGCGTAG TCGAGCAAAT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 
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51 TGTTCATCAC CGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCcca CACCCACACG TCCGCCTTTT TGCCTTCgcg 

151 ctgCAATtct gcctccaaga cgggcgtacc gatATTGCCC GCAATGAcgg 

201 tatccagccc gcacttgatg CAGAGatagc ggaccaggct ggttaccgTG 

251 GTTttgccgt tgctgCcggt aatcgCaatc accttgtcgC CGCGGCGGtt 

3 01 cAcaaTGTCc gccaGCAATt ggATGTCGCC TAgCACGCGC . ccgccgTTT 

351 TGCttga 

This corresponds to the amino acid sequence <SEQ ID 296; ORF 090.ng>: 

g090 .pep 

1 MRWEQIWA VEMVFGNVHH RRRSRAQAFG VFQLEAGKLP HPHVRLFAFA 

51 LQFCLQDGRT DIARNDGIQP ALDAEIADQA GYRGFAVAAG NRNHLVAAAV 

101 HNVRQQLDVA XHAXRRFA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 297>: 



m090 . seq 












1 


ATGCGCATAG 


TCGAGCAAGT 


CGTCGTAGCG 


GTCGAGATGG 


TCTTCGGAAA 


51 


TGTTCAGCAC 


CGTCGCCGCA 


GTCGGACGCA 


GGCTTTCGGT 


GTTTTCCAGT 


101 


TGGAAGCTGG 


AAAGCTCCAA 


CACCCACACG 


TCCGCCTTTT 


TGCCTTCGCG 


151 


CTGCCATTCC 


GCCTCCAAAA 


CCGGCGTGCC 


GATATTGCCC 


GCGATAACGG 


201 


TATCCAGCCC 


GCACTTGATA 


CAGAGATAGC 


CGACCAGGCT 


CGTTACCGTG 


251 


GTTTTGCCGT 


TGCTGCCGGT 


AATCGCAATT 


ACCTTGTCGT 


CCCGGCGGTT 


301 


CACAATGTCC 


GCCAGCAATT 


CGATGTCGCC 


CAACACGCGT 


. CCGCCGTTT 


351 


TGCTTGA 











This corresponds to the amino acid sequence <SEQ ID 298; ORF 090>: 

m0 90.pep 

1 MRIVEQVWA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

51 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLWPAV 

101 HNVRQQFDVA QHAXRRFA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 090 shows 83.9% identity over a 1 1 8 aa overlap with a predicted ORF (ORF 090.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

MRIVEQVWAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 

! I : I I h I I I I II I I I I hi I I I I h I II I I II II I I I I Millllllll I ||: h 
MRWEQIWAVEMVFGNVHHRRRSRAQAFGVFQLEAGKLPHPHVRLFAFALQFCLQDGRT 
10 20 30 40 50 60 

70 80 90 100 110 119 

D I ARDNG I Q PALDTE I ADQ ARYRGFAVAAGNRN YL WPAVHNVRQQFDVAQHAXRRFAX 

I I I I- I I M I I h I I I I I I I II I II II I III Hh 11111111 = 111 II I IN II 
D I ARNDG I QPALDAE I ADQAG YRGFAVAAGNRNHLVAAAVHNVRQQLDVAXHAXRRFAX 
70 80 90 100 110 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 299>: 

a090 . seq 

1 ATGCGCGTAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC TGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAACTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCAATTCC GCCTCCAAAA CCGGCGCGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC ACACTTGATG CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATC ACCTTGTCGC CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTT 

351 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 300; ORF 090.a>: 



m090/g090 

m090 .pep 
g090 

m0 90 .pep 
g090 
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MRVVEQVVVA VEMVFGNVQH CRRSRAQAFG VFQLETGKLQ HPHVRLFAFA 
LQFRLQNRRA DIARDNGIQP TLDAEIADQA RYRGFAVAAG NRNHLVAAAV 
HNVRQQFDVA QHAXRRFA* 

91.5% identity over a 1 17 aa overlap 

10 20 30 40 50 60 

MRIVEQVVVAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 
I I : I I I I I I I ! I I I I I I I I I I ! I I : I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I 
MRVVEQVVVAVEMVFGNVQHCRRSRAQAFGVFQLETGKLQHPHVRLFAFALQFRLQNRRA 

10 20 30 40 50 60 

70 80 90 100 110 119 

DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLVVPAVHNVRQQFDVAQHAXRRFAX 
i I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I II : I I : I I I I I I I I I I I I I I I I I I I I I 
DIARDNGIQPTLDAEIADQARYRGFAVAAGNRNHLVAAAVHNVRQQFDVAQHAXRRFAX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae 

g090-l.seq This sequence contains multiple stop codons (not shown) 

This corresponds to the amino acid sequence < ORF 090-l.ng>: 

g090-l.pep (not shown) 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 2>: 

m090-l . seq 

1 ATGACGGCGT TTGCATTTCA GACGGCATCA CAAAGCCTTA 

51 TAAACACTTC CGAACGGTGC GCGTAGCCTT TGAACATATC 

101 CAGGCGGGGC TGAGCAACAC AATATCGCCT GCTTCGGCTT 

151 CGTCTGAACG GCTTCTCCCA AAGTGGCGCA GTCGGTCATA 

201 AGCCGTCCAA ATCGCGGCGG ATTTGCGGCG CATCGACACC 

251 ACGCCTTTTG CCTTGCCTAC CAGTGCATCG CGCAGGGGCG 

301 CCCTTTACCC ATGCCGCCCA AAATCACGAA GAGCGGATTT 

351 CAATCGCGGC GGCAGTCGCG CCGACATTCG TGCCTTTGCT 

4 01 AACACCACGC CGTTTTTCTC GCCGATTTTT TCCACGCGGT 

4 51 TTGGAAGGTT TTGACGTGTT CGAGCAATGC TTCGCGCGAC 

501 CCTCACACAA AGCCACGGCA GCCATGACGT TAGCGGCGTT 

551 TGCAACGGAA TGTCTTGCGT GACAATCAAA TCTTCATTGC 

601 GCGGCCTGTC TCGCGTTCCA ACCAGAAATC AGCTTCGTGT 

651 ACCATTTTAC CTCGCGCCCG GCACGCTTCA TCGCGCGGCA 

7 01 TCCGCATTCA AAACCTGCAC GCCGTCGCCA CGGAAAATCT 

7 51 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG 

801 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT 

851 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT 

901 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC 

951 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT 

1001 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT 

1051 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT 

1101 GCTTGAACGC CTCAATATCC GGCTGCCGCT CGCTGATGCC 

1151 GCCAGAATAT CGAAACCGTT GTCCAGCGCA TCTTTCAGAC 

1201 AAACACCAAC CCGTCAAACA TCTTACCGAT TTGCGACACG 

1251 TCAGCTCCGC ATCATACGCA GCAACCTCCG CGCCGTTTTT 

1301 GCAATCATGG AAATACCCGT ACCGCCGAGT CCGGCGACGA 

1351 GTTTTGAAAA GTCATTTTGG TTTGTCCTAA 

This corresponds to the amino acid sequence <SEQ ID 3; ORF 090- 1>: 

m090-l .pep 

1 MTAFAFQTAS QSLKRFDKHF RTVRVAFEHr KARAGGAEQH NIACFGLGIC 

51 RLNGFSQSGA VGHIQAAAVQ IAADLRRIDT NQEHAFCLAY QCIAQGREVL 

101 PFTHAAQNHE ERILQTGNRG GSRADIRAFA VVDKHHAVFL ADFFHAVRQA 

151 LEGFDVFEQC FARQTDGLTQ SHGSHDVSGV VQTLQRNVLR DNQ IFIALFQ 

201 AACLAFQPEI SFVFQRKPFY LAPGTLHRAA ERIVRIQNLH AVATENLGFG 

2 51 MRIVEQVVVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

301 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLVVPAV 

351 HNVRQQFDVA QHASAVLLER LNIRLPLADA GTESQNIETV VQRI FQTARV 

401 KHQPVKHLTD LRHAFRLQLR IIRSNLRAVF AQVGNHGNTR TAESGDEDFF 

4 51 VLKSHFGLS* 



a090 .pep 

1 
51 
101 

m09/a090 

m090.pep 
a090 

m090.pep 
a090 



AACGCTTCGA 
AAAGCTCGCG 
GGGCATATGC 
TTCAAGCCGC 
AATCAAGAAC 
TGAAGTCCTG 
TGCAAACCGG 
GTCGTCGATA 
GCGGCAGGCC 
AAACCGATGG 
GTGCAGACCT 
CTTGTTTCAG 
TCCAACGAAA 
GAACGCATCG 
TGGCTTTGGT 
TCTTCGGAAA 
GTTTTCCAGT 
TGCCTTCGCG 
GCGATAACGG 
CGTTACCGTG 
CCCGGCGGTT 
CCGCCGTTTT 
GGGACTGAGA 
GGCCCGTGTA 
CGTTCCGGCT 
GCGCAGGTAG 
GGATTTTTTT 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 303>: 

g091 . seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG AGGAGGATTT TTTTGTTTGA 

51 AAGTCATTTT GGTTTTGTCC TAAAACAAAT CATATTGGGC AGGAGACGTC 

101 CGCCCTTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCCG ATTAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCT GTTTTGGTCT CCGTTTTGAC 

2 01 AAGCCTTGCC AAGCCATTGT TGAGCGAGCG CAAGGTCTTG GCGCACGCCG 
251 CGTCCATCGT AATACATCAA GCCCAAATTG TATTGGGCTT GGGCATCCCC 
301 TTGTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 304; ORF 091.ng>: 

g091 -pep 

1 MEIPVPPSPA TRIFLFESHF GFVLKQTILG RRRPPLPKPL SDGIASRLIT 
51 RLQ ALVIVAA VLVSVLTSLA KPLLSERKVL AHAAS IVIHQ AQIVLGLGIP 
101 LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 305>: 

m091 . seq 

1 ATGGAAATAC CCGTACCGCC GAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAAGTCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGAGATGTC 

101 CGCCCCTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCC GTCTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAACCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGCTT GGGCTACCCC 

3 01 CTGCGC... 

This corresponds to the amino acid sequence <SEQ ID 306; ORF 091>: 

m091 .pep 

1 MEIPVPPSPA TRIFLFEKSF WFVLKQIILS RRCPPLPKPL SDGIASCSIT 
51 RLQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFG IHH AQIVLGLGYP 
101 LR. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 091 shows 84.2% identity over a 101 aa overlap with a predicted ORF (ORF 091. ng) 
from N. gonorrhoeae: 

m091/g091 



10 20 30 40 50 60 

m091.pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 

IIIIIIIIIIIIIMIh I lilllllhil lllllllllllll MINIMUM 

g091 MEIPVPPSPATRIFLFESHFGFVLKQIILGRRRPPLPKPLSDGIASRLITRLQALVIVAA 

10 20 30 40 50 60 



70 80 90 100 

m091 . pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR 

IMIIIMMIhl : IMIIIh Ihllllllll II 

g091 VLVSVLTSLAKPLLSERKVLAHAASIVIHQAQIVLGLGIPLFX 

70 80 90 100 



The following partial DNA sequence was identified in//, meningiditis <SEQ ID 307>: 

a091. seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTTG 

51 GAAATCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGGGATGTC 

101 TGATCCTGCT CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCTTTCAGG CGTTGGTCAT TGTCGCAGCT GTCTTGGTAT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGC 
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This corresponds to the amino acid sequence <SEQ ID 308; ORF 091. a>: 

a091 .pep 

1 MEIPVPPSPA TRIFLFWKSF WFVLKQIILS RGCLILLKPL SDGIASCSIT 
51 RFQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFGIHH AQIVLG 

m09i/a09i 93.8% identity over a 96 aa overlap 

10 20 30 40 50 60 

m091 . pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 
I I I I I I I I I I I I I I f I I I I I I I I II II I I I I i I I I I I II I I I I I I I : I I I I I I I I 
a091 MEIPVPPSPATRIFLFWKSFWFVLKQIILSRGCLILLKPLSDGIASCSITRFQALVIVAA 

10 20 30 40 50 60 

70 80 90 100 

m0 91 . pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR 

I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I 

a091 VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLG 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 309>: 

g092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGTGCGC 

51 AAACGGTCAG ACCTTTAAAA TAACGCCTTT ACGCACTAAA AACCAACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTAA GCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAGGCGCG . AAATGCCGCT ACCGAGCATT 

251 TGAGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 

3 01 AACGGTgcgg ATGTCGTCGT TGCCTCTACC GCCGTCAAGA AAGAAaatcC 
351 CGAAGTtgtC gcTGCGTTGG AGCGGCAAAT TCCCGTTATT CCGCGCGCCT 
401 TGATGCTGGC AGAGCTGATG CGCTTCCGTG ACGgcatcgc cattgccggT 

4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
501 GGCAGGACTC GACCCCACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 
551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 
601 GAATCCGATG CCTCTTTCCT ACATCTGACC CCGATTATGT CCGTCGTTAC 
651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 
701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 
751 GCCTTTTTGT GTGTTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 
851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGc gtggcGCTgg 

1001 aagtcGgCGC ATcggttgAA GCGAtcCAAA AaggCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATAcggc gacatCAagt tgccaaacgg 

1101 cggGaccgCT TTgctGGTGG ACGATTAcgg ACACCACCCC GTCGAAATGG 

1151 CGGcaaccct tgccgcTGCA CGCGGCGCGT ATCCGGAAAA acgtTTGGTG 

1201 CtcgCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT -TGTTTGAAGA 

1251 CTTTACCAAA GTACTCAATA CCGTTGatgC GCTGGTACTG ACCGAAGTTT 

13 01 AtgccgccgG CGAAGAGCCG GTTGCCGCCG CCGactcCCG CGCCTTGGCG 

13 51 CGTGCTATCC GCGTATTGGG CAAACTTGAG CCGATTTACT GCGAAAatgt 

1401 cgccgACCTG CCGCAAATGC TGATGAATGT TTTACAGGAT Ggcgatgttg 

1451 tgttgAATAT GggTgcggga agcatcaacc gcgttccttc cgcgctgttg 

1501 gaattgtcga AACAGAtttg A 

This corresponds to the amino acid sequence <SEQ ID 310; ORF 092.ng>: 

g092 .pep 

1 MFFISIRYIF VRKLWCANGQ TFKITPLRTK NQPERNIMMK NRVSNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLSSLGIQ VYPGHTAEHV 

101 NGADVWAST AVKKENPEW AALERQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TAS I LGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

2 01 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 
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251 


AFLCVDSEHV 


RAILPKVSKP 


YATYGLDDTA 


DIYATDIENV 


GAQMKFTVHV 


301 


QMKGHEQGSF 


EWLNMPGRH 


NVLNALAAIG 


VALEVGASVE 


AIQKGLLGFE 


351 


GVGRRFQKYG 


DIKLPNGGTA 


LLVDDYGHHP 


VEMAATLAAA 


RGAYPEKRLV 


401 


LAFQPHRYTR 


TRDLFEDFTK 


VLNTVDALVL 


TEVYAAGEEP 


VAAADSRALA 


451 


RAIRVLGKLE 


PIYCENVADL 


PQMLMNVLQD 


GDWLNMGAG 


SINRVPSALL 


501 


ELSKQI* 










The following partial DNA sequence was identified in N. meningitidis <SEQ ID 


m092 . seq 












1 


ATGTTTTTTA 


TTTCAATCCG 


CTA1A1A1 1 1 


PTPSPfi A A A P* 

b Jl bALiAAAAL- 


1 i\x ovjCvj^oL 


51 


AAACGGTCAG 


CCCTTTAAAA 


TAAbbbb 111 


A L vj L.A 1 V_ \j AA 




101 


AACGCAACAT 


TATGATGAAA 


AAlbbAbl 1A 


bbAAbA 1 LLA 


111 luiLuuJ. 


151 


ATCGGCGGCG 


TCGGCATGAG 


LbuLA 1 bbbb 


bAAb 1 b 1 1 bb 


111 uuU 


201 


CTTTAAAGTT 


TCCGGTTCGG 


7\ 7\ pt/~i pp 

Al bAgbbbbb 


AAA 1 bbbbb 1 


a ppp a. rr* a. tt 


251 


TGGGCAGCCT 


GGGCATTCAA 


bl I lAlbbbb 


pppati c*r*cc* 

bbbA 1 Abbbb 


rr 1 aapa pptt 

L-O/xrtV— 1 1 


301 


AACGGTGCGG 


ATGTCGTCGT 


TACCTCTACb 


p , p , p i p"7 i p' a aaa 
bbbb I bAAAA 


A A r* A A A ATPP 


351 


CGAAGTTGTC 


GCTGCGTTGG 


AG C AG L AAA 1 


rp p^p* r^r** r p r p a ' i ' r r 1 
1 Lbbbl 1 Al 1 


PPPPPPP.PPP 
bbbbbbbbbb 


401 


TGATGTTGGC 


GGAGTTGATG 


CGCl ibbblb 


A PPPP A TP'P' P 1 

AbbbbA 1 bbb 


bAl Ibbbbbb 


451 


ACGCACGGCA 


AAACCACGAC 


P 1 A OP 1 A O P , P""T , P' 

bAbbAbbb 1 b 


Abbbbb 1 bbA 


1 V-L. 1 L-ooL-LjL. 


501 


GGCAGGACTT 


GACCCGACTT 


TCbTi A 1 bbb 


p'P'P' P7\ a sptp 
bbbbAAAb 1 b 


a a pp f"' pr* pip 

AAbbbbbbAb 


551 


GCACTAACGC 


CCGCTTGbbb 


A A A P 1 C* C*r* AAT 

AAAbb bb AA 1 


A r« A TPPTTPP 

AbAl bbl lbb 


b b AAb b b b Ab 


601 


GAGTCGGATG 


CATCCTTTC1 


P" P» A / r "* r P/™' A P 1 A 

GCAbb 1 bAbA 


ppn a TTATPT 

LbbAl lAlbl 


PPPTPPTTSP 
bbb 1 bbl lAb 


651 


CAATATCGAC 


GAAGACbA 1 A 


IbbAlAbb I A 


pppp fi a a n r 1 
bbbbbAbAbb 


PfPPA A A A A 


701 


TGCATCAGGC 


GTTTAI CGAT 


1 1 bAibbAbb 


PTSTPPPPTT 
blAlobbbi 1 


CT 2\ PP.PP A A a 


751 


GCCTTTTTGT 


GTATTGACAG 


PV*« A A P» A PPTP 

bGAAbAbb 1 b 


bbbbbbAl 1 1 


TPPPPHfiPT 
1 bbL-L-AAALj 1 


801 


GAGCAAACCT 


TATGCTACTT 


Abbb 111 bbA 


b b A 1 AL. V~ Liv_ V_ 


vxrt\*jrll V- 1/W-Va 


851 


CCACCGACAT 


CGAAAACGTC 


GGCGCGCAAA 


TGAAATTCAC 


CGTCCATGTT 


901 


CAAATGAAAG 


GACATGAGCA 


GGGGTCGTTT 


GAAGTCGTGC 


TGAATATGCC 


951 


CGGCAGACAC 


AACGTGCTGA 


ACGCATTGGC 


AGCCATCGGC 


GTGGCGCTGG 


1001 


AAGTCGGCGC 


ATCGGTTGAA 


GCGATCCAAA 


AAGGCTTGCT 


CGGCTTTGAA 


1051 


GGCGTCGGCC 


GCCGCTTCCA 


AAAATACGGC 


GACATCAAGT 


TGCCAAACGG 


1101 


CGGGACCGCG 


CTCTTGGTGG 


ACGACTACGG 


ACACCACCCC 


GTCGAAATGG 


1151 


CGGCGACCCT 


TGCCGCCGCA 


CGCGGCGCGT 


ATCTGGAAAA 


ACGTTTGGTA 


1201 


CTCGCCTTCC 


AGCCGCACCG 


CTATACCCGC 


ACGCGCGATT 


TGTTTGAAGA 


1251 


CTTTACCAAA 


GTCCTCAATA 


CCGTTGACGC 


GCTGGTGCTG 


ACCGAAGTTT 


1301 


ATGCCGCCGG 


TGAAGAGCCG 


ATTGCCGCCG 


CCGATTCCCG 


CGCTCTTGCC 


1351 


CGCGCCATCC 


GCGTGTTGGG 


CAAACTCGAG 


CCGATTTACT 


GCGAAAACGT 


1401 


TGCCGATCTG 


CCCGAAATGC 


TGTTGAACGT 


TTTGCAGGAC 


GGCGACATCG 


1451 


TGTTGAATAT 


GGGCGCGGGA 


AGCATCAACC 


GCGTCCCCGC 


CGCGCTGCTG 


1501 


GCATTGTCGA 


AACAGATTTG 


A 






This corresponds to the amino acid sequence <SEQ ID 3 12; ORF 092>: 


m092 .pep 












1 


MFFISIRYIF 


VRKLWRANGQ 


PFKITPLRIE 


NPPERNIMMK 


NRVTNIHFVG 


51 


IGGVGMSGIA 


EVLHNLGFKV 


SGSDQARNAA 


TEHLGSLGIQ VYPGHTAEHV 


101 


NGADVWTST 


AVKKENPEW AALEQQIPVI 


PRALMLAELM 


RFRDGIAIAG 


151 


THGKTTTTSL 


TAS I LGAAGL 


DPTFVIGGKL 


NAAGTNARLG 


KGEYIVAEAD 


201 


ESDASFLHLT 


PIMSWTNID 


EDHMDTYGHS 


VEKLHQAFID 


FIHRMPFYGK 


251 


AFLCIDSEHV 


RAILPKVSKP 


YATYGLDDTA 


DIYATDIENV 


GAQMKFTVHV 


301 


QMKGHEQGSF 


EWLNMPGRH 


NVLNALAAIG 


VALEVGASVE 


AIQKGLLGFE 


351 


GVGRRFQKYG 


DIKLPNGGTA 


LLVDDYGHHP 


VEMAATLAAA 


RGAYLEKRLV 


401 


LAFQPHRYTR 


TRDLFEDFTK 


VLNTVDALVL 


TEVYAAGEEP 


IAAADSRALA 


451 


RAIRVLGKLE 


PIYCENVADL 


PEMLLNVLQD 


GDIVLNMGAG 


SINRVPAALL 


501 


ALSKQI* 











Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 092 shows 96.6% identity over a 506 aa overlap with a predicted ORF (ORF 092.ng) 
from N. gonorrhoeae: 

m092/g092 



10 20 30 40 50 60 

m092 .pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 
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g092 



M 1 1 1 1 1 1 1 1 1 Ml I MM MMMI M M M M M M h M M M M M M M M 

MFFISIRYIFVRKLWCANGQTFKITPLRTKNQPERNIMMKNRVSNIHFVGIGGVGMSGIA 
10 20 30 40 50 60 



70 80 90 100 110 120 

m092 . pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVWTSTAVKKENPEW 

I M M M M M M M M M M M h 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 M M 1 1 1 1 1 

g 092 EVLHNLGFKVSGSDQARNAATEHLSSLGIQVYPGHTAEHVNGADVWASTAVKKENPEW 

70 80 90 100 110 120 

130 140 150 160 170 180 

m092 . pep AALEQQI PVI PRALMLAELMRFRDGI AI AGTHGKTTTTSLTAS ILGAAGLDPTFVIGGKL 

M M M M M M M M M M M M M M M M M M M M M I M M M M M M M M I 

g092 AALERQI PVI PRALMLAELMRFRDGIAIAGTHGKTTTTSLTAS ILGAAGLDPTFVIGGKL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m092 . pep NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 

M M M M Ml M M M M M M M M M M M M I M M M M M M M M M M I M I 

g092 NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 

190 200 210 220 230 240 

250 260 270 280 290 300 

m0 92 . pep FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV 

i M M M M M M h M 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

g092 F I HRMPFYGKAFLCVDSEHVRAI LPKVS KP YATYGLDDTADI YATD I ENVGAQMKFTVHV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m092 . pep QMKGHEQGSFEWLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

M M M M M M M M M M M M M M M M M M M M M M M M M M M M M M 

g092 QMKGHEQGSFEWLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 

370 380 390 400 410 420 

mO 92 . pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 

M M M M M M M M M M M M M M M M M M M M M M M M M M M M M I 

g092 DIKLPNGGTALLVDDYGHHPVE^4AATLAAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 



430 440 450 460 470 480 

m092 . pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPI YCENVADLPEMLLNVLQD 

M M M M M M M ! M M h M M I M M M M M M M M M M M M h I h I M 1 1 

g092 VLNTVDALVLTEVYAAGEEPVAAADSRALARAIRVLGKLEPIYCENVADLPQMLMNVLQD 

430 440 450 460 470 480 



490 500 
m092 .pep GDIVLNMGAGSINRVPAALLALSKQIX 

1 1 M M M I M M M h 1 1 1 MMM 

g 0 9 2 GD WLNMGAGS I NRVP S ALLELS KQIX 

490 500 



The following partial DNA sequence was identified inN. meningitidis <SEQ ID 313>: 

a092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTGA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 TTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT G GG CAT TCAA GTTTATCCCG GCCATACCGC AGAACACGTT 
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301 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC 

351 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

4 01 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC 

4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

601 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGT GTTGAGAAGC 

7 01 TGCATCAGGC GTTT AT CG AT TTCATCCACC GTATGCCCTT CTACGGCAAA 

751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

901 CAAATGAAAG G AC AT GAG C A GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG 

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

1051 GGTGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG 

1101 TGGAACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TTCCGCCGCA CGCGGCGCGT ATCCGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

1301 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CTGATTCCCG CGCTCTTGCC 

1351 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 

1401 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG 

14 51 TGTTGAATAT GGGTGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG 

1501 GAATTGTCGA AACAGATTTG A 

This corresponds to the amino acid sequence <SEQ ID 314; ORF 092.a>: 

a092 .pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVWTST AVKKENPEVV AALEQQIPVI PRALMLAELM RFRDGIAIAG. 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSVVTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLSAA RGAYPEKRLV 

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 

4 51 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 

501 ELSKQI* 

m092/a092 99.4% identity over a 506 aa overlap 

10 20 30 40 50 60 

m092 . pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 

I j i 1 1 I 1 1 i 1 1 1 1 1 I ! 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 I 1 1 1 1 1 I I I I I 1 1 I I I I I I 

a 092 MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 92 . pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVVVTSTAVKKENPEVV 

1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 M 

• a0 92 EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVVVTSTAVKKENPEVV 

70 80 90 100 110 120 

130 140 150 160 170 180 

m092.pep AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

a092 AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m092.pep NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSVVTNIDEDHMDTYGHSVEKLHQAFID 
I I I I I I II I I I II I I I I I I I II I I I II I I I I ! I I I I I I I I I I I I I I ! I I I I I I I I I I I II 
a0 92 NAAGTNARLGKGEY I VAEADESDASFLHLTP IMS VVTNIDEDHMDTYGHS VEKLHQAFID 
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190 200 210 220 230 240 

250 260 270 280 290 300 

FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV 

MM I M ! I I I I I I I I I I I I I I I M 1 I I I I I M I I I I I I I i I i I I I I I I I! I I I I 

FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV 

250 260 270 280 290 300 

310 320 330 340 350 360 

QMKGHEQGSFEVVLNMPGRHNVLNAL7VAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
M I I I I I I t I I I I I I f I I I I I M t I I I I ! I I I i M I I I I I ! I I II i I I I I I I I I M I i 1 I 
QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 

370 380 390 400 410 420 

DIKLPNGGTALLVDDYGHHPVEMAATL7VAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 

M I I I I I I I I I I I I I I I I I 1 I I I I I i I : i I I I I I I I I I I I I I ! I ! I I I i 1! I I I I ! I I I 
DIKLPNGGTALLVDDYGHHPVEMAATLSAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 
370 380 390 400 410 420 

430 440 450 460 470 480 

VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 
M I i I I I I I I I I I I I I I I I I I I I I I I 1 I I I M I I I M I M I I I I I I M I I I II I I M I I I 
VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 

430 440 450 4 60 470 480 

490 500 
G D I V LNMG AG S IN RV P AALLAL S KQ I X 
M I I I I I I I i I I I i I I i I I I I I I I I I 
GDIVLNMGAGSINRVPAALLELSKQIX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3 15>: 

g093 . seq 

1 aTGCAGAATt ttgGCAAAGT ggccgtATTG ATGGGtggtT TTTCCAGCGA 

51 ACGAGAaatc tcgcTGGACA GCgGTACCGC CATTTTGAAC GCCTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGACC CTAAGGAAAC GCCGTTATCC 

151 GAACTGAAGG AGCGGGGCTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCCTCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTACCCGTTC CCGAGTTCGC 

3 51 CGTACTGTAC GATGATACCG ATTTCGATGC CGTCGAAGAA AAATTGGGTC 
401 TGCCGATGTT TGTGAAGCCG GCGGCCGAAG GCAGCAGCgt cggcgtggta 

4 51 aAAGTCAAAG AAAaaggccg TCTGAAAAGC GTTtacgaag aatTGAaaCA 
501 CCTTcagggg cgaAAtcatt gccgAacgTT TTATCGGCGG CGGCGAATAT 
551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATCCC 
601 CGCAACCGAG TTTTACGAc t acgaagccaa GtacaaCCGA GACGAcacca 
651 tttaTCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGC CTGATG 
701 CGCGAACTGG CGGTTCGCGG CGCACAGGCA ATCGGTGCGG AAGGCTGCGT 
751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 316; ORF 093. ng>: 

g093 .pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKERGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLY DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKEKGRLKS VYEELKHLQG RNHCRTFYRR RRIFLPRPER QRAARHTHHP 

201 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RTGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 317>: 

m093 . seq 



m092 .pep 
a092 

m092.pep 
a092 

m092 .pep 
a092 

m092 .pep 
a092 

m092 .pep 
a092 
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1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CTAAAGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCrAA GACGGGGCGG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

4 01 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 

451 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 

501 CCTTCAGGG. CGAAATCATT GCCGAACGTT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATTCC 

601 CGCAACCGAG TTTTACGACT ACGAAGCCAA GTACAACCGC GACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG . 

701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACGAGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 318; ORF 093>: 

m093 .pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGX DGAVQGALEL LGI PYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHLQX RNHCRTFYRR RRIFLPRPER QRAARHTHHS 

2 01 RNRVLRLRSQ' VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDEP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 093 shows 96.7% identity over a 276 aa overlap with a predicted ORF (ORF 093. ng) 
from N. gonorrhoeae: 

m093/g093 

10 20 30 40 50 60 

MQNFGKVAVLMGGFSSEREISLDSGTAILNALKS KGI DAY AFDPKETPLS ELKAQGFQTA 

I 1 1 1 1 1 1 1 1 1 1 M II I 1 1 1 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1 1 1 1 1 ! 1 1 1 ! 1 1 1 1 1 1 I I 1 1 MUM 

MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKERGFQTA 
10 20 30 40 50 60 

70 80 90 100 110 120 

FN I LHGT YGXDGAVQGALELLG I P YTGS GVAAS AI GMDKYRCKL I WQALGL PVPEFAVLH 

MINIMI II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 

FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLY 
70 80 90 100 110 120 

130 140 150 160 170 180 

DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 

1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 llllillllllllll IMIIIIIII 

DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKEKGRLKSVYEELKHLQGRNHCRTFYRR 
130 140 150 160 170 180 

190 200 210 220 230 240 

RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 

I I I ! I I I I II I II JIM I I I i | 1 I | I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I E I 

RRIFLPRPERQRAARHTHHPRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 
190 200 210 220 230 240 

250 260 270 

RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 

hi 1 1 III II ! II IMMIIIMMM I H II I hi I 

RTGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 
250 260 270 



m093 .pep 
g093 

m093 .pep 
g093 

m093 .pep 
g093 

m093 .pep 
g093 

m093 -pep 
g093 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 319>: 

a093 . seq 

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CCAAGGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

401 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 

451 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 

501 CTTTCAGGG. CGAAATCATT GCCGAACGGT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCTG TGTTGAACGG CAAAGGCCTG CCCGGCATAC ACATCATCCC 

601 CGCGACCGAG TTTTATGACT ACGAAGCCAA GTACAACCGC AACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATCTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 320; ORF 093. a>: 

a093.pep 

1 MQN FGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGVV 

151 KVKGKGRLKS VYEELKHFQX RNHCRTVYRR RRIFLPCVER QRPARHTHHP 

201 RDRVL*LRSQ VQPQRHHLSM SFGRSDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 



m093/a093 95.7% identity over a 276 aa overlap 



10 20 30 40 50 60 

m093 .pep MQN FGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 

I I I I I I I I I I I I I I I I I 1 1 I I I 1 1 i 1 1 1 1 I 1 I 1 1 I 1 1 I I I I I I M I I I I I I I I I I I i I I I 

a093 MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m093.pep FNILHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 
I I I I I I I I I I I I I I I I I I I I I t I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a093 FNILHGTYGE DGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 

70 80 90 100 110 120 



130 140 150 160 170 180 

m093 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 
I I I I I I I I I ! I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I II I : I I I I I I I I III 
a093 DDTDFDAVEEKLGLPMFVKPAAEGSSVGVVKVKGKGRLKSVYEELKHFQXRNHCRTVYRR 

130 140 150 160 170 180 



190 200 210 220 230 240 

m093.pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 
I I I I I I MM I I II I I Mill I II M I I : I I I I II M II I M I M II II I II I 1 
a093 RRIFLPCVERQRPARHTHHPRDRVLXLRSQVQPQRHHLSMSFGRSDRSRRKPDARTGGSR 

190 200 210 220 230 240 



250 260 270 

m093 . pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 
I M II I II I II M I I I I M I I I I I I II M I I M M I I 
a0 93 RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 

250 260 270 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 32 1>: 

g094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCGAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTAccggc aatggcgatg 

151 cCGTCACGGA AGCGCATCAG CTCTGCCAGC ATCAAGGCGC GCGGAATAAC 

201 GGGAATTTGC CGCTCCAACG CAgcgacaAC TTCGGgattT TCTTTCTTGA 

251 CGGCGGTAGA GGCAACGACG ACATccgcAC CGTTAACGTG TTCTGCGGTA 

3 01 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 322; ORF 094.ng>: 

g094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRISSAS IKARGITGIC ' RSNAATTSGF SFLTAVEATT TSAPLTCSAV 
101 WPG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 323>: 

m094.seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTA GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCGGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 324; ORF 094>: 

m094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMA M 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF S FLTAVEVTT TSAPLTCSAV 
101 WPG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 094 shows 95.1% identity over a 103 aa overlap with a predicted ORF (ORF 094.ng) 
from N. gonorrhoeae: 

10 '20 30 40 50 60 

MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRINST^N 

Ml Ml 1 1 II I Mill II II Mil! IMII Ml! II IN II ■ II 1 1 II 1 1 M I! I MM: 

MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRISSAS 
10 20 30 40 50 60 

70 80 90 100 

I RARGI TGI CCSNAATTSGFSFLTAVEVTTTS APLTCS AVWPGX 

hllllllll I! I M II MIMIMIMIIIM I MM Mill 

IKARGITGICRSNAATTSGFSFLTAVEATTTSAPLTCSAVWPGX 
70 80 90 100 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 325>: 

a094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 326; ORF 094.a>: 



m094/g094 

m094 .pep 
g094 

m094 .pep 
g094 
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a094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVVV VLPCVPAMA M 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

m094/a094 100.0% identity over a 103 aa overlap 

10 20 30 40 50 60 

m094 .pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN 
I I I I I I I M I I I I I I I I I I I ! I I I I I I I 1 M I I I I I I I I I I i I I I I I I I I I I I 1 I I I I I I 

a094 MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN 

10 20 30 40 50 60 



70 80 90 100 

m094 .pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I- I I I I I I I I I I I I 
a0 94 IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 327>: 

g095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AACACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGGGTCA GTGTAGGAAA 

3 01 GAGGCATCGG ATCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

3 51 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 328; ORF 095.ng>: 

g095 .pep 

1 MS FHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKR L MQFFDAVPVG IHMVFV DIGN DGHNRGQCRK 

101 EASDRRLRQR CIRLCPSGRW CLRR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 329>: 

m095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTAG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 330; ORF 095>: 

m095 .pep 

1 MS FHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKR L MQFFDAVPVG IHMVFV DIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRX CLRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 095 shows 97.6% identity over a 124 aa overlap with a predicted ORF (ORF 095. ng) 
from N. gonorrhoeae: 

m095/g095 

10 20 30 40 50 60 



m095 . pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

MM I II Ml Mill IIMMI IIIIMMIIMM llllll III II Mil I MM! II! 
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gO 95 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m095 . pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gO 95 HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRGQCRKEASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 



m0 95 .pep CLRRX 
Mill 

g095 CLRRX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33 1>: 

a095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGCTTC TCAACACTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTGG TGCCTGCGGC GTTGA 



This corresponds to the amino acid sequence <SEQ ID 332; ORF 095. a>: 

a095.pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

.51 NTQKGFAVEG HTVDEIDKRL MQLLNTVPVG IHMVFVDIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRW CLRR* 

m095/a095 96.0% identity in 124 aa overlap 



10 20 30 40 50 60 

m095.pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 
I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I ! I 
a095 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m095 . pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

I I I I I I I I I I I I :::: I I I I I M M I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a095 HTVDEIDKRLMQLLNTVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 



m095.pep CLRRX 
I I I I I 

a095 CLRRX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 

333>: 

g096 . seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGTGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGAtaaat ggtgTCGTCT CGGttgtaCt 
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201 tggcttcgta gTCGTAAAAC TCGGTTGCGG GGATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 AcgtTcggca atgaTTtcgc ccctgAAGGT GttTCAattc ttcgtaAACG 

351 CTTTTCAGAc ggccttTTTC TTTGA 



This corresponds to the amino acid sequence <SEQ ID 334; ORF 096.ng>: 

g096 .pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLCAANR QFAHQAFFGF 

51 GQIFRRT LIN GVVSVVLGFV VVKL GCGDDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAPEG VSILRKRFSD GLFL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 335>: 

m096.seq 

1 ATGGCTCGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTCG CGGTTGTACT 

2 01 TGGCTTCGTA GTCGTAAAAC TCGGTTGCGG GAATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 ACGTTCGGCA ATGATTTCGC CC. TGAAGGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 

This corresponds to the amino acid sequence <SEQ ID 336; ORF 096>: 

m096.pep 

1 MARHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 

51 GQIFRRT LIN GWAVVLGFV VVKL GCGNDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAXEG VSILRKRFSD GLFL* 

m096/g096 96.0% identity in 124 aa overlap 

10 20 30 40 50 60 

m096 . pep MARHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN 
I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | M 
g096 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLCAANRQFAHQAFFGFGQIFRRTLIN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m096 . pep G V VA VVLG FV VVKLGCGN DVYAGQ P FAVQDG AG I FAAADKT FGN D FAXE GVS I LRKR FS D 

I I I = I I I I II I I I I I I I : I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I 1 I M 1 
g096 GWSVVLGFVWKLGCGDDVYAGQPFAVQDGAGI FAAADKT FGNDFAPEGVS I LRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 
MM! 

g096 GLFLX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 337>: 

a096.seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAGATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTTG CGGTTGTACT 

201 TGGCTTCGTA GTCATAAAAC TCGGTCGCGG GGATGATGTG TATGCCGGGC 

251 AGGCCTTTGC CGTTCAACAC AGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 CCGTTCGGCA ATGATTTCGC CCT . GAAAGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 

This corresponds to the amino acid sequence <SEQ ID 338; ORF 096.ng>: 
a096 .pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 
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51 GQIFRRT LIN GVVAVVLGFV VIKL GRGDDV YAGQAFAVQH RAG IF AAA DK 
101 PFGND FAXES VSILRKRFSD GLFL* 



m096/a096 92.7% identity in 124 aa overlap 

10 20 30 40 50 60 

m096.pep MARHTGQGVD FQQIE FAVG I FEE I DAHAAFRT DCLRAANRQFAHQAFFG FGQI FRRTLIN 

II I M I I I I I I I I I I I I I I I I I II I It I I I I I I I I I I I I II I I I I I I | | | | | I | | | M I 
a096 MAGHTGQGVD FQQIE FAVG I FEE I DAHAAFRT DCLRAANRQFAHQAFFG FGQI FRRTLIN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m096.pep GVVAVVLG FVVVKLGCGNDVYAGQPFAVQDGAG I FAAADKT FGN D FAXEGVS I LRKRFS D 

I M I I I I I I I I : I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I 
a096 G WAV V LG FVVIKLGRGDDVYAGQAFAVQHRAG I FAAADK PFGND FAXES VSILRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 
I I I I I 



a096 GLFLX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 339>: 

g097 . seq 

1 ATGGATATTT CAAAACAAAC ATTGCTGGAT AGGGTTTTTA ACCTGAAGGC 

51 AAACGGTACG ACGGTACGTA CCGAGTTGAT GGCGGGTTTG ACGACCTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAATC CCCTGATTTT GGGCGAGACC 

151 GGAATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CATCCGCCAT 

201 CGGCTGTTTT GTCATGGGTT TTATCGGCAA CTATCCGATT GCGCTTGCCC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

3 01 GGCGTGCCTT GGCAGGTGGC GTTGGGTGCG GTGTTCATTT CCGGTCTGAT 
351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCCG CCGGTATCGG TTTGTTTTTG 
4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 
501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC CAGCGCACTG TTGGCATTGT 
551 TCGGTTTTGT CATGGTGGTC GTATTGGGGT ATTTCCGCGT TCAAGGCGCA 
601 ATCATCATCA CCATTCTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAACGAGTTT CACGGCGTGG TCGGCGAAGT ACCGGGCATT GCGCCGACCT 
701 TTATGCAGAT GGATTTTAAA GGTCTGTTTA CCGTCAGCAT GGTCAGCGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATTTGTTC GACAGTACCG GAACGCTGGT 
801 CGGCGTATCC CACCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TGGCGTGTCT GATGTTCTCC CCATTGGCGA AAAGTGTTCC GGTATTTGCC 

1051 ACCGCGCCCG CACTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATTGAT TGGGACGATA TGACTGAAGC CGCGCCCGCG TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 

12 01 TTCATCAGCT ATGCCGTGGT CAAACTTTTG TGTCGCCGGA CTGGGGACGT 
1251 GCCGCCTATG GTATGGGTTG TTGCCGTATT GTGGGCATTG AAATTCTGGT 

13 01 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 340; ORF 097. ng>: 

g097 .pep 

1 MDISKQTLLD RVFNLKANGT TVRTE LMAGL TTFLTMCYIV IV NPLILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFIGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IV ANPATLVG LGDIHQPS AL LALFGFVMW VLGYF RVQGA 

201 IIITILTITV IASLMGL NEF HGWGEVPGI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLV DLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

3 01 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPVFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 
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4 01 FISYAWKLL CRRTGDVPPM VWWAVLWAL KFWYLG* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 341 >: 



m A Q *7 eon 
uiU ? 1 . oc<4 




J- 


a Tan a c a ptt 






i ni 




151 


ggcatggata 


2 01 




251 


CGGGGATGGG 


3 01 


GGCGTGCCTT 


351 


TTTTATCCTG 


401 


TGCCTATGGG 


451 


GCACTGATTT 


501 


CTTGGTCGGT 


551 


TCGGTTTTGC 


601 


ATCATCATCA 


651 


GAATGAATTT 


701 


TTATGCAGAT 


751 


ATTTTCGTCT 


801 


CGGCATATCC 


851 


TGAAACGCGC 


901 


TTGGGTACTT 


951 


GGCAGGCGGA 


1001 


TCGCCTGCCT 


1051 


ACCGCGCCCG 


1101 


GGATATTGAT 


1151 


TTGTTTTCAT 


1201 


TTCATCAGTT 


1251 


TCCGCCTATG 


1301 


ATTTGGGCTG 



CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 
ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 



GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 
GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 
TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 
TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 
CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 
TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCATTGT 



CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 



CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 



GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 
A 

This corresponds to the amino acid sequence <SEQ ID 342; ORF 097>: 

m097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IVN PXILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAGIGLFL 

151 ALISLKGAGI IVA NPATLVG LGDIHQPS AL LALFGFAMW VLGHF RVQGA 

201 IIITILTITV IASLMGLN EF HGIIGEVPSI APTFMQMDFE GLFTVSMVSV 

251 I FVFFLV DLF DSTGTLVGIS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

3 01 LGTSSTTPYV ESAAGVSAGG R TGLTAVTVG VLMLACLMF S PLAKSVPAFA 

3 51 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

4 01 FISYAWKLL CRRTKDVPPM VWIVAVLWAL KFWYLG * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 097 shows 96.3% identity over a 436 aa overlap with a predicted ORF (ORF 097.ng) 
from N. gonorrhoeae: 

m09l/q091 

10 20 30 40 50 60 

m0 9 7 . pep MDTSKQTLLDGI F KLKANGTTVRTE LMAGLTTFLTMC Y I V I VN PX I LGETGMDMGAVFVA 

II lllllll H :| I II 1 1 Ml! II 1 1 M I Mi 1 1 1 II 1 1 M I lllllllllllllll 

g097 MD I SKQTLLDRVFNLKANGTTVRTELMAGLTTFLTMCYIVIVNPL I LGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m097 . pep TC I AS AI GCFVMGFVGNYP I ALAPGMGLNAY FTFAWKGMGVPWQVALGAVF I SGL I F I L 

M I IM 1 1 1! II I h I II II I II I II ! I h I II 1 1 1 1 1 1 II 1 1 1 1 ! II 1 1 1 1 1 1 1 1 II: I 

g097 TCI AS AIGCFVMGFIGNYPI ALAPGMGLNAY FTFAWKGMGVPWQVALGAVF I SGL I FIL 

70 80 90 100 110 120 

130 140 150 160 170 180 
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m097 pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGI IVANPATLVGLGDIHQPSAL 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
g097 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGI IVANPATLVGLGDIHQPSAL • 

130 140 150 160 170 180 



190 200 210 220 230 240 

m097.pep LALFGFAMVWLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFE 
I I I I I I : I I Ml hi I I I !l I II II M I I I! I I I I I I I I I II - I I I I : I II M II I I I : 
g097 LALFGFVMVWLGYFRVQGAIIITILTITVIASLMGLNEFHGWGEVPGIAPTFMQMDFK 

190 200 210 220 230 240 



250 260 270 280 290 300 

m097 . pep GLFTVSMVSVIFVFFLVDLFDSTGTLVGISHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

! I! ! 1 1 M I M 1 , 1 1 1 1 M 1 1 ' 1 1 II I i M : 1 1 1 1 M II ! i 1 1 1 1 M I M M M I M 1 1 1 

gO 9 7 GLFTVSMVSVI FVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

250 260 270 280 290 300 



310 320 330 340 350 360 

mO 9 7 . pep LGTSSTTP YVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h II 1 1 1 1 1 1 1 1 1 1 

g097 LGTSSTTP YVES AAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPVFATAPALLYVGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m097 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 

I III I IliMIMI III III Ml III III MM II I III II I II I II Ml Mil Mill 

g 0 9 7 QMLRSARD I DWDDMTE AAP AFLT I VFMP FT YS I ADG IAFGF I S YAWKLLCRRTGDVP PM 

370 380 390 400 410 420 



m097 .pep 
g097 



430 

VW I VAVLWALKFW YLGX 

M M M M I I I M M M 

VWVVAVLWALKFWYLGX 
430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 343> 

a097 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACG ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCTGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

401 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 

4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCACTGT 

551 TCGGTTTTGC CATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 

601 AT CAT CATC A CCATTTTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAACGAATTT CACGGCATCA TCGGCGAAGT GCCGAGCATT GCGCCGACTT 

701 TTATGCAGAT GGATTTTAAA GGGTTGTTTA CCGTCAGCAT GGTCAGCGTG 

7 51 ATTTTCGTCT TTTTCCTAGT CGATCTGTTC GACAGTACCG GAACACTGGT 

801 CGGTGTATCG CATCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CTATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGTGCGG CGGGCGTATC 

951 GGCAGGCGGG CGGACAGGTC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATCGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 
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1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 344; ORF 097.a>: 

a097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IV NPLILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAVVKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IV ANPATLVG LGDIHQPS AL LALFGFAMVV VLGHF RVQGA 

201 IIITILTITV IASLMGL NEF HGIIGEVPSI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLV DLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

4 01 FISYAVVKLL CRRTKDVPP M VWIVAVLWAL KFWYLG * 

m097/a097 99.3% identity in 436 aa overlap 

10 20 30 40 50 60 

m097 .pep MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPXILGETGMDMGAVFVA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I i I I I 
a097 MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m097 . pep TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAVVKGMGVPWQVALGAVFISGLIFIL 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I t 1 1 I I I 
a097 TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAVVKGMGVPWQVALGAVFISGLIFIL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 97.pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 
I | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a097 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m097 . pep LALFGFAMVVVLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPT FMQMDFE 

I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : 
a 097 LALFGFAMVWLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m097 . pep GLFTVSMVSVIFVFFLVDLFDSTGTLVGISHRAGLLVDGKLPRLKRALLADSTAIVAGAA 
M I I I I I I I M I I M M 1 E I I I I I I I I I : I I I I t I I I I I I I i I I I I 1 I I I I I I I I I I I I I 
a097 GLFTVSMVSVIFVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m097 . pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 
I | I | | I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I II I I I 
a097 • LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m097.pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 
| | I | I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a097 QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAVVKLLCRRTKDVPPM 

370 380 390 400 410 420 

430 

m097 . pep VWIVAVLWALKFWYLGX 

I I I I I I i I I I I I I I I I I 
a097 VWIVAVLWALKFWYLGX 

430 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 345>: 

g098 . seq 

1 ATGACCGCCG ACGGTCTCTT CGTCGCTTTC AACTTCAATA CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAGCAGGA TGCTGCCCAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCGCTG TCGGGATGGC TTTCGCGTTC 

151 GGGATGAACG CAGCCGAGCA TGGACACGCC GGTACGCATC ACGTCCATCG 

2 01 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

3 01 GTTGGGCAGA TGGCCGTGAA TCAGCAAGTG TGCGACTTCT TCAAACTCGC 
3 51 ATTTTTGTGC CAAATTAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 346; ORF 098.ng>: 

g098.pep 

1 MTADGLFVAF NFNTFAWRI LIPVQQDAAQ AGDQFVGDVA RFAVGMAFAF 

51 GMNAAEHGHA GTHHVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV CDFFKLAFLC QIRMS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 347>: 

m098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

2 01 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 
301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

3 51 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 348; ORF 098>: 

m098 .pep 

1 MTADGLFVAF NLNAFAWRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 098 shows 89.6% identity over a 125 aa overlap with a predicted ORF (ORF 098.ng) 
from N. gonorrhoeae: 

m098/g098 

10 20 30 40 50 60 

m098 . pep MTADGLFVAFNLNAFAWRI LI PVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGY^ 

1 1 1 1 1 1 1 1 II h 1 : 1 1 1 1 1! 1 1 1 1 h 1 1 h i I M 1 1 1 1 1 1 1 h 1 1 h I 1 1 1 h 1 1 : 1 

g098 MTADGLFVAFNFNTFAWRI LI PVQQDAAQAGDQFVGDVARFAVGMAFAFGMNAAEHGHA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m098 . pep GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

I I I : I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
g0 9 8 GTHHVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVCDFFKLAFLC 

70 80 90 100 110 120 



m098.pep QIRMSX 
I I I I I I 

g098 QIRMSX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 349>: 

a098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 
51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 
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101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 350; ORF 098.a>: 

a098.pep 

1 MTADGLFVAF NLNAFA WRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE ' LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

m098/a098 100.0% identity in 125 aa overlap 

10 20 30 40 50 60 

m098 pep MTADGLFVAFNLNAFAVVRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 

I | | | | I I I I I I I I I I I I ! I 1 1 1 1 I I 1 1 I I I I t I I I i 1 1 I I I I I 1 1 I I I I I I I I I I I I 1 1 I 

a 098 MTADGLFVAFNLNAFAWRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m098 . pep GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 
I | | | | | | | I | | I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
a 098 GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

70 80 90 100 110 120 



m098.pep QIRMSX 
I I I I I I 

a098 QIRMSX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 35 1>: 

g099.seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTGGA 

51 GCTGACGGGC AAACGGCAGG CGGGCATTAC TGCCACAGAC ATCGTGTTGG 

101 CACTGACCGA ATTCTTGCGT AAAGAGCGCG TGGTCGGGGC GTTTGTCGAA 

151 TTTTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCCATG TTCGCCATCG 

2 51 ACGCGCAAAC TATTGATTAT TTGAAACTGA CCGGACGTGA CGACGCGCAG 

3 01 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTAT GGGCAGGTGG 
3 51 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 
401 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCCACC 
451 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCAGA 
501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCGTGTA 
551 CCAATACTTC CAACCCGCGC AACGTTGTCG CCGCCGCACT GTTGGCACGC 
601 AATGCCAACC GCCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 
651 TGCCCCGGGT TCAAAAGTAG CCGGAATCTA TTTGAAAGAA GCAGGCTTGT 
701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCATGTACC 
751 ACCTGTAACG GCATGAgcgG CGCGCTcgaC CCGAAAATCC AACAAGAAAT 
801 CATCGACCGC GAtttgtacg cCACCGCCGT ATTGTCAGGC AACCGCAACT 
851 TCGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
9 01 CCTTTGGTCG TTGCCTACGC ATTGGCAGGT AGCATCCGTT TCGATATTGA 
951 AAACGACGTA CTCGGCGTTG CAGACGGCCG CGAAATCCGC CTGAAAGATA 

1001 TCTGGCCGAC AGACGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

1051 CCGCAACAAT TCCGCGACAT TTATATCCCG ATGTCCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCAAGCCCGC TGTACGACTG GCG AC CGATG TCCACCTACA 

1151 TCCGCCGTCC GCCCTATTGG GAAGGCGCAC TGGCAGGGGA ACGTACATTA 

1201 AGAGGTATGC GTCCGCCGGC GATTTTGCCC GACAACATCA CCACCGACCA 

1251 CATCTCgcca tCCAATGCGA TTTTGGCCGG cagTGCcgca ggtgaATATT 
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13 01 TGGCGAAAAT GGGTTTGCCT GAAGAagaCT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTtcgt 
1451 tggcacgcgT tgaacCAGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTT ATCATCATTG CCGGTGCGGA 
1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC GGCGAAGGGC GTGCGGCTGG 
1601 CGGGTGTGGA AGCCATCGCC GCCGAAGGTT TCGAGCGCAT CCACCGCACC 
16 51 AACCTCATCG GCATGGGCGT CTTGCCGCTG CAATTCAAAC CCGGCACCAA 
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 
1751 AACGCACACC GCGCTGCGGC CTGACCCTCG TGATTCACCG TAAAAACGGA 
18 01 GAAACCGTCG AAGTTCCGGT TACCTGCCGC CCCGATACCG CAGAAGAAGC 
1851 ATTGGTATAT GAAGCCGGCG GCGTATTGCA ACGGTTTGCA CAGGACTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 352; ORF 099.ng>: 

g099 .pep 

1 MLGRASMMRL PDIVGVELTG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDAQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWAGGLKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAGIYLKE AGLLPEMEKL GFGIVAFACT 

2 51 TCNGMSGALD PKIQQEIIDR DLYATAVLSG NRNFDGRIHP YAKQ AFLASP 

3 01 PLWAYALAG SI RFDIENDV LGVADGREIR LKDIWPTDEE IDAIVAEYVK 
3 51 PQQFRDIYIP MSDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 
401 RGMRPPAILP DNITTDHISP SNAILAGSAA GEYLAKMGLP EEDFNSYATH 
451 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 

5 01 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIA AEGFERIHRT 
551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCG LTLVIHRKNG 

6 01 ETVEVPVTCR PDTAEEALVY EAGGVLQRFA QDFLEGNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 353>: 

m099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGCAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

3 01 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

3 51 CTTGAAAACC GCCGTTTATC CTCGCGTTTT GAAATTTGAT TTGAGCAGCG 

4 01 TAACGCGCAA TATGGCAGGC CCAAGTAACC CGCATGCCCG TTTTGCGACC 
4 51 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCGGA 
501 CGGCCAAATG CCCGACGGCT CGGTCATCAT CGCCGCGATT ACCAGTTGCA 
551 CCAACACTTC CAACCCGCGC AACGTTGTTG CCGCCGCGCT CTTGGCACGC 
601 AATGCCAACC GTCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 
651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCGGGCCTGT 
701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCCTGCACC 
751 ACCTGCAACG GCATGAGTGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 
8 01 CATCGACCGC GATTTGTACG CCACCGCCGT ATTATCAGGC AACCGCAACT 
851 TCGACGGCCG TATCCACCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGTATCCGTT TCGATATTGA 
951 AAACGACGTA CTCGGCGTTG CAGACGGCAA GGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTGC CGATGAAGAA ATCGATGCCG TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAGT TCCGCGATGT GTATGTACCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCCAGTCCGC TGTACGATTG GCGTCCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCTTACTGG GAAGGCGCGC TGGCAGGGGA ACGCACATTA 

1201 AGAGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 

1251 CCTCTCGCCG TCCAATGCGA TTTTGGCCGT CAGTGCCGCA GGCGAGTATT 

1301 TGGCGAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

1351 CGCGGCGACC ACTTGACCGC CCAACGCGCT ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAAAA ACGAAGACGG CAGCGTGCGC CAAGGCTCGT 

14 51 TCGCCCGCGT CGAACCCGAA GGCGAAACCA TGCGCATGTG GGAAGCCATC 

1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC ATCATCATTG CCGGTGCGGA 
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15 51 CTATGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 
1601 CCGGCGTAGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

16 51 AACCTTATCG GCATGGGCGT GTTGCCGCTG CAGTTCAAAC CCGACACCAA 
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTGGTCGGCG 
1751 AACGCACACC GCGCTGCGAC CTGACCCTCG TGATTCACCG TAAAAACGGC 
1801 GAAACCGTTG AAGTTCCCGT TACCTGCTGC CTCGATACTG CAGAAGAAGT 
18 51 ATTGGTATAT GAAGCCGGCG GCGTGTTGCA ACGGTTTGCA CAGGATTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 354; ORF 099>: 

m099.pep 

1 MLGRASMMRL PDIVGVELNG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM .PDGSVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAEIYLKE AGLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQAFLASP 

301 PLWAYALAG 5I RFDIENDV LGVADGKEIR LKDIWPADEE IDAWAEYVK 

351 PQQFRDVYVP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

4 01 RGMRPLAILP DNITTDHLSP SNAILAVSAA GEYLAKMGLP EEDFNSYATH 

4 51 RGDHLTAQRA TFANPKLFNE MVKNEDGSVR QGSFARVEPE GETMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPDTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPVTCC LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 099 shows 96.2% identity over a 639 aa overlap with a predicted ORF (ORF 099.ng) 
from N. gonorrhoeae: 

m099/g099 

10 20 30 40 50 60 

MLGRASMMRLPDIVGVELNGKRQAGITATDIVLALTEFLRKERWGAFVEFFGEGARSLS 

1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 h! 1 1 1 1 1 1 1 III IN III 1 1 M 1 1 1 1 1 1 II i 1 1 1 1 1 1 i M II 

MLGRASMMRLPDIVGVELTGKRQAGITATDIVLALTEFLRKERWGAFVEFFGEGARSLS 
10 20 30 40 50 60 

70 80 90 100 110 120 

I GDRAT I SNMTPEFGATAAMFAIDEQT I DYLKLTGRDDAQVKLVETYAKT AGLWADALKT 

1 1: 1 1 1 ! Ml Ml ill I M I ill I 1 1 MMM II 1 1 M M 1 1 M M II M 1 1 M MM 

IGDRATISNMTPEFGATAAMFAIDAQTIDYLKLTGRDDAQVKLVETYAKTAGLWAGGLKT 
70 80 90 100 110 120 

130 140 150 160 170 180 

AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 

MMMMIMMMMMMMMMMMMMMMMMMMMMMMMMI 

AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGAVIIAAI 
130 140 150 160 170 180 

190 200 210 220 230 240 

TSCTNTSNPRNWAAALLARNANRLGLKRKPWKSSFAPGSKVAE I YLKE AGLLPEMEKL 

MMMMMMMMMMMMMMMMMMMMMM MMMMMMMI 

TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAGIYLKEAGLLPEMEKL 
190 200 210 220 230 240 

250 260 270 280 290 300 

GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 

II i 1 1 1 1 1 M I II 1 1 1 1 1 1) 1 1 1 1 = 1 1 ! 1 1! I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 ! 1 1 

GFGIVAFACTTCNGMSGALDPKIQQEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFIjASP 
250 260 270 280 290 300 

310 320 330 340 350 360 



m099 .pep 
g099 

m099 . pep 
g099 

m099 .pep 
g099 

m099.pep 
g099 

m099 .pep 
g099 
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m099 pep PLWAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAVVAEYVKPQQFRDVYVP 

I I I I I I I I I I I I I I I I I I I I t I I I I I : I I I I I I I I h I II I I I : I I I I I I I I I I I h h I 
g 099 PLVVAYALAGSIRFDIENDVLGVADGREIRLKDIWPTDEEIDAIVAEYVKPQQFRDIYIP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m099 pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITTDHLSP 

I M h 1 1 II M III I Ml I III 1 1 1 M 1 1 M I M M 1 1 1 1 1 1 II IIIMIIMIhll 

gO 99 MSDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPPAILPDNITTDHISP 

370 380 390 400 410 420 



m099 .pep 
g099 



430 440 450 460 470 480 

SNAII^VSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 

IMIli I MM I MMMIIMI I IMM I II II I II 1 1 1 II II I M II MM M II II 

SNAILAGSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 
430 440 450 460 470 480 



490 500 510 520 530 540 

m099 pep QGSFARVEPEGETMRMWEAI ETYMNRKQPLI 1 1 AGADYGQGSSRDWAAKGVRLAGVEAI V 

I | I : I I || I I h I I II I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I : 
g 0 99 QGSLARVEPEGQTMRMWEAI ETYMNRKQPLI 1 1 AGADYGQGSSRDWAAKGVRLAGVEAI A 

490 500 510 520 530 540 

550 560 570 580 590 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 

Mill II I MM III Ml III III I MM 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 Ml MM III 

g099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDWGERTPRCGLTLVIHRKNG 

550 560 570 580 590 , 600 

610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

I II II II II II II h I I II I II II II II I M II II I II 
g 0 9 9 ETVEVPVTCRPDTAEEALVYEAGGVLQRFAQD FLEGNAAX 

610 620 630 640 



The following 

a099.seq 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 



partial DNA sequence was identified in N. meningitidis <SEQ ID 355>: 



ATGCTGGGAC 
GCTGAACGGC 
CACTGACCGA 
TTCTTCGGCG 
TTCCAACATG 
ATGAGCAAAC 
GTGAAATTGG 
CTTGAAAACC 
TAACGCGCAA 
GCCGATTTGG 
CGGCCAAATG 
CCAATACTTC 
AATGCCAACC 
TGCCCCGGGT 
TGCCCGAAAT 
ACCTGTAACG 
CATCGACCGC 
TTGACGGCCG 
CCGTTGGTCG 
AAACGACGTA 
TTTGGCCTAC 
CCGCAGCAAT 
GCAAAAAGCA 
TCCGCCGCCC 
AGCGGTATGC 



GCGCGTCCAT 
AAACGGAAGG 
GTTTCTGCGC 
AGGGCGCGAG 
ACGCCGGAGT 
CATTGATTAT 
TGGAAACCTA 
GCCGTTTATC 
TATGGCAGGC 
CCGGCAAAGG 
CCTGACGGTG 
CAATCCGCGC 
GCCTCGGCTT 
TCAAAAGTAG 
GGAAAAACTC 
GCATGAGCGG 
GATTTGTACG 
TATCCATCCG 
TTGCCTACGC 
CTCGGCGTTG 
CGATGAAGAA 
TTCGCGACGT 
CCAAGCCCGC 
ACCTTACTGG 
GTCCGCTGGC 



GATGCGCCTG 
CGGGCATTAC 
AAAGAACGCG 
AAGCCTGTCT 
TCGGCGCGAC 
TTGAAACTGA 
CGCCAAAACC 
CGCGCGTTTT 
CCGAGCAACC 
CTTGGCTAAA 
CAGTGATTAT 
AACGTTGTCG 
GCAACGCAAA 
CCGAAATCTA 
GGCTTCGGTA 
CGCGCTGGAT 
CCACCGCCGT 
TATGCGAAAC 
GCTGGCAGGC 
CAGACGGCAA 
ATCGATGCCA 
TTATATCCCG 
TGTACGACTG 
GAAGGCGCAC 
GATTTTGCCC 



CCCGATATTG 
GGCGACGGAT 
TGGTCGGGGC 
ATCGGCGACC 
TGCCGCGATG 
CCGGACGCGA 
GCAGGCTTGT 
GAAATTTGAT 
CGCACGCGCG 
CCTTACGAAG 
TGCCGCGATT 
CCGCCGCGCT 
CCTTGGGTGA 
TTTGAAAGAA 
TCGTTGCCTT 
CCGAAAATCC 
ATTGTCAGGC 
AGGCTTTCCT 
AGCATCCGTT 
AGAAATCCGC 
TCGTTGCCGA 
ATGTTCGACA 
GCGTCCAATG 
TGGCAGGGGA 
GACAACATCA 



TCGGCGTTGA 
ATTGTGTTGG 
GTTTGTCGAA 
GCGCGACCAT 
TTCGCTATTG 
CGACGCGCAG 
GGGCAGATGC 
TTGAGCAGCG 
TTTTGCGACC 
AGCCTTCAGA 
ACTTCCTGTA 
GTTGGCACGC 
AATCTTCGTT 
GCAGATCTGC 
CGCATGTACC 
AGAAAGAAAT 
AACCGCAACT 
CGCTTCGCCT 
TCGATATTGA 
CTGAAAGACA 
ATATGTGAAA 
CCGGCACAGC 
TCTACCTATA 
ACGCACATTA 
CCACCGACCA 



WO 99/57280 



PCT/US99/09346 



308 



1251 TCTCTCGCCA TCCAATGCGA TTTTGGCAAG CAGTGCCGCA GGCGAATATT 

1301 TGGCAAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

1351 CGTGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA AC GAAG AC GG CAGCGTACGC CAAGGTTCGC 

14 51 TGGCACGCGT TGAACCCGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 

1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC AT CAT CAT TG CCGGCGCGGA 

1551 CTACGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 

1601 CCGGCGTGGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC. 

1651 AACTTGATCG GTATGGGCGT GTTGCCGCTG CAGTTCAAAC CGGGTACCAA 

17 01 CCGCCACACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 

17 51 AACGCACACC GCGCTGCGAC CTGACCCTTG TGATTCACCG TAAAAACGGC 

1801 GAGACCGTCG AAGTCCCCAT TACCTGCCGC CTCGATACCG CAGAAGAAGT 

1851 GTTGGTATAT GAAGCCGGTG GCGTATTGCA ACGGTTTGCA CAGGATTTTT 

1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 356; ORF 099.a>: 

a099.pep 

1 MLGRASMMRL PDIVGVELNG KRKAGITATD IVLALTEFLR KERVVGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAGKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NVVAAALLAR 

201 NANRLGLQRK PWVKSSFAPG SKVAEIYLKE ADLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQA FLASP 

301 PLVVAYALAG SI RFDIENDV LGVADGKEIR LKDIWPTDEE IDAIVAEYVK 

351 PQQFRDVYIP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

4 01 SGMRPLAILP DNITTDHLSP SNAILASSAA GEYLAKMGLP EEDFNSYATH 

4 51 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 

501 ETYMNRKQPL I I IAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD VVGERTPRCD LTLVIHRKNG 

601 ETVEVPITCR LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 



m099/a099 97.5% identity in 639 aa overlap 

10 20 30 40 50 60 

m099.pep MLGRASMMRLPDIVGVELNGKRQAGITATDIVLALTEFLRKERVVGAFVEFFGEGARSLS 
I I I I I j I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a099 MLGRASMMRLPDIVGVELNGKRKAGITATDIVLALTEFLRKERVVGAFVE FFGEGARSLS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m09 9.pep IGDRAT I SNMT PE FGATAAMFAI DEQT I DYLKLTGRDDAQVKLVETYAKT AGLWADALKT 

I I I I I I I I I I I I I i II I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I II I I I I I I I I I I I 
a 0 9 9 I G DRAT I SNMT PE FGATAAMFAI DEQT I D YLKLTGRDDAQVKL VET YAKT AGLWADALKT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m099.pep AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I : I I I I I I I I I I I I I I I I I I : i I I I II 
a099 AVYPRVLKFDLSSVTRNMAGPSNPHARFAT ADLAGKGLAK PYEEPSDGQM PDGAVIIAAI 

130 140 150 160 170 180 

190 200 210 220 230 240 

m099.pep TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAEIYLKEAGLLPEMEKL 
I I I I II I I I I I I I I I I I I I I I I ! I I I I : I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
a099 TSCTNTSNPRNVVAAALLARNANRLGLQRKPWVKSSFAPGSKVAEIYLKEADLLPEMEKL 

190 200 210 220 230 240 



250 260 270 280 290 300 

m099.pep GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I 
a099 GFGI VAFACTTCNGMSGALDPKIQKE 1 1 DRDL YATAVLSGNRN FDGRI H P YAKQAFLAS P 

250 260 270 280 290 300 



m099.pep 



310 320 330 340 350 360 

PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAVVAEYVKPQQFRDVYVP 
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I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I : I I I I I I : I | I I | | | | | | | | | | : | 

a099 PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPTDEEIDAIVAEYVKPQQFRDVYIP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m0 9 9.pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITTDHLSP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I i 
a099 MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLSGMRPLAILPDNITTDHLSP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m099.pep SNAILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 
I I I I I I I I ( I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I : | | | | | | I 
a099 SNAILASSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRAT FANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 

490 500 510 520 530 540 

mO 9 9 . pep QGS FARVEPEGETMRMWEAIETYMNRKQPLI I IAGADYGQGSSRDWAAKGVRLAGVEAI V 

I II : I I I I I I I : M I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I | I | | | | | | M I 
a099 QGSLARVEPEGQTMRMWEAIETYMNRKQPLI I IAGADYGQGSSRDWAAKGVRLAGVEAI V 

490 500 510 520 530 540 

550 560 570 580 590 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I | | | | 

a099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 

550 560 570 580 590 600 

610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 0 9 9 ETVEVPITCRLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 357>: 

gl02.seq 

1 AtgtCCGCCA AAactccgtc gctcttcggc ggcgcgatga Ttatcgccgg 

51 gaaggttatc ggcgcAGgta tgttccccaa ccccaccgcc aacttggggg 

101 acgggttaat aggctcgctg attgtgctgc tgtacacctg gtttccattc 

151 tcctccggcg ccctcatgat tttggaagtc aacacccata acCCccgagg 

201 ggcaAGtttt gacaccATGg tcAAagacct gctcgGaCGc ggctggaaca 

251 tcatcaacgg catcgccgtc gctttggTCc tatacggctc gacctacgcg 

301 tacattttag tcggcggtga cctGACCGCC AAAGGCAtcg GCAgCGCAGT 

351 AGGCGGCAAA ATTTCgctca CCGTCGGACA actcgtcttc tTCGGCATCC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTTACCGGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GGTTGCCGAT GCCAAACCGT CCGTCCTCTT CGACACCCAA GCCCCCGTCG 

551 GCACCGGCTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTTTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGcgcCc aaagtGgCGA aATCcatctg gGcaggtaca ttggTTGCCt 

701 tggtaattta cgtccTCTgg caaaccgcca tCcaaagcaa ccTGCcgcgc 

751 aacgagttcg cCCCcgtgat tgccgccgag aggcaactCT CCGTCCTgaa 

801 tgaaacccTG tccaaattcg cccaaaccgg cgatatggat aAaatattgt 

851 ccctatttcc ctacatggca atcgccacct cctttttagg cgTAACctta 

901 ggcctgtttg acaacatcgc cgacatcttc aaatggaacg acagtatgtc 

951 cgggcggggc accaaaaccg tcgcgctgaa cttcctgccg CCCCtgattt 

1001 cctggctgct cctccccacc ggcttcttta ccgccattgg tgcgtccggc 

1051 ctggcggcaa ccgtctggga ccaagGcatc atccccgcca tgctgctcta 

1101 cgtttccccc caaaaaattG gcGcaggcaa gacttataAa gtttaCGGCG 

1151 gcttgtggct gatgttagtc ttccttttcg gcatcgccaa catcgccgca 

1201 CAGGTATTGA GccaAatgGa ACtcgtCccc GTATTTAAAG GATAA 

This corresponds to the amino acid sequence <SEQ ID 358; ORF 102.ng>: 

gl02 .pep 
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1 MSAKTPSLFG GAMI IAGKVI GAGMFPNPTA NLGDGLIGSL IVLLYTWFPF 

51 SSGALMILEV NTHNPRGASF DTMVKDLLGR GWNIING IAV ALVLYGSTYA 

101 YILVGGDLTA KGIGSAVGGK ISLTVGQLVF FGILAFC VWA SARLVDRFTG 

151 V LIGGMVLTF IWATGGLV AD AKPSVLFDTQ APVGTGYWIY AATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKSIW AGT LVALVIYVLW QTAI QSNLPR 

251 NEFAPVIAAE RQLSVLNETL SKFAQTGDMD K ILSLFPYMA IATSFLGV TL 

301 GLFDNIADIF KWNDSMSGRG TKTVALNFLP P LISWLLLPT GFFTAIGA SG 

351 LAATVWDQGI IPAMLLYVSP QKIGAGKTYK VYGGLW LMLV FLFGIANIAA 

4 01 QVLSQMELVP VFKG* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 359>: 

ml02 . aeq 

1 ATGCCCAACA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGGTCATC GGCGCAGGCA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCTATG 

151 CTTTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCATT ATCCGCACGG 

201 CGCAAGTTTC GACACGATGG TCAAAGACCT GCTCGGACGC GGCTGGAACA 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCGAC GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATCC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTCACCGGC 

4 51 GTCCTTATCG GCGGCATGGT ATTGACCTTT ATTTGGGCGG CCGGCGGGCT 

501 GATTGCCGAT GCCAAGCCGT CCGTCCTCTT CGATACCCAA GCCCCCGCCG 

551 GCACAAACTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAAGGCAA CCTGCCGCGC 

751 AACGAGTTCG CCCCCGTCAT CGCCGCCGAA GGGCAAGTCT CCGTCCTCAT 

801 CGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCATCTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCC CTGATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTCGTTACCG CCATCGGCTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TCTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCGTCAACAT CGCCGCACAG 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 



This corresponds to the amino acid sequence <SEQ ID 360; ORF 102>: 

ml02 .pep. . 

1 MPNKTPSLFG GAMIIAGTVI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 

51 LSS GLMILEV NTHYPHGASF DTMVKDLLGR GWNII NGIAV AFVLYLLTYA 

101 YIFVGGDLTA KGLGSAAGGD VSLTVGQLVF FGILAFC VWA SARLVDRFTG 

151 V LIGGMVLTF IWAAGGLI AD AKPSVLFDTQ APAGT NYWIY AATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKS IWTGT LIALVIYVLW QT AIQGNLPR 

251 NEFAPVIAAE GQVSVLIETL SKFAQTG NMD KILSLFSYMA IATS FLGVTL 

301 GLFDYIADIF KWNDSISGRT KTAALTFLPP LISCLL FPTG FV TAIGYVGL 

351 AATVWTGII P AMLLYRSRKK FGAGKTYKV Y GGLWLMVWVF LFGIVN IAAQ 

401 VLSQMELVPV FKG* 

ml02/gl02 86.0% identity in 415 aa overlap 



10 20 30 40 50 60 

ml02.pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
I I I I I I I I I I I 1 I I I I I I I i I : I I 1 I : ! : I I I I ! I I I I I : I : : I I I I j I 
gl02 MSAKTPSLFGGAMIIAGKVIGAGMFPNPTANLGDGLIGSLIVLLYTWFPFSSGALMILEV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 02 . pep NTH YPHGAS FDTMVKDLLGRGWNI INGI AVAFVLYLLTYAYI FVGGDLTAKGLGSAAGGD 

111 1:11111 lltlll II IMMI til 111:111 I I I I I : I I I I I I I I I : I I I : I 1 
gl02* NTHNPRGASFDTMVKDLLGRGWNIINGIAVALVLYGSTYAYILVGGDLTAKGIGSAVGGK 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml02 . pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
: I 1 I I I I I I I I I I t 1 I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I : I I I : I | I | | | | | | | | i 
gl02 ISLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWATGGLVADAKPSVLFDTQ 
130 140 150 160 170 180 



ml02 . pep 



190 200 210 220 230 240 

APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 
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I I : N : I I I I I I I I I I I I I I I I I I I I f 1 I I I I I I I I I I I I I I I I I I I : I I I : I I I I I I I I 
gl02 APVGTGYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWAGTLVALVIYVLW 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 02 .pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I I I I I :| I I I I I I I I I I I I I hill I I I I I I I I I I : I i I I I I I I I I I 1 II I I I I I I I 
g 102 QTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQTGDMDKILSLFPYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 

ml02 . pep GLFDYIADIFKWNDSISGR-TKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWT-GI 
I I I I I I I I I I I I I I : I I I I I I : I I : I I I I M I I I : I I I I I II I I I I I I I I II 
gl02 GLFDNIADIFKWNDSMSGRGTKTVALNFLPPLISWLLLPTGFFTA1GASGLAATVWDQGI 

310 320 330 340 350 360 

360 370 380 • 390 400 410 

ml02 .pep IPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 
I I I I I I I I : I : I I I I I I I I I I I I I I I : I I I I I I : I I I II I I I I I I I I I I I I I I 
g 102 IPAMLLYVSPQKIGAGKTYKVYGGLWLML-VFLFGIANIAAQVLSQMELVPVFKGX 
370 380 390 400 410 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 361 >: 

al02.seq 

1 ATGCCCACCA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGNTCATC GGCGCAGGTA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCCATG 

151 CTCTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCACT ACCCCCACGG 

201 CGCGANCTTC GACACCATGG TTAAAGACCT GCTCGGACGG AGCTGGAACA 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCAAT GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATTC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG ATTCACCAGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GATTGCCGAT GCCAAACTGC CCGTCCTCTT CGACACCCAA GCCCCTACCG 

551 GCACCAACTA CTGGATTTAT GTCGCCACCG CCCTGCCCGT CTGCCTTGCG 

601 TCATTCGGTT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

7 01 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAANGCAA CCTGCCGCGC 

7 51 AACGAGTTCG CCCCCGTGAT TGCCGCCGAA GGGCAAGTCT CCGTCNTGAT 

801 TGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCGTGTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCT NTAATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTTGTTACCG CCATCGGNTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TNTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCNTCAACAT CGCCGCACAN 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 
1202 

This corresponds to the amino acid sequence <SEQ ID 362; ORF 102.a>: 

al02.pep 

1 MPTKTPSLFG GAMIIAGTXI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 

51 LSSGLMILEV NTHYPHGAXF DTMVKDLLGR SWNIINGIAV AFVLYLLTYA 

101 YIFVGGDLTA KGLGSAAGGN VSLTVGQLVF FGILAFCVWA SARLVDRFTS 

151 VLIGGMVLTF IWATGGLIAD AKLPVLFDTQ APTGTNYWIY VATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQXNLPR 

251 NEFAPVIAAE GQVSVXIETL SKFAQTGNMD KILSLFSYMA IATSFLGVTL 

301 GLFDYIADIF KWNDSVSGRT KTAALTFLPP XISCLLFPTG FVTAIGYVGL 

351 AATVWTGIIP AMLLYRSRKK FGAGKTYKVY GGLWLMVWVF LFGIXNIAAX 

4 01 VLSQMELVPV FKG* 



ml02 / al02 



95.9% identity in 413 aa overlap 
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10 20 30 40 50 60 

ml02 .pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVW FTGSLAVLLYTWFSMLSSGLMILEV 

I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I | 
al02 MPTKTPSLFGGAMIIAGTXIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml02 . pep NTHYPHGASFDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 
I I I II I I I I ! I I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I t I I I I I I : 
a 102 NTHYPHGAXFDTMVKDLLGRSWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGN 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml02 . pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I II I I I I I I M : I I I I M I I I I I I I I 
al02 VSLTVGQLVFFGILAFCVWASARLVDRFTSVLIGGMVLTFIWATGGLIADAKLPVLFDTQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml02 . pep APAGTNYWIYAATALPVCLASFG FHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 

I I : I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al02 APTGTNYWIYVATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVI YVLW 

190 200 210 220 230 -24 0 



250 260 270 280 290 300 

ral02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a l02 QTAIQXNLPRNEFAPVIAAEGQVSVXIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml02 .pep GLFDYIADIFKWNDSISGRTKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWTGIIP 
I I I I I I I I I I I i I I I : I I I I I I I I I I I I I I I I I I I 1 I I I I i I I I I I I I II I I I I I I I I I 
al02 GLFDYIADIFKWNDSVSGRTKTAALT FLPPXISCLLFPTGFVTAIGYVGLAATVWTGIIP 

310 320 330 340 350 360 



370 380 390 400 410 

ml 02 . pep AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I 
al02 AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIXNIAAXVLSQMELVPVFKGX 

370 380 390 400 410 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 363>: 

gl05 . seq 

1 Atgtccgcag aaaCATACAc acAAAtcggc tGGgtaggct taggGcaaat 

51 gGgtctgcct atgGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCcgc C AAAGGAG C A 

151 AAAGTTTACG GCagcACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

3 01 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

3 51 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 
401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 
501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 
551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 
601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 
651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 
701 CACTCAAACA CGCTTCCAAA GAcCtTAACC TCGccgtcAA AGAGCTTGAA 
751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 
8 01 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 
851 TGAAATTGGC AGAACACTGA 
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This corresponds to the amino acid sequence <SEQ ID 364; ORF 105.ng>: 

gl05 .pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RAC PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKIoAEH 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 365>: 



ml05 . seq 




1 


ATGTCCGCAA 


51 


GGGTCTGCCT 


101 


TATACAACCG 


151 


AAAGTTTACG 


201 


CCTGATGGTT 


251 


GCGACGGATT 


301 


ACCGAAAaGC 


351 


CGAAGCACCC 


401 


TGATTCTGTT 


451 


TTTCCCTCGT 


501 


TCGGGCGCGA 


551 


TAcAGCGAAs 


601 


ATCGTCGAAG 


651 


CAAAAAATCC 


701 


AACACGCCTC 


751 


GGCAACACCC 


801 


AGTCGAAGCC 


851 


TGGCAGAACA 



This corresponds to the amino acid sequence <SEQ ID 366; ORF 105>: 

ml05 .pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGX IIVNMSTISP 

101 TEKLAVKALV EAQRQFAEAP VSGSVGPATN GTLLILFGGS EPFXTRCKKY 

151 FPSSAKKPSI SAMSAKVRAR NSSXTRSWAF SANVQRXXLM ARQFGIDTDT 

201 IVEAIGDSAM DSPMFQTKKS LWANREFPXX FALKHASKDL NLAVKELEQA 

2 51 GNTLPAVETV AASYRKAVEA GYGTQDVSGV YLKLAEH 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 105 shows 79.9% identity over a 289 aa overlap with a predicted ORF (ORF 105.ng) 
from N. gonorrhoeae: 

ml05/gl05 

10 20 30 40 50 60 

gl05 pep MSAETYTQI GWVGLGQMGL PMVTRLLDGG I EVGVYNRS PDKTAP I S AKGAKVYGS T AE LV 

" ^ | | I : I : I I I I : I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I h I ! I I I 

ml0 5 MS ANEYAQ I GW I GLGQMGL PMVTRLLDGG I EVGVYNRS PDKTAP I SAKGAKVYGNTAELV 

10 20 30 40 50 60 

70 80 90 100 110 120 

q 1 0 5 pep RACPVI FLMVSDYAAVCDI LNGVRDGLAGKI I VNMSTI SPTENLAVKALVEAAGGQFAEA 

I | 1 1 I I I I I 1 1 I I I 1 I I 1 I I 1 I I I I 1 I I I I I I I 1 I I 1 I I = t I I f I I 1 I I Mill 
m i05 RDYPV I FLMVSDYAAVCDI LNGVRDGLAGX 1 1 VNMSTI S PTEKLAVKALVEAQR - QFAEA 

70 80 90 100 110 

130 140 150 160 170 180 

g 1 0 5 pep PVSGSVGPATNGTLLI LFGGSEAVLNPLQKI FSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

III II I II MINI MINIM : Ml Ml == == h 

ml 0 5 PVSGSVGPATNGTLLI LFGGSEPFXTRCKKYFPSSAKKP - S I SAMSAKVRARNSSXTRSW 

120 130 140 150 160 170 
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190 200 210 220 230 240 

ql05 pep IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 

" ' |: :: | I | I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I llllllll 

ml05 AFSANVQRXXLMARQFGIDTDTIVEAI GDSAMDSPMFQTKKSLWANREFPXXFALKHASK 

180 190 200 210 220 230 

250 260 270 280 289 

ql05 pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEH 

' ^ MINI llllllll Mill MIIIIMI II Mill MINI MINI I 

m i05 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGTQDVSGVYLKLAEH 
240 250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 367> : 

al05 . seq 

ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 



GGGTCTGCCT ATGGTAACGC 
TATACAACCG CTCGCCCGAC AAAACTGCCC 
AAAGTTTACG GCAACACCGC CGAACTCGTC 
CCTGATGGTT TCCGACTATG 



CCATCTCCGC CAAAGGCGCA 
CGCGACTATC CCGTCATTTT 
CCGCCGTGTG CGACATCCTG AACGGAGTCC 



TGCCGAAGCA CCCGTTTCCG 
TGCTGATTCT GTTCGGCGGC 



GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 
ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

GATCGGTCGG GCCCGCCACC AACGGCACGC 
AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 
ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 
AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

CTGATGGCGC GGCAGTTCGG 
CGGCGGCTCG GCAATGGACT 
GGGCAAACCG 



AAGCGTACAG CGAAGCGATG 
GACACCATCG TCGAAGCCAT 
C C AAAC C AAA AAATCCCTGT 



CATCGATACC 
CGCCCATGTT 
CGAATTCCCA CCCGCCTTCG 



CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 
CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 
CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 
TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 368; ORP 105.a>: 

al05 .pep 

MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 
KVYGNTAELV RDYPVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 



1 
51 
101 
151 
201 
251 



TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 
QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



a!05/al05 96.5% identity in 289 aa overlap 

10 20 30 40 50 60 

ml05 . pep MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 

I | | I | I : I I I I I I I I I I I I I I I I I i II i I I I I I I I I M I I I I I I I I I i I I I I I I I I I I I I 
a 105 MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 

10 20 30 40 50 60 

70 80 90 100 110 119 

ml05.pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAG-QFAEA 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I II I I I I I I I Mill 
al05 RDYPVI FLMVSDYAAVCDILNGVRDGLAGKI IVNMST I S PTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 



120 130 140 150 160 170 179 

ml05 . pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I j I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I II I I I I I I I I I I I I I I I I I I I I I I I 
al05 PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 



180 



190 200 210 220 230 
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ml 05 . pep IFGDV-QRXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXAFALKHASK 
Ml:: : : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
a l05 IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 



240 250 260 270 280 

ml 05 . pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I I I I I I I i I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al05 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290' 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 369>: 

gl05-l.seq 

1 ATGTCCGCAG AAACATACAC ACAAATCGGC TGGGTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGAGCA 

151 AAAGTTTACG GCAGCACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCTTAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 370; ORF 105-1. ng>: 

gl05-l .pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RACP VIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 I FSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 371>: 

mlOS-l.seq 

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCcG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAnCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGsGACTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCG CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAACTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 372; ORF 105-1>: 

ml05-l.pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 
51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 
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101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 I FSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEXM LMARQFGIDT 

201 DTIVEAIGDS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

ml05-l/gl05-l 96.9% identity in 289 aa overlap 

10 20 30 40 50 60 

ml05-l.pep MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
III: I : I I I I : I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I : I I I I I 
gl05-l MSAETYTQIGWVGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGSTAELV 

10 20 30 40 50 60 . 

70 80 90 100 110 120 

ral05-l .pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I 1 I I I I I I II I I I I I I I I I I I I 
gl05-l RACPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

130 140 150 160 . 170 180 

ml05-l.pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I i I I I I I I I I I 
gl05-l PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml05-l .pep IFGEAYSEXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK 
I I I I I I II I I I I I I I I I I I I I I I I I I I' I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl05-l IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

250 260 270 280 290 

ml05-l .pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I 1 1 M I I I I II I I I I I I I I 
gl05-l DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 373>: 

al05-l . seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GC AACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 374; ORF 105-l.a>: 

al05-l .pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 I FSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



al05~l/ml05-l 



99.0% identity in 289 aa overlap 



10 20 30 40 50 60 

al05-l .pep MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
I I I I I I : I I I I I I I I I I I I I I I I I I I 1 1 I I I I 1 1 I I I I I I I I I 1 1 M I I I I I I I I 1 1 I I i 
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ml05-l MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 30 40 50 60 

70 80 90 100 110 120 

al05-l .pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
I I I f I I I I I I I I I I I I I I I I I I I ! I i I I I I I I I I i I 1 If I I I I I I I I I I I I I I I I I I I I I 
ml05-l RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

al 05-1 .pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKT FHFGDVGKGSGAKLVLNSLLG 
I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I | | 
ml05-l PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKT FHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

190 200 210 220 230 240 

al05-l . pep IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I II I I I I I I I I I I I I I | | | 
ml05-l IFGEAYSEXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

250 260 270 280 290 

al05-l.pep DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I I I I I I I I I I I I 1 I I I I I II I I I I I I I I I I I 11 I I I I I I I I I I 
ml 0 5 - 1 DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 375>: 

gl07 . seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGGTTG CCGATGCCAA 

51 ACCGTCCGTC CTCTTCGACA CCCAAGCCCC CGTCGGCACC GGCTACTGGA 

101 TTTACGCCGC CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTTT CCAGCCTGCT CAAATACTTT AAAGGCGACG cgcCcaaagt 

201 GgCGAaATCc atctggGcag gtacattggT TGCCttggta atttacgtcc 

251 TCTggcaaac cgccatCcaa agcaaccTGC cgcgcaacga gttcgcCCCc 

3 01 gtgattgccg ccgagaggca actCTCCGTC CTgaatgaaa cccTGtccaa 

351 attcgcccaa accggcgata tggataAaat attgtcccta tttccctaca 

401 tggcaatcgc cacctccttt ttaggcgTAA Ccttaggcct gtttgacaac 

451 atcgccggac atcttcaaat ggaacgacag tatgtccggg cggcaccaaa 

501 accgtcgcgc tga 

This corresponds to the amino acid sequence <SEQ ID 376; ORF 107.ng>: 

gl07 .pep 

1 MVLTFIWATG GLVADA KPSV LFDTQAPVGT GYWIYAATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IW AGTLVALV IYVLWQTAI Q SNLPRNEFAP 

101 VIAAERQLSV LNETLSKFAQ TGDMD KILSL FPYMAIATSF LGV TLGLFDN 

151 IAGHLQMERQ YVRAAPKPSR * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 377>: 



m!07 . seq 












1 


ATGGTATTGA 


CCTTTATTTG 


GGCGGCCGGC 


GGGCTGATTG 


CCGATGCCAA 


51 


GCCGTCCGTC 


CTCTTCGATA 


CCCAAGCCCC 


CGCCGGCACA 


AACTACTGGA 


101 


TTTACGCCGs 


CACCGCCCTG 


CCCGTCTGCC 


TCGCTTCCTT 


CGGCTTCCAC 


151 


GGCAACGTCT 


CCAGCCTGCT 


CAAATACTTT 


AAAGGCGACG 


CGCCCAAAGT 


201 


GGCTAAATCC 


ATCTGGACGG 


GCACACTGAT 


TGCGCTGGTA 


ATTTACGTCC 


251 


TCTGGCAAAC 


CGCCATCCAA 


GGCAACCTGC 


CGCGCAACGA 


GTTCGCCCCC 


301 


GTCATCGCCG 


CCGAAGGGCA 


AGTCTCCGTC 


CTCATCGAAA 


CCCTGTCCAA 


351 


ATTCGCCCAA 


ACCGGCAATA 


TGGACAAAAT 


ATTGTCCCTG 


TTTTCCTATA 


401 


TGGCGATCGC 


CACCTCGTTT 


TTAGGCGTAA 


CGCTCGGACT 


CTTCGACTAC 


451 


ATCGCCCATC 


TTCAAATGGA 


ACGACAGCAT 


CTCCGGgCCG 


CACCAAAACC 


501 


GCCGCGCTGA 











This corresponds to the amino acid sequence <SEQ ID 378; ORF 107>: 

ml 07 .pep . . 

1 MVLTFIWAAG GLIADA KPSV LFDTQAPAGT NYWIYAXTAL PVCLASFGFH 
51 GNVSSLLKYF KGDAPKVAKS IW TGTLIALV IYVLWQTAI Q GNLPRNEFAP 
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101 VIAAEGQVSV LIETLSKFAQ TGNMDK ILSL FSYMAIATSF LGV TLGLFDY 
151 IAHLQMERQH LRAAPKPPR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 107 shows 89.4% identity over a 170 aa overlap with a predicted ORF (ORF 107.ng) 
from N. gonorrhoeae: 

ml07/gl07 

10 20 30 40 50 60 

ml 0 7 .pep 
gl07 



ml07 -pep 
gl07 



ml 07 .pep 
gl07 

130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 379>: 

al07 . seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGATTG CCGATGCCAA 

51 ACTGCCCGTC CTCTTCGACA CCCAAGCCCC TACCGGCACC AACTACTGGA 

101 TTTATGTCGC CACCGCCCTG CCCGTCTGCC TTGCGTCATT CGGTTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

201 GGCTAAATCC ATC.TGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

301 GTGATTGCCG CCGAAGGGCA AGTCTCCGTC CTGATTGAAA CCCTGTCCAA 

351 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 

4 01 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

4 51 ATCGCCGACA TCTTCAAATG GAACGACAGC GTGTCCGGCC GCACCAAAAC 

501 CGCCGCGCTG ACCTTCCTGC CGCCTCTAAT TTCCTGCCTG CTCTTCCCCA 

551 CCGGCTTTGT TACCGCCATC GGCTACGTCG GCCTGGCGGC AACCGTCTGG 

601 ACAGGCATCA TCCCCGCCAT GCTGCTCTAC CGTTCGCGCA AAAAATTCGG 

651 CGCAGGCAAA ACCTATAAAG TTTACGGCGG CTTGTGGCTG ATGGTTTGGG 

7 01 TCTTCCTTTT CGGCATCGTC AACATCGCCG CACAGGTATT GAGCCAAATG 

751 GAACTCGTCC CCGTATTTAA AGGATAA 

This corresponds to the amino acid sequence <SEQ ID 380; ORF 107. a>: 
al07 .pep 

1 MVLTFIWATG GLIADAKLPV LFDTQAPTGT NYWIYVATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV IYVLWQTAIQ GNLPRNEFAP 

101 VIAAEGQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY 

151 IADIFKWNDS VSGRTKTAAL TFLPPLISCL LFPTGFVTAI GYVGLAATVW 

201 TGIIPAMLLY RSRKKFGAGK TYKVYGGLWL MVWVFLFGIV NIAAQVLSQM 

251 ELVPVFKG* 



MVLTFIWAAGGLIADAKPSVLFDTQAPAGTNYWIYAXTALPVCLASFGFHGNVSSLLKYF 

lllllllhllhlllllllllllllhlhlllll IIIIIIIIIIMIIIMIMMI 

MVLT F I WATGGLVADAKPS VLFDTQAP VGTGYW I YAAT AL P VCLAS FG FHGNVS S LLKY F 
10 20 30 40 50 60 

70 80 90 100 110 120 

KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 
I I I I I I II I I I I : I I I •* I I I I I I I I I I I I I : I I I I I I I I I I I I | | Ml I I I I I I I I 
KGDAPKVAKS I WAGTLVALVIYVLWQTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

TGNMDKILSLFSYMAIATSFLGVTLGLFDYIA-HLQMERQHLRAAPKPPR 
Ihllllllll IMIMIIIIMIIIII I I I I I I I I I - I I I I I I | 
TGDMDKILSLFPYMAIATSFLGVTLGLFDNIAGHLQMERQYVRAAPKPSR 



m!07/al07 



94.8% identity in 154 aa overlap 



10 20 30 40 50 60 

ml07 .pep MVLT FIWAAGGLIADAKPSVLFDTQAPAGTNYWIYAXTALPVCLAS FG FHGNVS SLLKYF 

I I I I t I I I : I I I t I I I I I I I I I I I I : I I I I I II : I I f I I I M I I I I I f I I I I I I I I | 
a 1 0 7 MVLT FI WATGGLI ADAKLPVLFDTQAPTGTNYWI YVATALPVCLAS FG FHGNVS SLLKY F 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml07 pep KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

I I | I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

al0 7 KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

ml07 . pep TGNMDKILSLFSYMAIATSFLGVTLGLFDYIAHLQMERQHLRAAPKPPRX 

I | | I I I I I I I I I I I I I I I I I I II II I I II I I I : 

a 107 TGNMDKILSLFSYMAIATSFLGVTLGLFDYIADIFKWNDSVSGRTKTAALTFLPPLISCL 

130 140 150 160 170 180 



al07 



LFPTGFVTAIGYVGLAATVWTGIIPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIV 
190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 38 1>: 

gl08 . seq 

1 ATGttgccgg gCTTCAACCG GATATTCAaa cggTTTGCTC CAACACTCGG 

51 AAcggCGCAT AAAACGCCgc CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATCCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGCG GCAATCTTAC TCGGCGGGTG 

201 CGCCGCCGGC GGCAACACAT TCGGCAGCTT AGACGGCGGC ACGGGTATGG 

251 GTGGCAGCAT CGTCAAAATG ACGGTAGAAA gccAATGCCG TGCGGAATTG 

3 01 GACAGGCGCA GCGAATGGCG TTTGACCGCG CTGGCGATGA GTGCCGAAAA 

3 51 ACAGGCGGAA TGGGAAAACA AGATTTGCGG CTGCGCTACC GAAGAAGCAC 
401 CTAACCAGCT GACCGGCAAC GATGTGATGC AGATGCTGAa ccagtCCacG 
451 CGCaatcagg cacTtgccgc CCtgaccgTC AAAacggtTT CcgcctgcTT 
501 CAaacgcctg tACCGCTAa 

This corresponds to the amino acid sequence <SEQ ID 382; ORF 108.ng>: 

glOS.pep 

1 MLPGFNRIFK RFAPTLGTAH KTPPFALSRT GRLIRSYRHK RRGFNRKGIE 

51 MNKT LSILPA AILLGGCAAG GN TFGSLDGG TGMGGSIVKM TVESQCRAEL 

101 DRRSEWRLTA LAMSAEKQAE WENKICGCAT EEAPNQLTGN DVMQMLNQST 

151 RNQALAALTV KTVSACFKRL YR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 383>: 

ml08 . seq 

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

2 01 CGCCGCCGGA GGCGGTAACA CATTCGGCAG CTTAGACGGT GGCACAGGCA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTTG GGAGCCAATG CCGTGCGGAA 

301 TTGGACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAGTGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

4 01 CACCCGAACG GATGACCGGC AACGATGTGA TGCAGATGCT GGCTCCGTCC 
451 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 
501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 384; ORF 108>: 

ml08 .pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVGSQCRAE 

101 LDKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPERMTG NDVMQMLAPS 

151 TRNQALAALT AKTVSACFKH LYR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 108 shows 89.6% identity over a 173 aa overlap with a predicted ORE (ORF 108.ng) 
from N. gonorrhoeae: 

ml08/gl08 
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10 20 30 40 50 60 

ml08 . pep MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRL I RFYRHKRRGFNRKGI EMNKTLS I LPV 

I I I I I I I I I I I h I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I : 
gl08 MLPGFNRI FKRFAPTLGTAHKTPPFALSRTGRLIRSYRHKRRGFNRKGI EMNKTLS I LPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 08 . pep AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 

II I I I I I I I I I I I I I I I I I I I I I I I I I M h I I I I I I I I I : I I I I I I I I I I I I I I I I I 
gl08 AILLGGCAAGG-NTFGSLDGGTGMGGSIVKMTVESQCRAELDRRSEWRLTALAMSAEKQA 

70 80 90 100 110 

130 140 150 160 170 

ml 0 8 pep EWENKI CACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVS ACFKHLYRX 

I I I I I I I : I : = : I I h = : I I I I I I I I I I I I I I I I I I I I : I I I I II I I : I I I 
g 1 0 8 EWENKI CGCATEEAPNQLTGNDVMQMLNQSTRNQALAALTVKTVS ACFKRLYRX 

120 130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 385>: 

al08.seq 



1 ATGTTGCCGG GCTTCAACCG 

51 AACGGCGCAT AAAACGCCGC 

101 TCAGATTCTA TCGCCATAAA 

151 AT.GAATAAAA 

201 CGCCGCCGGG 

251 TGGGCGGCAG 

301 TTGAACAAAC 

351 AAAACAGGCG 

4 01 CACCCAACCA 

4 51 ACGCGCAATC 

501 CTTCAAACAC 



GAT ATT C AAA CGGTTTGTTC CAACACTCGG 
CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 
AGGCGGGGTT TCAACCGAAA AGGAATTGAG 
CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 
GGCGGTAACA CATTCGGCAG CTTAGACGGC GGCACAGGTA 
CATCGTCAAA ATGGCGGTAG AAAGCCAATG CCGTGCGGAA 
GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 
ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 
AACGATGTGA TGCAGATGCT GGATCCGTCC 
CGCCCTGACC GCCAAAACGG TTTCCGCCTG 
AA 



GAATGGGAAA 
GCTGACCGGC 
AGGCACTTGC 
CTGTACCGCT 

This corresponds to the amino acid sequence <SEQ ID 386; ORF 108.a>: 

al08.pep 

1 MLPGFNRI FK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVESQCRAE 

101 LNKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPNQLTG NDVMQMLDPS 

151 TRNQALAALT AKTVSACFKH LYR* 



ml08/al08 



96.5% identity in 173 aa overlap 



10 20 30 40 50 60 

ml08 . pep MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRL I RFYRHKRRGFNRKGI EMNKTLS I LPV 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I 
al08 MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSI LPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml08 . pep AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 
I I I i I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I 1 I I I I 
al08 AILLGGCAAGGGNTFGSLDGGTGMGGS I VKMAVESQCRAE LNKRSEWRLT ALAMSAEKQA 

70 80 90 100 110 120 



130 140 150 160 170 

ml08 . pep EWENK I CACVAQEAPERMTGN DVMQMLAPSTRNQALAALTAKTVS ACFKHLYRX 

I I I I I I I I I I I I I I I : :: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al08 EWENKI CACVAQEAPNQLTGNDVMQMLDPSTRNQALAALTAKTVSACFKHLYRX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 387>: 
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gl09 . seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GATCGTAGGC GTATGCTTAC CGCTTTTGGA AGCGGGCATG 

101 GAAATGACGC GCAAAGGCAA AACCACCCAA TCCGCCGCCA TCGTGGTGTT 

151 CTCTTCCGTC TGGTCAATCC GGTTTTCGGC TGGGCGTTGA CGATGCTGTT 

201 GGATAATTTG GGCTTAATCG GCTGCAAAGA ACGCAGCGCG CAATTAGGTT 

2 51 TTGTCGGACG AGTATTGATA CCCGCAGTAG GTTTCTTAAT CTTGTGTGTG 

3 01 GCGATGGGTG CGGTCGGGAT GCTGCCCGGT ATCCCTCCGT TTTTGGAGCA 
3 51 GTTCAAATCT TTGGGCTAG 

This corresponds to the amino acid sequence <SEQ ID 388; ORF 109.ng>: 

gl09 -pep 

1 MYYRRWGLS DGLGDLAAGI DRRRMLTAFG SGHGNDAQRQ NHPIRRHRGV 

51 LFRLVNPVFG WALTMLLDNL GLIGCKERSA QLGFVGR VLI PAVGFLILCV 

101 AMGAVGMLPG IPPFLEQFKS LG 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 389>: 

ml09 . seq 

1 ATGTATTATC GCCGGGTTAT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CATCGTGGTG TTCTCTTCCG CCTTGTCAAT CCGGTTTTCG GCTGGGCGTT 

201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGTG 

251 CGCAATTAGG TTTCGCCGGA CGCGTGTTGA TACCCGCAGT AGGTTTCTTG 

3 01 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAA CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 109>: 

ml09.pep 

1 MYYRRVMGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFAG R VLIPAVGFL 

101 I LCVAMGA VG MLPGIPPFLE HFKSLG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 109 shows 92.9% identity over a 126 aa overlap with a predicted ORF (ORF 109.ng) 
from N. gonorrhoeae: 

ml09/gl09 

10 20 30 40 50 60 

MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

Illllhllllllllilllhl IhlllillM IMM II I II M I II I Ml III! 

MYYRRWGLS DGLGDLAAG I DR RRMLTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

10 20 30 40 50 

70 80 90 100 110 120 

PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLI PAVGFLILCVAMGAVGMLPGI PPFLE 

| II MM I MM I M 111 II ! II. Ilihlllllli II I IN Ihlllll i li illl II I 

PVFGWALTMLLDNLGL I GCKERSAQLGFVGRVLI PAVGFLILCVAMGAVGMLPGI PPFLE 
60 70 80 90 100 110 



HFKSLGX 

MINI 

QFKSLGX 
120 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 39 1> : 

al09. seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CACCGTGGTG TTCTCTTCCG CTTGGTCAAT CCGGTTTTCG GCTGGGCGTT 



ml09 .pep 
gl09 

ml 09 .pep 
gl09 

ml09 .pep 
gl09 



WO 99/57280 



PCT/US99/09346 



322 



201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGCG 

251 CGCAATTAGG TTTCACCGGA CGCGTATTGA TACCCGTAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAG CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 392; ORF 109>: 

al09 .pep 

1 MYYRRVVGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFTG R VLIPWGFL 

101 ILCVAMGA VG MLPGI.PPFLE HFKSLG* . 

ml09/al09 97.6% identity in 126 aa overlap 

10 20 30 40 50 60 

mlO 9 . pep MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | I | | 
al09 MYYRRVVGLS DGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNH PI RRHRGVLFRLVN 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 09. pep PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLIPAVGFLILCVAMGAVGMLPGIPPFLE 
I I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I I : I I I I I I I I I I I I I I I I I I I I I | | | 
a!09 PVFGWALTMLLDNLGLIGCKERSAQLGFTGRVLIPVVGFLILCVAMGAVGMLPGIPPFLE 

70 80 90 100 110 120 

ml 09. pep HFKSLGX 
I I I I I I I 

al09 HFKSLGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 393>: 

gill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGaacaaacC GCGCAaaccg 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCtATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

2 01 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGtccaCC TACCAGACCG 
251 ATTCCGAAAT CAGCCGGTTt atacagacan atgctggaga gctcttcgcg 

3 01 tntcatgcag nttctataac tgattccgcc gaagactgtc tgcctaatac 
351 gcctatctca tcggcgctct ga 

This corresponds to the amino acid sequence <SEQ ID 394; ORF 1 1 l.ng>: 

gill .pep 

1 MPSETRLPNL IRALIFALGF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF IQTAGELFAH 

101 ASITDSAEDC LPNTPISSAL * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 395>: 

mill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATAyCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AwAAACGCAT 

2 01 CGATGACGCG CTTAAAGAAk TCAACCGGyA GATGTCCACC TATCAGCCCG 
251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 
301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

3 51 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

5 01 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
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601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

8 01 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

10 01 ATAAAGG CGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGcTC 

10 51 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 396; ORF 1 1 1>: 

mill .pep 

1 MPSETRLPNF IRVLIFALGF IFLNA CSEQT AQTVTLQGET MGTTYXVKYL 

51 SNNRDKLPSP AEIXKRIDDA LKEXNRXMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQTIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISWADSAM 

3 01 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



ORF 1 1 1 shows 88.7% identity over a 97 aa overlap with a predicted ORF (ORF 1 1 1 .ng) 
from N. gonorrhoeae: 

mill .pep/gill .pep 

10 20 30 40 50 60 

mill .pep MPSETRLPNFIRVLI FALGFI FLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 

I II M 1 1! hi hi II 1 1 1 1 1 II 1 1 illMIM ! I Ml I II 1 1 1 1 H 1 1 1 ! I Ml 1 1 III 

gill MPSETRLPNLI RAL I FALGFI FLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPS P 

10 20 30 40 50 60 

70 80 90 100 110 120 

mill . pep AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

hi I 1 I I I I I ! I I I MMI I I I I I I I I :||: 
gill AKIQKRIDDALKEVNRQMSTYQTDSEISRFIQTXAGELFAXHAXSITDSAEDCLPNTPIS 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill, pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

gill SALX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 397>: 

alll . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 

701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
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801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 398; ORF 1 1 l.a>: 

alll .pep 

1 MPSETRLPNF IRTLIFALSF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GAL DVT VG PL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

mlll/alll 97.7% identity in 351 aa overlap 

10 20 30 40 50 60 

mill .pep MPSETRLPNFIRVLI FALGFI FLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPS P 

I I I I I I I I I I I I : I I I I I : I I I I i I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I i 
alll MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

mill, pep AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
III I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I : I I I | | | 
alll AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

I I I I I I II I I I I I I I I I I M I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
alll GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

mill .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I | | | 
alll AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 1 1 . pep GGNTQI IVPLNNRSLATSGDYRI FHVDKNGKRLSHI INPNNKRP I SHNLAS I S WADSAM 

I I I I I I I I I I I I I I I I I I M I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 

alll GGNTQI IVPLNNRSLATSGDYRI FHVDKSGKRLSHI INPNNKRPISHNLAS I SWADSAM 

250 260 270 280 290 300 

310 320 330 340 350 

ml 11. pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I 
alll TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 399>: 

glll-l. seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAacCG 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGTC.CACC TACCAGACCG 
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251 ATTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ATTTCGCACA CGTTACCGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACTCAC GGCGCACTGG ACGTAACCGT CGGCCCTTTG GTCAACCTTT 

401 GGGGGTTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGCAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAA GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCggcGAGTT 

651 GCACGGCAAA GGCAAAAATG CGCACGGCGA ACCGTGGCGC ATCGGTATAG 

701 AGCAACCCAA TATcatccaa ggcggcaata cgcAGattat cgtcccgctg 

751 aaCaaccgtt cgcttgccac ttccggcgAT taccgtaTTT tccacgtcgA 

801 TAAAAACGGC Aaacgccttt cccacATCAT CAATCCCAAC AACAAACGAC 

851 CCATCAGcCA CAAcctcgcc tcCATCAgCg TGGTCTCAGA CAGTGCAATG 

901 ACGGCGGACG GTTTATCCAC AGGATTATTT GTTTTAGGCG AAACCGAAGC 

951 CTTAAGGCTG GCAGAACAAG AAAAACTCGC TGTTTTCCTA ATTGTCCGGG 

1001 ATAAGGACGG CTACCGCACC GCCATGTCTT CCGAATTTGC CAAGCTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 400; ORF 1 1 l-l.ng>: 

glll-l .pep 

1 MPSETRLPNL IRALI FALGF I FLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILQQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNAHGEPWR IGIEQPNIIQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVSDSAM 

301 TADGLSTGLF VLGETEALRL AEQEKLAVFL IVRDKDGYRT AMSSEFAKLL 

351 R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 40 1>: 

mlll-1 . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAACGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 402; ORF 1 1 1-1>: 

mil 1-1 .pep 

1 MPSETRLPNF IRVLIFALGF I FLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

mlll-l/glll-1 96.6% identity in 351 aa overlap 



mill 



gill 



1 



l.pep 



10 20 30 40 50 60 

MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

I I I I I I I I I : I I : I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MPSETRLPNLIRALIFALGFI FLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
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10 20 30 40 50 60 

70 80 90 100 110 120 

mill- 1 .pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
I : I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I 1 I I | | I | | 
glll-1 AKIQKRIDDALKEVNRQMSTYQTDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
70 80 90 100 110 120 

.130 140 150 160 170 180 

ml 1 1 - 1 . pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAAS YTGI DKI ILKQGKDYASLSKTHPK 
I I I I I I I I I I t I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I 
glll-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILQQGKDYASLSKTHPK 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 1- 1 . pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNI VQ 
I I I I I I I I I I I I 1 I I I I I I I II I I I I I I II I I! I I I I I I I I I I i : I I I I I I I I I I I I I : I 
glll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNAHGEPWRIGIEQPNIIQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

mlll-1 .pep GGNTQI IVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAM 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I : I I I | 
glll-1 GGNTQI I VPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVSDSAM 

250 260 270 280 290 300 

310 320 330 340 350 

ml 1 1 - 1 . pep TADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I II I I I I I I I : I II : I I I I I II I I I I I I I I I I I I I I I I I I I I 
glll-1 TADGLSTGLFVLGETEALRLAEQEKLAVFLIVRDKDGYRTAMSSEFAKLLRX 

310 320 330 340 350 

glll-l/p44550 

sp | P4 4 550 | YOJL_HAEIN HYPOTHETICAL LIPOPROTEIN HI0172 PRECURSOR >gi 1 1074292 | pir | | C64 144 
hypothetical protein HI0172 - Haemophilus influenzae (strain Rd KW20) >gi | 1573128 (U32702) 
lipoprotein, putative [Haemophilus influenzae Rd] Length = 346 
Score = 349 bits (885), Expect - 2e-95 

Identities = 177/328 (53%), Positives = 240/328 (72%), Gaps - 4/328 (1%) 



Query : 


23 


LNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSPAEIXKRIDDALKEXNRXMSTYQ 


82 






L AC ++T + ++L G+TMGTTY VKYL + S + + 1+ LK+ N MSTY + 




Sbjct : 


17 


LAACQKET-KVISLSGKTMGTTYHVKYLDDGSITATS-EKTHEEIEAILKDVNAKMSTYK 


74 


Query : 


83 


PDSEISRFNQHT-AGKPLRISSDFAHVTAEAVRLNRLTHGALDVTVGPLVNLWGFGPDKS 


141 






DSE+SRFNQ+T P+ IS+DFA V AEA+RLN++T GALDVTVGP+VNLWGFGP+K 




Sbjct : 


75 


KDSELSRFNQNTQVNTPIEISADFAKVLAEAIRLNKVTEGALDVTVGPVVNLWGFGPEKR 


134 


Query : 


142 


VTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPKAYLDLSSIAKGFGVDKVAGEL 


201 






++P+PEQ+ + ++ GIDKI L K+ A+LSK P+ Y+DLSSIAKGFGVD+VA +L 




Sbjct: 


135 


PEKQPTPEQLAERQAWVGIDKITLDTNKEKATLSKALPQVYVDLSSIAKGFGVDQVAEKL 


194 


Query : 


202 


EKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQGGNTQIIVPLNNRSLATSGDY 


261 






E+ QNY+VEIGGE+ KGKN G+PW+I IE+P + ++ LNN +A+SGDY 




Sbjct: 


195 


EQLNAQNYMVEIGGEIRAKGKNIEGKPWQIAIEKPTTTGERAVEAVIGLNNMGMASSGDY 


254 


Query: 


262 


RIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAMTADGLSTGLFVLGETEALKLA 


321 






RI + ++NGKR +H I+P PI H+LASI+V+A ++MTADGLSTGLFVLGE +AL++A 




Sbjct: 


255 


RIY-FEENGKRFAHEIDPKTGYPIQHHLASITVLAPTSMTADGLSTGLFVLGEDKALEVA 


313 


Query: 


322 


EREKLAVFLIVRDKGGYRTAMSSEFEKL 34 9 








E+ LAV+LI+R G+ T SS F+KL 




Sbjct: 


314 


EKNNLAVYLIIRTDNGFVTKSSSAFKKL 341 





The following partial DNA sequence was identified in N. meningitidis <SEQ ID 403>: 

alll-l . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

2 51 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 
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351 


CCTGACACAC 


GGCGCGCTGG 


401 


GGGGATTCGG 


CCCCGACAAA 


451 


ATCAAACAAG 


CAGCATCTTA 


501 


AGGCAAAGAT 


TACGCTTCCT 


551 


ATTTATCTTC 


GATTGCCAAA 


601 


CTGGAAAAAT 


ACGGCATTCA 


651 


GCACGGCAAA 


GGCAAAAACG 


701 


AACAGCCCAA 


CATCGTCCAA 


751 


AACAACCGTT 


CGCTTGCCAC 


801 


TAAAAGCGGC 


AAACGCCTCT 


851 


CCATCAGCCA 


CAACCTCGCC 


901 


ACGGCGGACG 


GCTTGTCCAC 


951 


CTTAAAGCTG 


GCAGAGCGCG 


1001 


ATAAAGGCGG 


CTACCGCACC 


1051 


CGCTAA 





ACGTAACCGT CGGCCCCTTG GTCAACCTTT 
TCCGTTACCC GTGAACCGTC GCCGGAACAA 
TACGGGCATA GACAAAATCA TTTTGAAACA 
TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
AAATTATCTG GTCGAAATCG GCGGCGAGTT 
CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 
GGCGGCAATA CGCAGATTAT CGTCCCGCTG 
TTCCGGCGAT TACCGTATTT TCCACGTCGA 
CCCATATCAT TAATCCGAAC AACAAACGAC 
TCCATCAGCG TGGTCGCAGA CAGTGCGATG 
AGGATTATTC GTATTGGGCG AAACCGAAGC 
AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 
GCCATGTCTT CCGAATTTGA AAAACTGCTC 



This corresponds to the amino acid sequence <SEQ ID 404; ORF 1 ll-l.a>: 

alll-l .pep 

1 MPSETRLPNF IRTLIFALSF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



alll-l/mlll-1 98.9% identity in 351 aa overlap 



10 20 30 40 50 60 

all 1-1 .pep MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
I I ! I I I I I I I I I : I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
mil 1-1 MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
10 20 30 40 50 60 



70 80 90 100 110 120 

alll-l .pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 
I I I I I I ! I I I I I I I I I I I I I ! i I I I 1 I I I I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I 
ml 11-1 AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

70 80 90 100 110 120 



130 140 150 160 170 180 

alll-l .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
mil 1-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

130 140 150 160 170 180 



190 200 210 220 230 240 

alll-l .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I ! I I I I I I I I I 
mlll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 



250 260 270 280 290 300 

alll-l. pep GGNTQI I VPLNNRSLATSGDYRI FHVDKSGKRLSHI INPNNKRPISHNLASI SVVADSAM 
I I I I ! I I I I I I I I I I I I I I I I I I I I I I I : I I I I I i I I I I! I I I II I I I I I I I I I I I I I I I 
mlll-1 GGNTQI IVPLNNRSLATSGDYRIFHVDKNGKRLSHI INPNNKRPISHNLASI SVVADSAM 

250 260 270 280 290 300 



310 320 330 340 350 

alll-l. pep TADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGGYRTAMSSEFEKLLRX 
I II I I I I I I I ! II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 1 1 - 1 TADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 405>: 

gll4 . seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCAGCAGG AATGCAGCAA 
51 GACTTTTTTA TGTCCGCCGG GCGGGACGAG TATGGGGCGG TCAATGTCGG 
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101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTCGAA 

151 TACGGTCAAA GCGGCTATTT TACCAGAGCC GCCGAATGTA AAACAGGGTG 

201 TCAGGGCATC AGCCCGAGCT GCCTGAACGA ACGGACGGTT TGCGAGGTAA 

2 51 CGATAAAATG GTCGAGCAGC GAAACATCAA CCAGCGACAT GGCCTGTGCC 

3 01 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAAGGTTCAG GCGAGCCGCC 

3 51 CGGATGGTTG TGCGCGATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

4 01 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 406; ORF 1 14.ng>: 

gll4 . pep 

1 MASITSPLHG AQQECSKTFL CPPGGTSMGR SM SVTVGLFC VSINLTISV E 

51 YGQSGYFTRA AECKTGCQGI SPSCLNERTV CEVTIKWSSS ETSTSDMACA 

101 SRLVNMMSSC EGSGEPPGWL CAIIRLSAYS SNASLTISRM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 407>: 

mll4.seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCACAGAG AATGCAGCAA 

51 GACTTTTTTA TGTCCACCGG GCGGGACGAG TATAGGGCGG TCAATGTCGG 

101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTTGAA 

151 TACGGTTGAA GCGGCTATTT TATCAGAGCC GCCGCATGTA AAACAGAGTG 

2 01 TCAGGGCATC AACCCGAGCT GTCTGAACGA ACAGACGCTT TGCGAkGTAA 
251 CGATAAAATG GTCGAGCAGC GACACATCGA CCAGCGACAT TGCCTGTGCC 

3 01 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAArGTTCsG GCGAGCCGcC 
3 51 CGgATGGTTG TGCGCAATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 
401 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 408; ORF 1 14>: 

ml 14. pep 

1 MASITSPLHG AHRECSKTFL CPPGGTSIGR SM SVTVGLFC VSINLTISV E 

51 YGXSGYFIRA AACKTECQGI NPSCLNEQTL CXVTIKWSSS DTSTSDIACA 

101 SRLVNMMSSC EXSGEPPGWL CAIIRLSAYS SNASLTISRM * 

).0% identity over a 140 aa overlap 

10 20 30 40 50 60 

MAS I TSPLHGAHRECSKTFLCPPGGTS I GRSMSVTVGLFCVSINLT I SVE YGXSGYFIRA 

I M 1 1 1 !l II !-l llllllllll llhi MM III I IN I M II 1 1 1 MM MM 1 1 

MAS I TSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVS I NLT I SVE YGQSGYFTRA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGEPPGWL 

1 Ml Ml MM Ml MM! 1 1 M M M M M M M M M M M I M M 1 1 ! I M I f 

AECKTGCQGI S PS CLNERTVCEVTI KWS S S ETSTSDMACASRLVNMMS S CEGSGE P PGWL 
70 80 90 100 110 120 

130 140 
CAIIRLSAYSSNASLTISRMX 

Ml III II II I IMIIIMM 

CAIIRLSAYSSNASLTISRMX 
130 140 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 409>: 

all4 . seq 

1 ATGCCGGAGG CAAGCATCGC CTCCATCACT TCGCCGCTGC ACGGGGCGCA 

51 ACAGGAATGC AGCAAGACTT TTTTATGTCC GCCGGGCGGG ACGAGTATGG 

101 GGCGGTCAAT GTCGGTAACG GTAGGTTTGT TTTGTGTTTC CATTAACTTA 

151 ACGATATCTG TCGAATACGG TTGAAGCGGC TATTTTATCA GAGCCGCCGC 

201 ATGTAAAACA GGGTGTCAGG GCATCAGCCC GAGCTGCCTG AACGAACGGA 

251 CGGTTTGCGC CGTTACGATA AAATGGTCGA GCAGCGACAC ATCGACCAGC 

301 GACATTGCCT GTGCCAGCCG. CCTTGTGAAC ATGATGTCTT CCTGCGAAGG 

351 TTCGGGCGAG CCGCCCGGAT GGTTGTGCGC GATAATCAGG CTGTCGGCAT 

4 01 ATTCGTCCAA TGCCAGTTTG ACAATTTCAC GGATGTAA 



mll4/gll4 
mll4 .pep 
gll4 

ml 14 .pep 
gll4 

ml 14 .pep 
gll4 
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This corresponds to the amino acid sequence <SEQ ID 410; ORF 1 14.a>: 

all4 .pep 

1 MPEASIASIT SPLHGAQQEC SKTFLCPPGG TSMGRSM SVT VGLFCVSINL 

51 TISV EYG*SG YFIRAAACKT GCQGISPSCL NERTVCAVTI KWSSSDTSTS 

101 DIACASRLVN MMSSCEGSGE PPGWLCAIIR LSAYSSNASL TISRM* 

mll4/all4 92.9% identity in 140 aa overlap 

10 20 30 40 50 

ml 14 .pep MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSG 
: I I I I I I I I I I : : I I I I I I I I I I f I I I : I I I I I I I I I t I I I I I I I I I I I I I I I I I 
all4 MPEASIASITSPLHGAQQECSKT FLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGXSG 

10 20 30 40 50 60 

60 70 80 90 100 110 

ml 14. pep YFIRAAACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGE 
I I I I I I I I I I I I I I : I I I I I I : I : I I I I I I I I I I I I I I I I I I I i I I I I I I 1 1 I I I I | 
all 4 YFIRAAACKTGCQGI S PSCLNERTVCAVTIKWSS SDTSTS DI ACASRLVNMMSSCEGSGE 

70 80 90 100 110 120 

120 130 140 

mil 4 .pep PPGWLCAIIRLSAYSSNASLTISRMX 
I I I I I I I I I I I I I I I I I I I I I I I I II 
a 1 1 4 PPGWLCAI IRLSAYS SNAS LT I SRMX 

130 140 



The following partial DNA sequence 

gll7 . seq 



1 


atggtcgacg 


aactcgacCT 


51 


TGCCGACATC 


GGACGCTACG 


101 


GCTGCAACAG 


CACCGTCGCC 


151 


AAACTTACCC 


ACTTCGCCCG 


201 


CGCACAGCAA 


GCGGAAACCA 


251 


Acatccgcgt 


cgtaTTAATC 


301 


ttTTtaaGCA 


ACGCCCCCGA 


351 


aacccTCGAC 


ATCTTCGCCC 


401 


TCAAATGGCA 


GCTCGAAGAT 


451 


TACCGCGAAA 


TCGCCCTGCT 


501 


ATACATCGAA 


AACTTCCTCG 


551 


ATATCCACTT 


TGAAGTCGCC 


601 


AAAAAAATGG 


TGAAGAAAAA 


651 


CGCCGTGCGG 


ATTCTGGTCG 


701 


gcaTCGTCCA 


CAGCCTCTGG 


751 


ATCGCCAACC 


CCAAAGgcaA 


801 


cggcccGGAa 


gacaaaggtg 


851 


accAATTCaa 


CgaatTcggT 


901 


ggcaaaggcg 


attccGCCtA 


951 


CTTGGACTGG 


CGCGAAAATA 


1001 


CCGCCTTCAA 


AACCGAGCTT 


1051 


CACGGCAAAG 


TCCTCTCTCT 


1101 


CTACGCCCTG 


CACAGCAGCA 


1151 


AaggGCAGAT 


TGTGCCGCTG 


1201 


GAAATcatta 


cCGCcaaAGA 


1251 


AGGctgGGtc 


aAATCCGGCA 


1301 


GCCAGcaaAa 


cgCcgaCACC 


1351 


AAGCAGCTTG 


CCAAACTCAC 


1401 


aaATCTCGGC 


tacaaAAAGC 


1451 


gcgaaatttc 


caaccgcgcc 


1501 


ccgccccCCG 


TGCCCGTCAG 



; identified in N. gonorrhoeae <SEQ ID 41 1>: 

GCTGCCCGAT GCCGTCGCCG CCACCCTGCT 
TCCCCGATTG GAACCTATTG GTTTCCGAGC 
GAGCTGGTCA AAGGTGtgga CGAAGTGCAG 
GGTGGACAGC CTCGCCACGC CGGAAGAACG 
TGCGGAAAAT GCTGCTGGCg atggttaccg 
AAACTGGCGA TGCGTacgcg caccCTGcta 
CAGCCCTGAA AAACgcgccG TCgccaaAga 
CGCTCGCCAA CCGCTTGGGC GTGTGGCAGC 
TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 
TTTGGACGAA AAACGCACCG AACGCCTCGA 
ATATCCTGCG TACGGAACTC AAAAAATACA 
GGCCGTCCGA AACACATCTA CTCCATTTAC 
ACTCAGCTTC GACGgccTGT TCGACATCCG 
ATACCGTCCC CGaGTGTTAC ACCACGCTGG 
CAGCCCATTC CCGGCGagtt CGAcgactAC 
CGgttATAAA AGtTTGCACA CCGTCATCGT 
tggaaGtgCA AATCCGCACC TTCGAtatGC 
gtcgccgCCC ACTGGCGtta caaagaaggc 
cgaacaaAAA ATcgccTggt TGCgccaACT 
TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 
TTCAACGACA CGATTTATGT TTTGACCCCG 
GCCAACGGGC GCAACCCCCA TCGACTTCGC 
TcggCGACCG CTGCCGGGGC GCGAAAGTCG 
TCCACCCCGC TCGAAAACGG ACAGCGCGTC 
AGGGCATCCT TCCGTCAACT GGCTTTACGA 
AGGCCATCGG caaAATCCGC GCCTAcatCC 
GTGCGCGAAG AAGGCCGTGT CCAACTCGAC 
GCCCAAACCC AACCTGCAAG AGCTTgccga 
cagaagacct ctacacCGCc gtcggacaag 
atCcaaaaag cctgcggcac GCTgaacgaa 
CGCAACCACC ATCGTCAAAC AGTCCAAAAT 
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1551 CAAAAAAGGT GGCAAAACCG GCGTGCTCAT CGACGGCGAA GACGGCTTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGATATTGCC 

16 51 GGCTTCGTTA CCCGCGAGCG CGGCATTTCC GTCCACCGCA AAACCTGCCC 

1701 CTCTTTCCGA CACCTTGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

1751 GTTGGGCGGC GTTGCAGGAA GGGCAAGTGT TCGCCGTCGA TATCGAAATC 

18 01 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

18 51 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGtCAA CGacCTCCCG 

1951 CGCGTCCTCG CCGGCCTCGG CGATGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 412; ORF 1 17.ng>: 

gll7.pep 

1 MVDELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRK MLLA MVTDIRWLI KLAM RTRTLL 

101 FLSNAPDSPE KRAVAKETLD I FAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLDILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

2 51 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 
301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

3 51 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 

4 01 EIITAKEGHP SVNWLYEGWV KSGKAIGKIR AYIRQQNADT VREEGRVQLD 
451 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 
501 PPPVPVSATT IVKQSKIKKG GKTGVLIDGE DGLMTTLAKC CKPAPPDDIA 
551 GFVTRERGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 
601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVNDLP 
651 RVLAGLGDVK GVLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41 3>: 

mll7 . seq (partial) 

1 . . GTGAAACTCA AGAAATACAA TGTCCATTTC GAAGTCGCCG GCCGCCCGAA 

51 ACACATCTAC TCCATTTACA AAAAAATGGT GAAGAAAAAA CTCAGCTTCG 

101 ACGGCCTCTT TGACATCCGC GCCGTGCGAA TTCTGGTTGA TACCGTCCCC 

151 GAGTGTTACA CCACGCTGGG TATCGTCCAC AGCCTCTGGC AGCCCATTCC 

201 CGGCGAGTTC GACGACTACA TCGCCAATCC CAAAGGCAAC GGCTATAAAA 

251 GTTTGCACAC CGTCATCGTC GGCCCGGAAG ACAAAGGCGT GGAAGTACAA 

301 ATCCGCACCT TCGATATGCA CCAATTCAAC GAATTCGGTG TCGCCGCCCA 

351 CTGgCGTTAC AAAGAGGGCG GCAAGGGCGA TTCCGCCTAC GAACAGAAAA 

4 01 TCGCCTGGTT GCGCCAACTC TTGGACTGGC GCGAAAACAT GGCGGAAAGC 

4 51 GGCAAGGAAG ACCTCGCCGC CGCCTTCAAA ACCGAGCTTT TCAACGACAC 

501 GATTTATGTT TTGACCCCGC ACGGCAAAGT CCTCTCCCTG CCCACGGGCG 

551 CGACCCCCAT CGACTTCGCC TACGCCCTGC ACAGCAGCAT CGGCGACCGT 

601 TGCCGCGGTG CGAAAGTCGA AGGGCAGATT GTGCCGCTGT CCACCCCGCT 

651 CGAAAACGGA CAGCGCGTCG AAATCATTAC CGCCAAAGAA GGGCATCCTT 

701 CCGTCAACTG GCTTTACGAA GGCTGGGTCA AATCCAACAA GGCAATCGGC 

751 AAAATCCGCG CCTACATCCG CCAGCAAAAC GCCGACACCG TGCGCGAAGA 

801 AGGCCGCGTC CAACTCGACA AACAGCTTGC CAAACTCACG CCCAAACCCA 

8 51 ACCTGCAAGA GCTTGCCGAA AATCTCGGCT ACAAAAAGCC AGAAGACCTC 

901 TACACCGCCG TCGGACAAGG CGAAATTTCC AACCGCGCCA TCCAAAAAGC 

951 CTGCGGCACg CTGAACGAAC CGCCGCCCGT ACCCGTCAGC GAAACCACCA 

1001 TCGTCAAACA GTCCAAAATC AAAAAAGGCG GCAAAAACGG CGTGCTCATC 

1051 GACGGCGAAG ACGGTCTGAT GACCACGCTT GCCAAATGCT GCAAACCCGC 

1101 GCCGCCCGAC GATATTATCG GCTTCGTTAC CCGCGAGCGC GgCATTTCAG 

1151 TGCACCGCAA AwyyTkCyCG TCTTTCCAAC ACCTCGCCGA ACACGCGCCC 

12 01 GAwAAAGTGC TGGACGCAAG CTGGGCGGCA TTGCAGGAAG GACAAGTATT 
1251 CGCCGTCGAT ATCGAAATCC GCGCCCAAGA CCGCTCCGGG CTTTTGCGCG 

13 01 ACGTATCCGA CGCGCTCGCC CGCCACAAAC TCAACGTTAC CGCCGTGCAA 
1351 ACCCAGTCCC GCGACTTGGA AGCCAGCATG AGGTTCACGC TCGAAGTCAA 

14 01 ACAAGTCAAC GACCTCCCGC GCGTCCTCGC CAGCCTCGGC GACGTCAAAG 
14 51 GCGTATTGAG CGTTACCCGG CTTTAA 

This corresponds to the amino acid sequence <SEQ ID 414; ORF 1 17>: 

m!17 .pep (partial) 
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1. . . . VKLKKYNVHF EVAGRPKHIY SIYKKMVKKK LSFDGLFDIR AVRILVDTVP 

51 ECYTTLGIVH SLWQPIPGEF DDYIANPKGN GYKSLHTVIV GPEDKGVEVQ 

101 IRTFDMHQFN EFGVAAHWRY KEGGKGDSAY EQKIAWLRQL LDWRENMAES 

151 GKEDLAAAFK TELFNDTIYV LTPHGKVLSL PTGATPIDFA YALHSSIGDR 

201 CRGAKVEGQI VPLSTPLENG QRVE I I TAKE GHPSVNWLYE GWVKSNKAIG 

251 KIRAYIRQQN ADTVREEGRV QLDKQLAKLT PKPNLQELAE NLGYKKPEDL 

3 01 YTAVGQGEIS NRAIQKACGT LNEPPPVPVS ETTIVKQSKI KKGGKNGVLI 
351 DGEDGLMTTL AKCCKPAPPD DIIGFVTRER GISVHRKXXX SFQHLAEHAP 
401 XKVLDASWAA LQEGQVFAVD IEIRAQDRSG LLRDVSDALA RHKLNVTAVQ 

4 51 TQSRDLEASM RFTLEVKQVN DLPRVLASLG DVKGVLSVTR L* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 117 shows 97.6% identity over a 490 aa overlap with a predicted ORF (ORF 1 17.ng) 

from N. gonorrhoeae; 
mll7/gll7 

10 20 30 

VKLKKYNVHFEVAGRPKHIYSIYKKMVKKKL 

MM I hi III M II I II: 1 1! M ; 1 1, 1 1 
EKYREIALLLDEKRTERLEYIENFLDILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKL 
150 160 170 180 190 200 

40 50 60 70 80 90 

SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 

1 1 1 1 1 M I < II 1 1 II I M I MM 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 

100 110 120 130 140 150 

PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 

M 1 1 M I M M 1 1 IM M II II M 1 1 1 1 II II 1 1 M 1 1 1 1 M I II M I M 1 1 1 M M 1 1 1 

PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 

160 170 180 190 200 210 

KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

I III II II IIMMIIIIIMI MIIMMMMII IMMI 1 1 1 1 1 M IM MIM MM 

KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 

220 230 240 250 260 270 

PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNAD7VREEGRVQ 

I M I M 1 1 1 1 II M I M M MMM 1 1 M Ml MM I M 1 1 1 Ml I M 1 1 M 1 1 1 1 M M 

PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSGKAIGKIRAYIRQQNADTVREEGRVQ 
390 400 410 420 430 440 

280 290 300 310 320 330 

LDKQLAKLT P KPNLQEL AENLG YKKPEDL YTAVGQGE I SNRAI Q KACGTLNE P P P V P VS E 

I II 1 1 Ml II II Mill IMMM I MIM II II Ml III :l I II Ml 1 1 MMM I I 

LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSA 
450 460 470 480 490 500 

340 350 360 370 380 390 

TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 M 1 1 M M I 

TTI VKQS KI KKGGKTGVLI DGEDGLMTTLAKCCKPAPPDDI AGFVTRERGI SVHRKTCPS 
510 520 530 540 550 560 

400 410 420 430 440 450 

FQHLAEHAPXKVLDASWAALQEGQVFAVD I E I RAQDRSGLLRDVSDALARHKLNVTAVQT 



mll7 .pep 
gll7 

mll7 .pep 
gll7 

mll7 -pep 
gll7 

mll7 .pep 
gll7 

ml 17 .pep 
gll7 

mll7 .pep 
gll7 

mll7 .pep 
gll7 

mll7 .pep 
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hlllllll 1 1 M I Ml M 1 1 1 1 1 II Ml II II I M Ml 1 1 II M I II M 1 1 II M I M 

g 1 1 7 FRHLAEHAPEKVLDASWAALQEGQVFAVD I E I RAQDRSGLLRDVSDALARHKLNVTAVQT 

570 580 590 600 610 620 

460 470 480 490 

ml 1 7 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLS VTRLX 

I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I I 
gll7 QSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLS VTRLX 

630 640 650 660 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 415>: 

al!7 . seq 

1 ATGGTTCATG AACTCGACCT GCTCCCCGAT GCCGTCGCCG CCACCCTGCT 

51 TGCCGACATC GGACGCTACG TCCCCGACTG GAACCTATTG GTTTCCGAAC 

101 GCTGCAACAG TACCGTCGCC GAGCTGGTCA AAGGTGTGGA CGAAGTGCAG 

151 AAACTCACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 

201 CGCCCAGCAG GCAGAAACTA TGCGGAAAAT GCTGCTGGCG ATGGTTACCG 

251 ACATCCGCGT CGTGTTAATC AAACTGGCGA TGCGTACGCG CACCCTGCAA 

301 TTTTTAAGCA ACGCCCCCGA CAGCCCCGAA AAACGCGCCG TCGCCAAAGA 

351 AACCCTCGAC ATCTTCGCCC CGCTCGCCAA CCGTTTGGGC GTGTGGCAGC 

4 01 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 

4 51 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTTA ATATCCTGCG TACGGAACTC AAAAAATACA 

551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGGGTTGT TCGACATCCG 

651 CGCCGTGCGG ATTCTGGTTG ATACCGTCCC CGAGTGTTAC ACCACACTGG 

7 01 GCATTGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGAGTT CGACGACTAC 

7 51 ATCGCCAACC CGAAAGGCAA CGGCTATAAA AGTTTGCACA CCGTCATCGT 

801 CGGCCCGGAA GACAAAGGCG TGGAAGTGCA AATCCGCACC TTCGATATGC 

851 ACCAATTCAA CGAATTCGGT GTCGCCGCGC ACTGGCGTTA CAAAGAGGGC 

901 GGCAAAGGCG ATTCCGCCTA CGAACAAAAA ATCGCCTGGT TACGCCAACT 

951 TTTGGACTGG CGCGAAAACA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCCCT GCCCACAGGC GCGACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TCGGCGACCG TTGCCGCGGT GCGAAAGTCG 

1151 AAGGGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGTGTC 

1201 GAAATCATTA CCGCCAAAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGCTGGGTC AAATCCAACA AGGCAATCGG CAAAATCCGC GCCTACATCC 

1301 GCCAGCAAAA CGCCGACACC GTGCGCGAAG AAGGCCGCGT CCAACTCGAC 

1351 AAACAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTGCCGA 

1401 AAATCTCGGC TACAAAAAGC CAGAAGACCT CTACACCGCC GTCGGACAAG 

14 51 GCGAAATTTC CAACCGCGCC ATCCAAAAAG CCTGCGGCAC GCTGAACGAA 

1501 CCGCCGCCCG TACCCGTCAG CGAAACCACC ATCGTCAAAC AGTCCAAAAT 

1551 CAAAAAAGGC GGCAAAAACG GCGTGCTCAT CGACGGCGAA GACGGTCTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGACATTGTC 

1651 GGCTTCGTTA CCCGCGATCG CGGCATTTCG GTACACCGCA AAACCTGCCC 

1701 CTCTTTCCGA GACCTCGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

17 51 GTTGGGCGGC GTTGCAGGAA GGACAAGTGT TCGCCGTCGA TATCGAAATC 

1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGTTAC CGACCTCCCA 

1951 CGCGTCCTCG CCAGCCTCGG CGACGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 416; ORF 1 17.a>: 

all 7 .pep 

1 MVHELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMR KMLLA MVTDIRWLI KLA MRTRTLQ 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLNILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

2 51 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 
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401 EIITAKEGHP SVNWLYEGWV KSNKAIGKIR AYIRQQNADT VREEGRVQLD 

4 51 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 

501 PPPVPVSETT IVKQSKIKKG GKNGVLIDGE DGLMTTLAKC CKPAPPDDIV 

551 GFVTRDRGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 

601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVTDLP 

651 RVLASLGDVK GVLSVTRL* 



mll7/all7 98.0% identity in 490 aa overlap 



10 20 30 

ml 17 .pep VKLKKYNVHFE VAGRPKH I YS I YKKMVKKKL 

: I II I I : I I I I I 1 II I II I I I 1 I I I I I I I I 
all7 EKYRE I ALLLDEKRTERLEYI EN FLN I LRTELKKYNIHFE VAGRPKH I YS I YKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

ml 17 .pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
I I M I I I I I I I I I I I I I I I I I I II I I I I I I II I I 1 I I I I I I I I I I I I I I I | I | | | | | l | | 
all7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 



100 110 120 130 140 150 

ml 17 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I i II I I I I I I I | I | | | 
all7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 



160 170 180 190 200 210 

ml 17. pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I | | | | I | | M || ! 
all7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 



220 230 240 250 260 270 

mll7 .pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 
I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | I | | | | M 
al!7 PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 
390 400 410 420 430 440 



280 290 300 310 320 330 

mll7 . pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I II I I I I I I I I I I 1 I I I | | | | | | 
all7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
450 460 470 480 490 500 

340 350 360 370 380 390 

mll7.pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I : I I I I | : I I I | | I | | : | 
all7 TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPS 
510 520 530 540. 550 560 



400 410 420 430 440 450 

mll7 . pep FQHLAEHAPXKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
1:1111111 I I M I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I | | | | | 
all7 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 

460 470 480 490 

mll7 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 
I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 
all7 QSRDLEASMRFTLEVKQVTDLPRVLASLGDVKGVLSVTRLX 
630 640 650 660 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 41 7>: 

gll7-l . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CCCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTCGCTGG CGCAGGAACA TTATCCTGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTCGACGAAC TCGACCTGCT GCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGATTGGAA CCTATTGGTT 

301 TCCGAGCGCT GCAACAGCAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTTACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

4 01 AAGAACGCGC ACAGCAAGCG GAAACCATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT ATTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCTGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG CTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCGATA TCCTGCGTAC GGAACTCAAA 

751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTGTTCG 

851 ACATCCGCGC CGTGCGGATT CTGGTCGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGCA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGagttCGA 

951 cgactACATC GCCAACCCCA AAGgcaACGg ttATAAAAGt TTGCACACCG 

1001 TCATCGTcgg cccGGAagaa aaaggtgtgg aagtgcAAAT CCGCACCTTC 

1051 GATATGCacc AATTCaaCga ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAAGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAATATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCTCTGCC AACGGGCGCA ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGCTG CCGGGGCGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCGGCAAGG CCATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGTGTCCA 

1551 ACTCGACAAG CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTGC CCGTCAGCGC AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGTGGC AAAACCGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGCTTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTGCCGGC TTCGTTACCC GCGAGCGCGG CATTTCCGTC CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTTGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGG CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCG GCCTCGGCGA TGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 41 8; ORP 1 1 7-1 .ng>: 

gll7-l .pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WSLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VDELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRK MLLAM 

151 VTDIRVVLIK LAM RTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLDILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVI VGPEE KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SGKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSATTI VKQSKIKKGG KTGVLIDGED 

601 GLMTTLAKCC KPAPPDDIAG FVTRERGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLAGLGDVKG VLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41 9>: 

mll7-l.seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 
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51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTTGCTGG CGCAGGAACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAAAGCC 

651 CGAAAAATAC CGCGAAATCG CGCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCAACA TCCTGCGCGG TGAACTCAAG 

751 AAATACAATG TCCATTTCGA AGTCGCCGGC CGCCCGAAAC ACATCTACTC 

801 C ATT T AC AAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTCTTTG 

851 ACATCCGCGC CGTGCGAATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGTA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAATCCCA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTACAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAGGGCGGC AAGGGCGATT CCGCCTACGA ACAGAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GG ACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACGGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTATCGGC TTCGTTACCC GC GAGCGCGG CATTTCAGTG CACCGCAAAA 

1901 CCTGCCCGTC TTTCCAACAC CTCGCCGAAC ACGCGCCCGA AAAAGTGCTG 

1951 GACGCAAGCT GGGCGGCATT GCAGGAAGGA CAAGTATTCG CCGTCGATAT 

2001. CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 420; ORF 1 17-1>: 

mll7-l.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WLLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRK MLLAM 

151 VTDIRVVLIK LA MRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQKPEKY REIALLLDEK RTERLEYIEN FLNILRGELK 

251 KYNVHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIIG FVTRERGISV HRKTCPSFQH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLASLGDVKG VLSVTRL* 

mll7-l/gll7-l 98.2% identity in 737 aa overlap 

10 20 30 40 50 60 

ml 17-1 .pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
I I I I I I I I I I I I I M I M I I I I 1 I I I I I I I I I 1 I I I I I I I I ! I I I I I I I I I I I I I I I I I 
gll7-l MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWSLAQEHYPADAATPYGEPL 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml 1 7 - 1 . pep PDH FLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 
g 1 1 7 - 1 PDHFLGAAQMVDELDLLPDAVAATLLADI GRY VPDWNLLVSERCNSTVAELVKGVDEVQK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 17-1 .pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 

1 1 1 1 1 M 1 1 i i 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gll7-l LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREI ALLLDEKRTERLEYI EN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 1 I I I ! I I I I II I I I I I I | I 
g!17-l RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKYREI ALLLDEKRTERLEYI EN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 17-1. pep FLNILRGELKKYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
I I : I I I I I I I I I : ! I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I II I I I I 1 I t I I 
gll7-l FLDILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

mll7-l . pep TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I I I I I I I f I I I I I I I I I I I I I II I I I I I I I I I I I I I : I I I I I I I I I I I I I I I | | | | | 
gll7-l TLGI VHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEEKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 1 7 - 1 . pep AAHWRYKEGGKGDS AYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 
I I I I II I I I I ! I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I } I I I I I I I I I I M I I I 
gll7-l AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 17-1 .pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I | | | | 
g!17-l GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 17-1 .pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

I I I I I I I I ! I I : I I I I I i I I I M II I I I I I I I I I I I I I I I I I I I I I I I I | | | | 

gll7-l VNWLYEGWVKSGKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
490 500 510 520 530 540 

550 560 570 580 590 600 

ml 17-1 .pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 11 I I I I I I I I I I I | | M : I || | | | | | 
gll7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSATTIVKQSKIKKGGKTGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

mll7-l .pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I i I I I I I I I I I I I I I I MM IMIIimil!tll:||IIMIIIIIIII I MM II 
gll7-l GLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

mll7-l . pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I I II II II I I I I M I M I I I II I I I II II I I I M I I I 1 I I II I I II I II I I I I I I 
gl!7-l QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 

670 680 690 700 710 720 

730 

ml 1 7 - 1 . pep VLASLGDVKGVLS VTRLX 
I I I : I I I I I I I I I I I I II 
g 1 1 7 - 1 VLAGLGDVKGVLS VTRLX 

730 

m!17-l/RelA 
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sp|P55l33|RELA_VIBSS GTP PYROPHOSPHOKINASE (ATP : GTP 3 ' -PYROPHOSPHOTRANSFERASE) (PPGPP 
SYNTHETASE I) >gi 1537617 (U13769) ppGpp synthetase I [Vibrio sp.] Length = 744 
Score = 536 bits (1366), Expect = e-151 

Identities = 288/685 (42%), Positives = 432/685 (63%), Gaps - 31/685 (4%) 

Query: 74 LDLLPDAVAATLLADI GRYVPDWNLLVSERCNSTVAELVKGVDEVQKLTHFARVDSL 130 

L + D + A LL + G Y D + E + T+ LV+GV+++ ++ ++ S 
Sbjct : 68 LSMDADTLIAALLYPLVEGGCYSTD ALKEEYSGTILHLVQGVEQMCAIS QLKST 121 

Query: 131 ATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEKRAVAKETLDI 190 

A +A Q + +R+MLL+MV D R V+IKLA R L+ + + PD +RA A+E ■ +1 
Sbjct: 122 AEETAQAAQVDNIRRMLLSMVDDFRCVVIKLAERICNLREVKDQPDEV-RRAAAQECANI 180 

Query: 191 FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIENFLNILRGELK 250 

+APLANRLG+ QLKW++ED FR+Q P+ Y++IA L E+R +R +YI +F++ L +K 
Sbjct: 181 YAPLANRLGIGQLKWEIEDYAFRYQHPDTYKQIAKQLSERRIDREDYITHFVDDLSDAMK 240 

Query: 251 KYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQ 310 

N+ EV GRPKHI YSI++KM KK L FD LFD+RAVRI+ + + +CY LG+VH+ ++ 
Sbjct: 241 ASNIRAEVQGRPKHIYSIWRKMQKKSLEFDELFDVRAVRIVAEELQDCYAALGVVHTKYR 300 

Query: 311 PI PGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEG- 369 

+P EFDDY+ANPK NGY+S+HTV++GPE K +E+QIRT MH+ +E GVAAHW+YKEG 
Sbjct: 301 HLPKEFDDYVANPKPNGYQSIHTVVLGPEGKTIEIQIRTKQMHEESELGVAAHWKYKEGT 360 

Query: 370 — GKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPHGKVLSLP 427 

G SAY++KI WLR+LL W+E M+ + SG ++ ++++F+D +Y TP G V+ LP 
Sbjct: 361 ASGGAQSAYDEKINWLRKLLAWQEEMSDSG--EMLDELRSQVFDDRVYAFTPKGDVVDLP 418 

Query: 428 TGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPSVNWLYE- 486 

+ ATP+DFAY +HS +G RC GAKVEG+IVP + L+ G +VEIIT KE +PS +WL 
Sbjct: 419 SNATPLDFAYHIHSEVGHRCIGAKVEGRIVPFTYHLQMGDQVEIITQKEPNPSRDWLNPN 478 

Query: 4 87 -GWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKL--TPKPNLQELAENLGYKKP 543 

G+V S + +A K+ A+ R+Q+ D G+ L+ +L K+ T K A+ K P 

Sbjct: 479 LGFVTSSRARAKVHAWFRKQDRDKNIIAGKEILEAELVKIHATLKDAQYYAAKRFNVKSP 538 

Query: 544 EDLYTAVGQGEIS-NRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGV 594 

E+LY +G G++ N+ I +N+P + + K S+ KK ++ V 

Sbjct: 539 EELYAGIGSGDLRINQVINHINALVNKPTAEEEDQQLLEKLSEASNKQATSHKKPQRDAV 598 

Query: 595 LIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASW 654 

+++G D LMT LA+CC + P P DDI GFVT+ RGISVHR C + L HAPE+++D W 
Sbjct: 599 VVEGVDNLMTHLARCCQPIPGDDIQGFVTQGRGISVHRMDCEQLEELRHHAPERIIDTVW 658 

Query: 655 AALQEGQVFAVDI E I RAQDRSGLLRDVSDAL ARHKLN VTAVQTQ- - SRDLEASMRFTLEV 712 

G + + + + A +R+GLL+++++ L K+ V ++++ + + M F LE+ 
Sbjct: 659 GGGFVGN-YTITVRVTASERNGLLKELTNTLMNEKVKVAGMKSRVDYKKQMSIMDFELEL 717 

Query: 713 KQVNDLPRVLASLGDVKGVLSVTRL 737 

+ L RVL + VK V RL 
Sbjct: 718 TDLEVLGRVLKRIEQVKDVAEAKRL 742 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 421>: 

all7-l. seq 

1 ATGACCGCCA 

51 ATTGCGCGAA 

101 AAAAACTTGT 

151 GATGCCGCCA 

201 GGCGCAAATG 

251 CCCTGCTTGC 

301 TCCGAACGCT 

351 AGTGCAGAAA 

4 01 AAGAACGCGC 

451 GTTACCGACA 

501 CCTGCAATTT 

551 CCAAAGAAAC 

601 TGGCAGCTCA 

651 CGAAAAATAC 

701 GCCTCGAATA 

751 AAATACAATA 

801 CATTTACAAA 



TCAGCCCGAT TCAAGACACG 
TGGTTCGACA GCTACTGCAC 
CTTAGCCGCC CGTTCGCTGG 
CGCCGTATGG CGAACCGCTG 
GTTCATGAAC TCGACCTGCT 
CGACATCGGA CGCTACGTCC 
GCAACAGTAC CGTCGCCGAG 
CTCACCCACT TCGCCCGGGT 
CCAGCAGGCA GAAACTATGC 
TCCGCGTCGT GTTAATCAAA 
TTAAGCAACG CCCCCGACAG 
CCTCGACATC TTCGCCCCGC 
AATGGCAGCT CGAAGATTTG 
CGCGAAATCG CCCTGCTTTT 
CATCGAAAAC TTCCTTAATA 
TCCACTTTGA AGTCGCCGGC 
AAAATGGTGA AGAAAAAACT 



CAAAGCGCGA CTCTGCAAGA 
CGCGCTGCCG AACAACGATA 
CGGAAGCACA TTACCCCGCC 
CCCGACCACT TCCTCGGCGC 
CCCCGATGCC GTCGCCGCCA 
CCGACTGGAA CCTATTGGTT 
CTGGTCAAAG GTGTGGACGA 
GGACAGCCTC GCCACGCCGG 
GGAAAATGCT GCTGGCGATG 
CTGGCGATGC GTACGCGCAC 
CCCCGAAAAA CGCGCCGTCG 
TCGCCAACCG TTTGGGCGTG 
GGCTTCCGCC ATCAAGAACC 
GGACGAAAAA CGCACCGAAC 
TCCTGCGTAC GGAACTCAAA 
CGTCCGAAAC ACATCTACTC 
CAGCTTCGAC GGGTTGTTCG 
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851 ACATCCGCGC CGTGCGGATT 

901 ACACTGGGCA TTGTCCACAG 

951 CGACTACATC GCCAACCCGA 

1001 TCATCGTCGG CCCGGAAGAC 

1051 GATATGCACC AATTCAACGA 

1101 AGAGGGCGGC AAAGGCGATT 

1151 GCCAACTTTT GGACTGGCGC 

1201 CTCGCCGCCG CCTTCAAAAC 

1251 GACCCCGCAC GGCAAAGTCC 

1301 ACTTCGCCTA CGCCCTGCAC 

1351 AAAGTCGAAG GGCAGATTGT 

1401 GCGTGTCGAA ATCATTACCG 

1451 TTTACGAAGG CTGGGTCAAA 

1501 TACATCCGCC AGCAAAACGC 

1551 ACTCGACAAA CAGCTTGCCA 

1601 TTGCCGAAAA TCTCGGCTAC 

1651 GGACAAGGCG AAATTTCCAA 

1701 GAACGAACCG CCGCCCGTAC 

1751 CCAAAATCAA AAAAGGCGGC 

1801 GGTCTGATGA CCACGCTTGC 

1851 CATTGTCGGC TTCGTTACCC 

1901 CCTGCCCCTC TTTCCGACAC 

1951 GACGCAAGTT GGGCGGCGTT 

2001 CGAAATCCGC GCCCAAGACC 

2051 CGCTCGCCCG CCACAAACTC 

2101 GACTTGGAAG CCAGCATGAG 

2151 CCTCCCACGC GTCCTCGCCA 

2201 TTACCCGGCT TTAA 



CTGGTTGATA CCGTCCCCGA GTGTTACACC 
CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 
AAGGCAACGG CTATAAAAGT TTGCACACCG 
AAAGGCGTGG AAGTGCAAAT CCGCACCTTC 
ATTCGGTGTC GCCGCGCACT GGCGTTACAA 
CCGCCTACGA ACAAAAAATC GCCTGGTTAC 
GAAAACATGG CGGAAAGCGG CAAGGAAGAC 
CGAGCTTTTC AACGACACGA TTTATGTTTT 
TCTCCCTGCC CACAGGCGCG ACCCCCATCG 
AGCAGCATCG GCGACCGTTG CCGCGGTGCG 
GCCGCTGTCC ACCCCGCTCG AAAACGGACA 
CCAAAGAAGG GCATCCTTCC GTCAACTGGC 
TCCAACAAGG CAATCGGCAA AATCCGCGCC 
CGACACCGTG CGCGAAGAAG GCCGCGTCCA 
AACTCACGCC CAAACCCAAC CTGCAAGAGC 
AAAAAGCCAG AAGACCTCTA CACCGCCGTC 
CCGCGCCATC CAAAAAGCCT GCGGCACGCT 
CCGTCAGCGA AACCACCATC GTCAAACAGT 
AAAAACGGCG TGCTCATCGA CGGCGAAGAC 
CAAATGCTGC AAACCCGCGC CGCCCGACGA 
GCGATCGCGG CATTTCGGTA CACCGCAAAA 
CTCGCCGAAC ACGCGCCCGA AAAAGTACTG 
GCAGGAAGGA CAAGTGTTCG CCGTCGATAT 
GCTCCGGGCT TTTGCGCGAC GTATCCGACG 
AACGTTACCG CCGTGCAAAC CCAGTCCCGC 
GTTCACGCTC GAAGTCAAAC AAGTTACCGA 
GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 



This corresponds to the amino acid sequence <SEQ ID 422; ORF 1 17-La>: 

all7-l.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCTALP NNDKKLVLAA RSLAEAHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLNILRTELK 

251 KYNIHFEVAG RPKHI YSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

4 01 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

4 51 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIVG FVTRDRGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVTDLPR VLASLGDVKG VLSVTRL* 



all7-l/mll7-l 97.7% identity in 737 aa overlap 

10 20 30 40 50 60 

mll7-l .pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I : I I I : I : : I I I : I I! I I I I I I I I I I I 
a ll7-l MTAISPIQDTQSATLQELREWFDSYCTALPNNDKKLVLAARSLAEAHYPADAATPYGEPL 

10 20 30 40 50 60 



70 80 90 100 110 120 

mll7-l .pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
I I I I I I I I I ! I I I I I I I I I I I I I I I I I II I I I I I I I I I I I ! I I I I I I I I I I II I I ! I I I I 
all7-l PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 

70 80 90 100 110 120 



130 140 150 160 170 180 

mll7-l.pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 

I 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 

all7-l LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
130 140 150 160 170 180 



190 200 210 220 230 240 

mll7-l.pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIEN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : i I ! I I I I I I I I I I I I I I I I I I I I I 
all7-l RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKYREIALLLDEKRTERLEYI EN 

190 200 210 220 230 240 
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250 260 270 280 290 300 

ml 17-1 .pep FLNILRGELKKYNVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
1 | | | I I I I I J I I : I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I 
a 117-l FLNILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 



310 320 330 340 350 360 

mll7-l .pep TLGI VHSLWQPI PGEFDDYI ANPKGNGYKSLHTVIVGPEDKGVEVQI RTFDMHQFNEFGV 
I | | | | I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
all7-l TLGI VHSLWQPI PGEFDDYI ANPKGNGYKSLHTVIVGPEDKGVEVQI RTFDMHQFNEFGV 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 1 7 - 1 . pep AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 
I ! | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I! I I I I 1 I I I I I I I I I I I I I I I I I I 
all7-l AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 

370 380 390 400 410 420 



430 440 450 460 470 480 

mll7-l .pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I | j | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I 
a H7-1 GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 

430 440 450 460 470 480 



490 500 510 520 530 540 

mll7-l .pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I | | M | | I I I I I I I I M I I I I I I I I II I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 117-l VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 



550 560 570 580 590 600 

mll7-l.pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 

I M | | | | I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 1 I I I I I I I I II I I M I I I I I I 
all7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
550 560 570 580 590 600 



610 620 630 640 650 660 

ml 17-1. pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I || | | | | | | I | I I I I I I I : I I I I I : I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 
all7-l GLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 



670 680 690 700 710 720 

ml 17-1. pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I | I || | I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I : I I I I 
all7-l QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVTDLPR 

670 680 690 700 710 720 



730 

ml 17-1 .pep VLASLGDVKGVLSVTRLX 
I I I I I I I I I I I I I I I I I I 
a 1 1 7 - 1 VL AS LGDVKGVLS VTRLX 

730 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 423>: 



gll8 . seq 








1 


ATGTGCGAGT 


TCAAGGATTT 


TAGAAGAAAC 


. 51 


TGACGAAAAT 


TCATTTATTG 


GCAAATGGTA 


101 


ATGAAGAATA 


TTGGAAGCTG 


GAGAATGATT 


151 


TATCCTTATC 


CGATGGATAT 


ACCAAGGGAT 


201 


CATTATTGAT 


TTTTTAATGG 


TTCCAAATTG 


251 


CTTCCCCTTG 


GTTGCCTGAT 


AGCGTGGGAA 


301 


TTCACAACGA 


TGCTCCGTTA 


TATTTTTACC 


351 


GCGATTTGAT 


TATTACAaCA 


AAAAATAG 



This corresponds to the amino acid sequence <SEQ ID 424; ORP 1 18.ng>: 

gll8 .pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRRK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 425>: 

mll8 . seq 

1 ATGTGTGAGT TCAAGGATAT TATAAGAAAC GTTCCTTATT TTGAGGGGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGTTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGACAT ACCAAGATAT GTTGTCATTG GAATCGGTAC 

201 CATTATTGAT TTCTTAATGG TTCCAAATTG GAAACTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTG.CCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 426; ORF 1 18>: 

ml 18 .pep 

1 MCEFKDIIRN VPYFEGYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRY VVIGIGTIID FLMVPNW KLF EIKASPWLPD SVGIHERYER 

101 FTTMLRYI FT EKDIVNVRFD YYNKK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 118 shows 92.8% identity over a 125 aa overlap with a predicted ORF (ORF 1 18.ng) 
from N. gonorrhoeae: 
mll8/gll8 



10 20 30 40 50 60 

ml 18 . pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY 

I I I I I I : 11:1 I I I I I I 1 I I I I I I I I I I I I I I II I I I I I I N I I I : I I I I I I ! ! I I 
gll8 MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRRKYPYPMDIPRD 

10 20 30 40 50 60 



70 80 90 100 110 120. 

mll8 .pep VVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
gll8 IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

70 80 90 100 110 120 



ml 18. pep YYNKKX 
I 1 I I I I 

gll8 YYNKKX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 427>: 

all8 . seq 

1 ATGTGTGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAATTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 428; ORF 1 18.a>: 

all 8 . pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYI FT EKDIVNVRFD YYNKK* 



mll8/all8 93.6% identity in 125 aa overlap 

10 20 30 40 50 60 

ml!8.pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY 

I I I I ! I : I I : I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I i I I I I I I I I I I I I I 
all8 MCEFKDFRRN I PCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKK YPYPMDIPRD 

10 20 30 40 50 60 
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70 80 90 100 110 120 

mll8.pep VVIGIGTIIDFLMV PNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

: I I I I I I I I I I I I I I I I : I 1 I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | 
all 8 IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

70 80 90 100 110 120 



ml 18. pep YYNKKX 
I I I 1 I I 

all8 YYNKKX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 429>: 



gl20 . seq 












1 


ATGATGAAGA 


CTTTTAAAAA 


TATATTTTCC 


GCCGCCATTT 


TGTCCGCCGC 


51 


CCTGCCGTGC 


GCGTATGCGG 


CAAGGCTACC 


CCAATCCGCC 


GTGCTGCACT 


101 


ATTCCGGCAG 


CTACGGCATT 


CCCGCCACGA 


TGACATTTGA 


ACGCAGCGGC 


151 


AATGCTTACA 


AAATCGTTTC 


GACGATTAAA 


GTGCCGCTAT 


ACAATATCCG 


201 


TTTCGAATCC 


GGCGGTACGG 


TTGTCGGCAA 


TACCCTGCAC 


CCTGCCTACT 


251 


ATAAAGACAT 


ACGCAGGGGC 


AAACTGTATG 


CGGAAGCCAA 


ATTCGCCGAC 


301 


GGCAGCGTAA 


CCTACGGCAA 


AGCGGGCGAG 


AGCAAAACCG 


AGCAAAGCCC 


351 


CAAGGCTATG 


GATTTGTTCA 


CGCTTGCCTG 


GCAGTTGGCG 


GCAAATGACG 


401 


CGAAACTCCC 


CCCGGGTCTG 


AAAATCACCA 


ACGGCAAAAA 


ACTTTATTCC 


451 


GTCGGCGGCC 


TGAATAAGGC 


GGGTACGGGA 


AAATACAGCA 


Taggcggcgt 


501 


gGAAACCGAA 


GTCGTCAAAT 


ATCGGGTGCG 


GCGCGGCGAC 


GATACGGTAA 


551 


CGTATTTCTT 


CGCACCGTCC 


CTGAACAATA 


TTCCGGCACA 


AATCGGCTAT 


601 


ACCGAcgaCG 


GCAAAACCTA 


TACGCTGAAG 


CTCAAATCGG 


TGCAGATCAA 


651 


CGGACAGGCC 


GCCAAACCGT 


AA 







This corresponds to the amino acid sequence <SEQ ID 430; ORF 120.ng>: 

gl20 .pep 

1 MMKTFKNIFS AAILSAALPC AYA ARLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PAYYKDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DTVTYFFAPS LNNIPAQIGY 

2 01 TDDGKTYTLK LKSVQINGQA AKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 43 1>: 

ml20 . seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGmACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CTTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

3 51 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

4 01 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 
4 51 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 
501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 
551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 
601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 
651 CGGCCAGGCA GCCAAACCG 

This corresponds to the amino acid sequence <SEQ ID 432; ORF 120>: 

ml20 .pep 

1 MMKTFKNIFS AAILSAALPC AYAA GLPQSA VLXYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 120 shows 97.3% identity over a 223 aa overlap with a predicted ORF (ORF 120.ng) 
from N. gonorrhoeae: 

ml20/gl20 



10 20 30 40 50 60 

ml 2 0 . pep MMKTFKNI FSAAI LSAALPCAYAAGLPQS AVLXYSGS YGI PATMTFERSGNAYKIVSTI K 

1 1 1 1 1 1 1 1 1 1 II llllllllill II II I II II M I II 1 1 M 1 1 1 M I IIIIMIIII 

gl20 MMKTFKNI FSAAI LSAALPCAYAARLPOSAVLHYSGSYGI PATMTFERSGNAYKIVSTI K 

10 20 30 40 50 60 



70 .80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

I I I I I I I I I I I I I I I II I I I I : I h I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 
g 1 2 O VPL YNI RFESGGTWGNTLHPAYYKD I RRGKLYAEAKFADGS VTYGKAGES KTEQS P KAM 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml20 . pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 

M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I II 
gl20 DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 

130 140 150 160 170 180 



190 200 210 220 

ml20 .pep DAVMYF FAPS LNNI PAQ I GYTDDGKTYTLKLKS VQ I NGQAAKP 

hi IIIIMIIII llllllllill Mill II Ml I II I INI 

gl2 0 DTVTYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 433>: 

al20. seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

4 01 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

4 51 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 434; ORF 120.a>: 
al20.pep 

1 MMKT FKNIFS AAILSAALPC AYA AGLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTVVGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE VVKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP+ 



ml20/a!20 99.6% identity in 223 aa overlap 

10 20 30 40 50 60 

ml20.pep MMKTFKNI FSAAI LSAALPCAYAAGLPQSAVLXYSGSYG I PATMTFERSGNAYKI VST IK 

I I I I I I I I I ( I I I I I I I I I I II I I I I I I I I I I lilllilllllllllllllllllllll 
al2 0 MMKT FKNIFSAAILSAALPCAYAAGLPQSAVLHYSGSYGIPATMT FERSGNAYKIVSTIK 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 
i I I I I I I I I I I I I I I I M M I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I j | | 
al20 VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml20 . pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 
I I I I I I I I I I I I I I I I I I I I I! I I 1. 1 I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I | I | 
al20 DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml20 . pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

I II I I I I I I I I I I I M I I I I I I I I I I I I I I! I I I I I I I I M I I I 
al20 DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 435>: 

gl21 . seq 

1 ATGGAAACAC 

51 GGCGGATGCC 

101 AAGGGCACGC 

151 GATTTGCAGG 

201 GCAAGAACTC 

251 GTCAAAACCT 

301 ACCGTCCGAC 

351 GCCGCTGCTG 

4 01 GCCGCGACCT 

4 51 CACGAAGCCC 

501 CGGCGGGATT 

551 GCTTCGACAC 

601 cacTGGcagc 

651 catatTGCcg 

7 01 AACCCcaccc 

751 gaaacctAcc 

801 ttcccgattc 

851 CAGATGCCCG 

901 TTAATGGCGG 

951 CACCGCCGAA 

1001 cgtggttggC 

1051 GCGACCGGCG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 436; ORF 121.ng>: 

gl21 .pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRVVLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

2 01 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 437>: 

ml21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 



AGCTTTACAT CGGCATTATG 
GTGCTGGTAC GGATGGACGG 
CTTTACCCCC TACCCTGACC 
ACACAGGCAC AGACGAACTG 
AGCCGCCTGT ACGCGCAAAC 
CGCTCCGTGC GACATTACCG 
ACGCGCCGGA ACACGGTtac 
GCGGAACTGa cgcggatttT 
TGCTGCCGGC GGacaAGGTG 
TGTTCCGCGA TGACAGGGAA 
GCCAACATCA GCGTACTCCC 
AGGGCCGGGC AATATGCTGA 
TGCCTTACGA CAAAAacggt 
cAACTGCTCG gcaggctGCT 
aaAAAGCACG GGgcGCGaac 
ttgacggcgg cgaaaaccga 
accgcgcaaA ccgTttggga 
TCAAATGTAC ATTTGCGGCG 
ATTTGGCAGA ATGTTTCGGC 
CTGAACCTCG ATCCTCAATG 
GGCGTGTTGG ATTAACCGCA 
CATCCAAACC GTGTATTCTG 



TCGGGAACCA GTATGGACGG 
CGGCAAATGG CTGGGCGCGG 
GGTTGCGCCG CAAATTGCTG 
CACCGCAGCA GGATGTTGTC 
CGCCGCCGAA CTGCTGTGCA 
CCCTCGGCTG CCACGGGCAA 
AGCATACAGC TTGCCGATTT 
TACCGTCggc gacttcCGCA 
CGCCGCTCGT CCCCGCCTTT 
ACACGCGTGG TACTGAACAT 
CCCCGGCGCA CCCGCCTTCG 
TGGAcgcgtg gacgcaggca 
gcAAAGgcgg cacAAGGCAA 
CGCCcaccCG TATTTCTCAC 
TgtttgcccT AAattggctc 
tacgacgtat tgcggacgct 
cgccgtctca CACGCAGCGG 
GCGGCATCCG CAATCCTGTT 
ACACGCGTTT CCCTGCACAG 
GGTGGAGGCG gccgCATTtg 
TTCCCGGTAG TCCGCACAAA 
GGCGCGGGAT ATTATTATTG 
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151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTAT 


251 


GTCAAAACCT 


CGCACCGTCC 


301 


ACCGTCCGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGxxxxxxx 


401 


xxxxxxxxxx 


xxxxxxxxxx 


451 


xxxxxxxxxx 


xxxxxxxxxx 


501 


xxxxxxxxxx 


xxxxxxxxxx 


551 


xxxxxxxxxx 


xxxxxxxxxx 


601 


xxxxxxCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACTGCTCG 


701 


AACGCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGTTTT 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTGGCAGA 


951 


CACCGCCGAC 


CTGAACCTCG 


1001 


CGTGGTTGGC 


GGCGTGTTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 





AGACGAACTG CACCGCAGCA GGATTTTGTC 
ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACGGTTAC AGCATACAGC TTGCCGATTT 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
CAAAAACGGT GCAAAGTCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGACG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTGGAAGCC GCCGnATTTG 
ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG AnCGCGGGAT ATT ATT ATT G 



This corresponds to the amino acid sequence <SEQ ID 438; ORF 12 1>: 

ml21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

Computer analysis of the amino acid sequences gave the following results: 
Homology with a predicted ORF from N. meningitidis menA with menB 
ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 
from N. gonorrhoeae: 

ml21/gl21 

10 ' 20 30 40 50 60 

ml 21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I I I I I I I I I I I I I I I I I I : I I I ! I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I I 
g 121 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 - pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I j | I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I 
g 121 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

! : : : 
g 121 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: : I I I I I I I I I I : I I I I I I M I I I I I I I I I I : I I I I I I 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
M I I I I : I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 1 I M I I I I I I I I I I I I 
g 121 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 



WO 99/57280 



PCIYUS99/09346 



345 



310 320 330 340 350 360 

ml21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I | | | | | | | | I I I I I I II I I : I I I I I I I I i I I I I I I I I I I I M i I I I I I I I ( i I I I I I I I 
g 121 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
MINI 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 439>: 



a!21 . seq 



1 


ATGGAAACAC 


AGCTTTACAT 


51 


GGCGGATGCC 


GT ACT GAT AC 


101 


AAGGGCACGC 


CTTTACCCCC 


151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTGT 


251 


GTCAAAACCT 


CGCGCCGTCC 


301 


ACCGTCAGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGGAACGGA 


401 


GCCGCGACCT 


TGCGGCCGGC 


451 


CACGAAGCCC 


TGTTCCGCGA 


501 


CGGCGGGATT 


GCCAACATCA 


551 


GCTTCGACAC 


AGGACCGGGC 


601 


CACTGGCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACTGCTCG 


701 


AACCCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGATTC 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTGGCAGA 


951 


CACCGCCGAA 


CTGAACCTCG 


1001 


CATGGATGGC 


GGCGTGTTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 





CGGCATCATG TCGGGAACCA GCATGGACGG 
GGATGGACGG CGGCAAATGG CTGGGCGCGG 
TACCCCGGCA GGTTACGCCG CAAATTGCTG 
GGACGAACTG CACCGCAGCA GGATGTTGTC 
ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACAGTTAC AGCGTACAGC TTGCCGATTT 
CTCAGATTTT TACCGTCGGC GACTTCCGCA 
GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 
CGACAGGGAA ACACGCGCGG TACTGAACAT 
GCGTACTCCC CCCCGACGCA CCCGCCTTCG 
AATATGCTGA TGGACGCGTG GATGCAGGCA 
CAAAAACGGT GCAAAGGCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 
GTCAACCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG GGCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 440; ORF 121.a>: 

al21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21/al21 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml21.pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
| | | | | | I I 1 I I I i I I I I I I I I I I II I I I I I I I I I I i I I I I I I I I I I I : I I I I I I I I I I I I 
a 121 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml21.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I : I I : I I I I M I I 
al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
a 121 AERTQI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLN I GG I AN I SVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 11 If 

a 121 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I I I : I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I II I I I ! I I I I I I I I M I I I I I I I 

a 121 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 21 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAX FAWLAACWINRIPGSPHKATGASKPCIL 

I I I M I I I I I I I I I I I I I I : i I! I I I i I 1 I I I I I : I I I I : I I I I I I I I I I I I II I I I I I 
a 121 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml 21. pep XAGYYYX 
MINI 

al21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 44 1>: 

ml21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA * GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG . 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 442; ORF 121-1>: 

ml21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 95.6% identity in 366 aa overlap 

10 20 30 40 50 60 
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347 



ml21-l pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

I | | | I | I I I I I I I I I I I I t I I I : I I I I 1 I I I I I I I I I I I I I f I I I I : I I I : I I I 

g 121 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
| | I I : I 1 I I I I M I I M I I I I I I I i I ! I 1 I I I I I 1 I I I ! I I I I I I I I I I I I I I I I I I I I 
g 121 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l .pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
i | | | | | | I I I I I I I I I I I I I I I i I I I I I I I I I I I I I : I I I I : I I I I I I I I I I I I I ! I I 
g 121 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRVVLNIGGIANISVLPPGA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ral21-l.pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

I ! I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I i II I I I I : I I I I I I I 

g 121 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l . pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 1 1 I I I I I 1 I 1 I I I I I I I I I I I I I I I I I I 

g 121 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

I I I I I I I I I I I I 11 : I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I 

gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 
310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I 1 I t I I 

gl21 GAGYYYX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 443>: 

al21-l. seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG f 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 444; ORF 121-l.a>: 

al21-l .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 
51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 
101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 
151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 
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201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

3 51 ATGASKPCIL GAGYYY* 

ml21-l/al21-l 96.4% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21-l .pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I I I I i I I I I I I I ! I I ! I I I I I I I I I I I I I I I I I I I M I I I I I I : I I I I I I I I i I ( I 
al21-l METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I | I I : I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I I I I I I 
al21-l HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l .pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
I | I | : I I I I I I I I I I I I I I I I I I I M I I I I 1 I I I I I I : I I I I I I I I I I I I II I I I I I I ! I 
a 1 2 1 - 1 AERTQI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGI ANI S VLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l.pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMY ICGGGIRNPV 
I | | | | | I I I I I I I I I I I I I i I I I I I I I i I I I I I I I I I I I I I I I I I I II I I II I I I I I I I 
al21-l GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I 1 I I I I I I I I I I I :| I I I I I I I I I I I I I : I i I I : 1 I I I I I I I I I I I I I I II I I 
al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHP^ATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 445>: 

gl22.seq 

1 ATGGCTTTAC TGAGCATCCG CAAGCTGCAC AAACAATACG GCAGCGTAAC 

51 CGCCATCCAA TCCTTAGACT TGGACTTGGA AAAAGGCGAA GtcatCGTAC 

101 TGCTGGGCCC gTccggctgc ggCAAATCCA CCCTcctgcg ctgcgtcaaC 

151 GGTTTGGAGC CGCACCAagg cgGCAGCATC GTGATGGACG GTgtcgGCGA 

201 ATTCggcAAA GACGTTTCCT GGCAAACCGC CCGGCAAAAa gtcggtatgg 

251 tctttcaaag taacgAactg Tttgcccaca tgaccgtcat cgAaaacatc 

301 ttcttAggcC CGGTAAagga aCAAAAcCgc gaccgtgccg aagcaGAGGC 

3 51 gCAAGCCGGC AAactGttgg aacgcgTCGG actgctAGAC CGCAAAAACG 

4 01 CCTATCCGCG CGAACTTTCC GGCGGTCAGA AACAGCGCAT CGCCATTGTC 
4 51 CGCGCCCTGT GCCTGAATCC GGAAGTCATC CTGCTGGACG AAATCACCGC 
501 CGCACTTGAC CCCGAAATGG TGCGCGAAGT CTTGGAAGTG GTTTTGGAAC 
551 TCGCCCGCGA AGGGATGAGT ATGCTCATCG TAACCCACGA AATGGGGTTC 
601 GCACGCAAAG TTGCCGACCG CATCGTCTTT ATGGACAAAG GCGGCATCGT 
651 CGAATCGTCC GACCCCGAAA CCTTTTTTTC CGCACCAAAA AGCGAACGCG 
701 CCCGCCAATT TCTGGCAGGT ATGGACTACT GA 

This corresponds to the amino acid sequence <SEQ ID 446; ORF 122,ng>: 



WO 99/57280 



PCT/US99/09346 



349 



gl22 .pep 

1 MALLSIRKLH KQYGSVTAIQ SLDLDLEKGE VIVLLGPSGC GKSTLLRCVN 

51 GLEPHQGGSI VMDGVGEFGK DVSWQTARQK VGMVFQSNEL FAHMTVIENI 

101 FLGPVKEQNR DRAEAEAQAG KLLERVGLLD RKNAYPRELS GGQKQRIAIV 

151 RALCLNPEVI LLDEITAALD PEMVREVLEV VLELAREGMS MLIVTHEMGF 

201 ARKVADRIVF MDKGGIVESS DPETFFSAPK SERARQFLAG MDY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 447>: 

ml22 . seq 

1 GTTGTCATGA TTAAAATCCG CAATATCCAT AAGACCTTTG GCGAAAACAC 

51 TATTTTGCGC GGCATCGATT TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 

101 TCCTCGGGcC TTCCGGCTCA GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 

151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC GAGTTCGACA ACGAGCGACC 

201 GCTGAAAATC GATTTTTCTA AAAAACCAAG CAAACACGAT ATTTTGGCAC 

251 TGCGCCGCAA ATCAkGCATG GTGTTTCAAC AATACAAyCT CTTTCCGCAC 

3 01 AAAACCGCCT TGGAAAACGT AATGGAAGGA CCGGTTGCCG TACAgGGCAA 
351 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 

4 01 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 
4 51 CAGCAGCGCG TCGGCATTGC CCGCGCATTG GCGATTCAGC CTGAACTGAT 
501 GCTGTTTGAC GAACCGACTT CCGCGCTCGA TCCTGAATTG GTGCAAGATG 
551 TTTTGGATmC CATGAAGGAA TTGGCGCAAG AAGGCTGGAC CATGGTTGTC 
601 GTTACGCATG AAATCAAGTT CGCCTTAGAA GTGGCAACCA CCGwCGTCGT 
651 GATGGACrGC GGCGTTATTG TCGAACAAGG CAGCCCGCAA GATTTGTTCG 
701 ACCACCCCAA ACACGAACGG ACGCGGAGAT TTTTAAGCCA AATCCAATCT 
751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 448; ORF 122>: 

ml22 .pep 

1 WMIKIRNIH KTFGENTILR GIDLDVCKGQ VWILGPSGS GKTTFLRCLN 
51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSXM VFQQYNLFPH 
101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 
151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLDXMKE LAQEGWTMW 
201 VTHE I KFALE VATTXWMDX GVIVEQGSPQ DLFDHPKHER TRRFLSQIQS 
251 TKI* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 122 shows 47.2% identity over a 246 aa overlap with a predicted ORF (ORF 122.ng) 
from N. gonorrhoeae: 

ml22/gl22 

10 20 30 40 50 60 

ml 2 2 . pep WMI KI RNI HKTFGENTI LRG IDLDVCKGQVWI LGPSGSGKTTFLRCLNALEMPEDGQ I 

:::::||::|| :| | : : : : | | | « | | s | : | : | | | | | | | : | : | | | : | : | | : |:| 
gl22 MALLSIRKLHKQYGSVTAIQSLDLDLEKGEVIVLLGPSGCGKSTLLRCVNGLEPHQGGSI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml22 . pep EFDNERPLKIDFSKKPSKHDILALRRKSXMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 
:|: s | | : : |s| | | | | : : | | | | = = | |: s | || |:: | 

g 1 2 2 VMDGVGEFGKDVSWQTA RQKVGMVFQSNELFAHMTVI ENI FLGPVKEQNRDRA 

70 80 90 100 110 

130 140 150 160 170 180 

ml22 .pep QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 

:|: :| II I hi I I h = II I I I I h I I - h I I I -II- hi I hllllh 
gl22 EAEAQAGKLLERVGLLDRKNAYPRELSGGQKQRIAIVRALCLNPEVILLDEITAALDPEM 
120 130 140 150 160 170 

190 200 210 220 230 240 

ml 2 2 . pep VQDVLDXMKELAQEGWTMVWTHEIKFALEVATTXWMDXGVIVEQGSPQDLFDHPKHER 

|::||: ' llhll :|::|||h II :|| I I I I I I h - h : h I II I 
gl22 VREVLEWLELAREGMSMLIVTHEMGFARKVADRIVFMDKGGIVESSDPETFFSAPKSER 
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180 190 200 210 220 230 

250 

ml22 . pep TRRFLSQIQSTKIX 
:|:|h 

gl2 2 ARQFLAGMDYX . 

240 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 449>: 

a!22.seq 

1 GTTGTCATGA TTAAAATCCG CAATATCCAT AAGACCTTCG GCAAAAATAC 

51 CATTTTGCGC GGCATCAATT TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 

101 TCCTCGGGCC TTCCGGCTCA GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 

151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC GAGTTCGACA ACGAGCGACC 

201 GCTGAAAATC GATTTTTCTA AAAAACCAAG CAAACACGAT ATTTTGGCAC 

251 TGCGCCGCAA ATCAGGCATG GTGTTTCAAC AATACAACCT CTTTCCGCAC 

301 AAAACCGCCT TGGAAAACGT GATGGAAGGA CCGGTTGCCG TACAGGGCAA 

351 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 

4 01 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 

4 51 CAGCAGCGCG TCGGCATTGC CCGAGCATTG GCGATTCAGC CCGAGCTGAT 

501 GTTGTTTGAC GAACCCACTT CCGCGCTTGA CCCCGAGTTG GTGCAAGACG 

551 TGTTGAACGC CATGAAGGAA TTGGCGCGGG AAGGTTGGAC GATGGTCGTC 

601 GTTACCCACG AAATCAAGTT CGCGCTGGAA GTTGCCACGA CCGTTGTCGT 

651 GATGGACGGC GGCGTTATCG TAGAGCAGGG CAGCCCGAAA GAGTTGTTCG 

701 ACCACCCCAA ACACGAACGG ACGCGGAGAT TTTTAAGCCA AATCCAATCT 

751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 450; ORF 122.a>: 

al22.pep 

1 WMIKIRNIH KTFGKNTILR GINLDVCKGQ VVVILGPSGS GKTTFLRCLN 

51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSGM VFQQYNLFPH 

101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 

151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLNAMKE LAREGWTMVV 

201 VTHEIKF ALE VATTVWMDG GVIV EQGSPK ELFDHPKHER TRRFLSQIQS 

251 TKI* 

ml22/a!22 96.0% identity in 253 aa overlap 

10 20 30 40 50 60 

ml22.pep VVMIKIRN I HKTFGENTILRGIDLDVCKGQWVILGPSGSGKTTFLRCLN ALEMPEDGQI 

I | | | | | | I I I I I I I : I I I I I I I : I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al22 VVMIKIRN I HKT FGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml22.pep EFDNERPLKI DFSKKPSKHDILALRRKSXMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 

I I I I I I I I ! I I I ! I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 122 EFDNERPLKI DFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml22 . pep QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 
! I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I II I I I I I I I I I I I I I I I I 
al22 QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml22 . pep VQDVLDXMKE LAQEGWTMVVVTHE I KFALEVATTX VVMDXGVI VEQG S PQDL FDHPKHER 

Mill: I I I 1 I : I I I I I I I I I I II I I I I I I I I I MM II I I I II II : : I I M I II I I 
a 122 VQDVLNAMKELAREGWTMVVVTHE I KFALEVATTVVVMDGGVI VEQG SPKEL FDHPKHER 

190 200 210 220 230 240 



ml22 .pep 



250 

TRRFLSQIQSTKIX 
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al22 



I I I I I I I I I I I I I I 
TRRFLSQIQSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 1>: 



1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACCATTTT 

51 GCGCGGCATC GATTTGGATG TGGGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCGGGTAAA ACAACATTTC TGCGCTGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGCGC GGCCGTTACG 

201 CATTGATTTT TCCAAAAAAA CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAGTCCGG AATGGTATTC CAACAATACA ACCTCTTCCC GCATAAAACC 

301 GTGTTGGAAA ACGTGATGGA AGGGCCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTTTGAAACT GCTGGAAAAA GTCGGCTTGG 

4 01 GCGATAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

4 51 CGTGTCGGTA TCGCCCGCGC ACTGGCGATT CAGCCTGAAT TGATGCTGTT 

501 TGACGAACCC ACTTCCGCGC TGGACCCCGA GTTGGTGCAA GACGTGTTGG 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCACGCT GGAAGTTGCC ACGAACGTCG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 TCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTGCCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 452; ORF 122-1. ng>: 

gl22-l.pep 

1 MIKIRNIHKT FGENTILRGI DLDVGKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNARPLRIDF SKKTSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 VLENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151- RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDAMKELA REGWTMVVVT 

201 HEIKFTLEVA TNVVVMDGGV IVEQGSPKEL FDHLKHERTR RFLSQIQSAK 

251 I* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 45 3>: 

ml22-l .seq 

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACTATTTT 

51 GCGCGGCATC GATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTAATGGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGCGTCGGCA TTGCCCGCGC ATTGGCGATT CAGCCTGAAC TGATGCTGTT 

501 TGACGAACCG ACTTCCGCGC TCGATCCTGA ATTGGTGCAA GATGTTTTGG 

551 ATACCATGAA GGAATTGGCG CAAGAAGGCT GGACCATGGT TGTCGTTACG 

601 CATGAAATCA AGTTCGCCTT AGAAGTGGCA ACCACCGTCG TCGTGATGGA 

651 CGGCGGCGTT ATTGTCGAAC AAGGCAGCCC GCAAGATTTG TTCGACCACC 

701 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 454; ORF 122-1>: 

ml22-l .pep 

1 MIKIRNIHKT FGENTILRGI DLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDTMKELA QEGWTMVVVT 
201 HEIKFALEVA TTVVVMDGGV IVEQGSPQDL FDHPKHERTR RFLSQIQSTK 
251 I* 

inl22-l/gl22-l 94.8% identity in 251 aa overlap 



gl22-l .seq 



ml22-l .pep 



gl22-l 



10 20 30 40 50 60 

MIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 

I | J | 1 | I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
MIKIRNIHKTFGENTILRGIDLDVGKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml22-l.pep DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 

i I 111:1111 1 I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I i I I 

gl22-l DNARPLRIDFSKKTSKHDILALRRKSGMVFQQYNLFPHKTVLENVMEGPVAVQGKPAAQA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml22-l pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 I 

gl22-l REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml22-l pep DV L DTMKELAQEG WTM V VVTH E I K FAL E V AT T V V VMDGG V IVEQGSPQDLFDHPKHERTR 
| M I : I I I I I : t I I I ! I I i I 1 I I I I : I M I I : I I I I M I I I I I I I I I I I I I I I I I I I 
g!22-l DVLDAMKELAREGWTMVVVTHEIKFTLEVATNVVVMDGGVIVEQGSPKELFDHLKHERTR 

190 200 210 220 230 240 



250 

ml22-l .pep RFLSQIQSTKIX 
I I I I I I I I : I I I 
g!22-l RFLSQIQSAKIX 
250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 455>: 

al22-l . seq 

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTCGGCAAAA ATACCATTTT 

51 GCGCGGCATC AATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTGATGGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GC GACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

4 51 CGCGTCGGCA TTGCCCGAGC ATTGGCGATT CAGCCCGAGC TGATGTTGTT 

501 TGACGAACCC ACTTCCGCGC TTGACCCCGA GTTGGTGCAA GACGTGTTGA 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCGCGCT GGAAGTTGCC ACGACCGTTG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

751 ATTTGA 



This corresponds to the amino acid sequence <SEQ ID 456; ORF 122-1. a>: 

al22-l .pep 

1 MIKIRNIHKT FGKNTILRGI NLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLNAMKELA REGWTMVVVT 
201 HEIKFALEVA TTVVVMDGGV IVEQGSPKEL FDHPKHERTR RFLSQIQSTK 
251 I* 

al22-l/ml22-l 97.2% identity in 251 aa overlap 



10 20 30 40 50 60 

al22-l . pep MIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 
| | | | | | I I I I I I : I I I I I I I : I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml22-l MIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 

10 20 30 40 50 60 

70 80 90 100 110 120 

al22-l .pep DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
I | I | || | | | | | I | I I I I I I I I II I I II I I II I I I I f I I I I I I I I I I I I I I I I ! I I I I I I I 
ml22-l DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 

70 80 90 100 110 120 

130 140 150 160 170 180 

al22-l.pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
I | | | | I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I 
ml22-l REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 



WO 99/57280 



PCT/US99/09346 



353 



130 140 150 160 170 180 

190 200 210 220 230 240 

al22-l.pep D VLN AMKELAREGWTM V V VTHE I K FALEVATTVVVMDGGVT VEQGS PKEL FDH PKHERT R 
I i I :: I I I I I : I I I I I I I I I It I I I I I f I I I I I I I I I t I I I I I I I I I :: I I I I I I I M I I 
m 122-l DVLDTMKELAQEGWTMVVVTHEIKFALEVATTVVVMDGGVI VEQGS PQDLFDHPKHERTR 

190 200 210 220 230 240 

250 

al22-l.pep RFLSQIQSTKIX 
I I I I I t I I I I I I 
ml22-l RFLSQIQSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 7>: 

gl25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGGT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TCGCCCCCTT GGGCTGGCAG CGCGGTCTGG CGGCCCTGCT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GTGTGCGCCT GTCGTTCGGC AAATGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGTCGGCGC AacggTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACggc gaATCCTTTG TCTGGTGGGC ATTGGCAAAC GGCGCACTGA 

4 01 TCGTGCTGTG GCTGGTTTTC GGCGCACGCA GAACGGGCGG GCTGAAAACC 

451 GTTTCGATGC TGCTGATGCT GCTTGCCGTG TTGTGGTTGA GCGTCGAAGT 

501 GTTCGCTTCG TCCGGCACAA ACGCCGCGCC CGCCGTTTCA GACGGCATGA 

551 CCTTCGGAAC GGCAGTCGAA CTGTCCGCCG TCATGCCGCT TTCCTGGCTG 

601 CCGCTGGCCG CCGACTACAC GCGCCAAGCA CGCCGCCCGT TTGCGGCAAC 

651 CCTGACGGCA ACGCTCGCCT ATACGCTGAC GGGCTGCTGG ATGTATGCCT 

701 TGGGTTTGGC GGCGGCTCTG TTTACCGGAG AAACCGACGT GGCGAAAATC 

751 CTGTTGGGCG CGGGCTTGGG CATAACGGGC ATTCTGGCAG TCGTCCTCTC 

801 CACCGTTACC ACAACGTTTC TCGATACCTA TTCCGCCGGC GCGAGTGCGA 

851 ACAACATTTC CGCGCGTTTT GCGGAAATAC CCGTCGCTGT CGGCGTTACC 

901 CTGatccgca ccgtgcttgc cgtcatgctg cccgttaccg aatataaaaa 

951 cttcctgctg cttatccgct cggtatttgg gccgatggcg ggtggttttg 

1001 attgccgaCT TTTttgtctt AAAACGGCGT GA 

This corresponds to the amino acid sequence <SEQ ID 458; ORF 125.ng>: 



The following partial DNA sequence was 


ml25 . seq 






1 


ATGTCGGGCA 


ATGCCTCCTC 


51 


TTGGTTCGGC 


GCGGCGGTAT 


101 


TTGCGCCTTT 


GGGCTGGCAG 


151 


GCCGTCGGCG 


GCGCGCTGTT 


201 


CGGACGCAGC 


TCGATGGAAA 


251 


CAGTGCTGTT 


TTCCGTGGCG 


301 


GTGATGATTT 


ACGCCGGCGC 


351 


GTGGGACGGC 


GAATCTTTTG 


401 


TTGTGCTGTG 


GCTGGTTTTC 


451 


GTTTCGATGC 


TGCTGATGCT 


501 


CTTTTCCACG 


GCAGGCAGCA 


551 


TCGGAACGGC 


AGTCGAGCTG 


601 


CTTGCCGCCG 


ACTACACGCG 


651 


GACGGCAACG 


CTCGCCTACA 


701 


GTTTGGCAGC 


GGCGTTGTTC 


751 


CTGGGCGCAr 


GTTTGgGTGC 


801 


CGTTACCACA 


ACGTTTCTCG 



identified in N. meningitidis <SEQ ID 459>: 

TCCTTCATCT TCCTCCGCCA TCGGGCTGAT 
CGATTGCCGA AATCAGCACG GGTACGCTGC 
CGCGGTCTGG CGGCTCTACT TTTGGGTCAT 
TTTTGCGGCG GCGTATATCG GCGCACTGAC 
GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 
AATATGCTGC AACTGGCCGG CTGGACGGCG 
AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 
TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 
GGCGCACGCA AAACAGGCGG GCTGAAAACC 
GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 
CCGCCGCACA GGTTTCAGAC GGCATGAGTT 
TCCGCCGTGA TGCCGCTTTC CTGGCTGCCG 
CCACGCGCGC CGCCCGTTTG CGGCAACCCT 
CGCTGACCGG CTGCTGGATG TATGCCTTGG 
ACCGGAGAAA CCGACGTGGC AAAAATCCTG 
GGCAGGCATT TTGGCGGTCG TCCTCTCCAC 
ATGCCTATTC CGCCGGCGCG AGTGCGAACA 



WO 99/57280 



PCT/US99/09346 



354 



8 51 ACATTTCCGC GCGTTTTGCG GAAACACCCG TCGCTGTCrG CGTTACCCTG 

901 ATCGGCACGG TACTTGCCGT CATGCTGCCC GTTACCGAAT ATGAAAACTT 

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCGGgC GGTTTTGATT 

10 01 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 

This corresponds to the amino acid sequence <SEQ ID 460; ORF 125>: 

ml2 5 .pep 

1 MSGNASSPSS SSAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH 

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSMLL MLLAV LWLSAEVF ST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL 

251 LGAXLGAAGI LAWLSTVTT TFLDAYSAGA SANNISARF A ETPVAVXVTL 

3 01 IGTVLAVMLP VTEYENFLLL IGSVFAPMAG GFDCRLFRLE TA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 125 shows 92.1% identity over a 343 aa overlap with a predicted ORF (ORF 125.ng) 

from N. gonorrhoeae: 
ml25/gl25 

10 20 30 40 50 60 

MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

M M II I i MIMM Ml i lllllll MUM III M I IMIMIM 1 1 MM M IMM 

MSGNASSPSSSAAIGLWFGAAVSIAEISTGTLLAPLGWQRGIAALLLGHAVGGALFFAA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AYIGALTGRSSMESWLSFGKRGSVLFSVANMLQLAGWTAVM I YAGATVS SALGKVLWDG 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AYIGALTGRSSMESVRLS FGKCGSVLFSVANMLQLAGWTAVM I YVGATVS SALGKVLWDG 
70 80 90 100 110 120 

130 140 150 160 . 170 179 

ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQ-VS 

1 1 M 1 1 MIMIMMMMMIMIMIMI MIIM II I II Ml I M-MM 1 1 

ESFVWWALANGALIVLWLVFGARRTGGLKTVSMLLMLLAVLWLSVEVFASSGTNAAPAVS 
130 140 150 160 170 180 

180 190 200 210 220 230 239 

DGMS FGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATIJVYTLTGCWMYALGLAAAL 

II MM M 1 1 II I M I II 1 1 1 1 1 III MM II II I II 1 1 1 1 1 M I M M II MM 1 1 1 M 

DGMTFGTAVELSAVMPLSWLPLT^ADYTRQARRPFAATLTATLAYTLTGCWMYALGLAAAL 
190 200 210 220 230 240 

240 250 260 270 280 290 299 

FTGETDVAKILLGAXLGAAGILAWLSTVTTTFLDAYSAGASANNISARFAETPVAVXVT 

IIIMIIIIIMII II Ml II II 1 1 IM I MM MM I M II M 1 1 II 1 1 1 II 1 1 II 

FTGETDVAKILLGAGLGITGILAWLSTVTTTFLDTYSAGASANNISARFAEIPVAVGVT 
250 260 270 280 290 300 

300 310 320 330 340 

LIGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 

II III I MM I MMMIM I MIMMIMMIM hill 

LIRTVLAVMLPVTEYKNFLLLIRSVFGPMAGGFDCRLFCLKTAX 
310 ' 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 46 1>: 

al25. seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACACTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTGCT TTTGGGTCAT 



ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml25.pep 
gl25 
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151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGCCGGCGC AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 

4 01 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC 

4 51 GTTTCGATGC TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 

501 CTTTTCCACG GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 

551 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC TTGGCTGCCG 

601 CTGGCCGCCG ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT 

651 GACGGCAACG CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG 

701 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG 

7 51 CTGGGCGCAG GTTTGGGTGC GGCAGGCATT TTGGCGGTCG TCCTGTCGAC 

801 CGTTACCACC ACTTTTCTCG ATGCCTACTC CGCCGGCGTA AGTGCCAACA 

851 ATATTTCCGC CAAACTTTCG GAAATACCCA TCGCCGTTGC CGTCGCCGTT 

901 GTCGGCACAC TGCTTGCCGT CCTCCTGCCC GTTACCGAAT ATGAAAACTT 

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCG . GC GGTTTTGATT 

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 

This corresponds to the amino acid sequence <SEQ ID 462; ORF 125.a>: 

al25 .pep 

1 MSGNASSPSS SAAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH 

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSMLLMLLAV LWLSAEV FST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL 

251 LGAGLGAAGI LAVVL STVTT TFLDAYSAGV SANNISAKLS EIPIAVAVAV 

301 VGTLLAVLLP VTEYENFLLL IGSVFAPMAX GFDCRLFRLE TA* 



ml25/a!25 95.6% identity in 342 aa overlap 

10 20 30 40 50 60 

ml25 . pep MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 
I I I I I I I I t I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | N | | | j | | | | | | | | { 
al25 MSGN AS SPSS SAAIGLIWFGAAVS I AE I STGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml25 . pep AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVS SALGKVLWDG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I M I I I I I I I I I I I I I I I I | | | | | | | | | | j 
al25 AYIGALTGRS SMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m!25 . pep ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQVSD 
I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | 
al25 ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQVSD 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml25 . pep GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF 
I I I I I I I M I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | j | | 
al25 GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml25 . pep TGETDVAKILLGAXLGAAGILAVVLSTVTTTFLDAYSAGASANNISARFAETPVAVXVTL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I i I I I ::: I I : I I I : : 
al25 TGETDVAKILLGAGLGAAGILAVVLSTVTTTFLDAYSAGVSANNISAKLSEIPIAVAVAV 

250 260 270 280 290 300 

310 320 330 340 

ml25 . pep IGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 
: I I : I I I : I I I I I I I I I I I I I I I I I I I I I MINIMUM! 
al25 VGTLLAVLLPVTEYENFLLLIGSVFAPMAXGFDCRLFRLETAX 
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310 320 330 340 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 463>: 

gl26 . seq 

1 AtgccgtcTG AAaccCcaaa ggcACGCCGC CGGCTTTCAG ACGGCATCGC 

51 GTCCGACAAC CATACCAAAG AATCCATCAT GCTCACCctg tacggcGAAA 

101 CTTTCCCTTC GCGGCTGCTg ctcggcacgG cggcctacCC GACCCCTGAA 

151 ATCCTCAAAC AATCCGTCCG AACCGCCCGG CCCGCGATGA ttaccGTCTC 

201 GCTGCGCCGC ACGGGATGCG GCGGCGAGGC GCACGGTCAG GGGTTTTGGT 

251 CGCTGCTTCA AGAAACCGGC GTTCCCGTCC TGCCGAACAC GGCAGGCTGC 

3 01 CAAAGCGTGC AGGAAGCGGT. AACGACGGCG CAAATGGCGC GCGAAGTGTT 

3 51 TGAAACCGAT TGGATAAAAT TGGAACTCAT CGGCGACGAC GACACCTTGC 

4 01 AGCCGGACGT GTTCCAACTC GTCGAAGCGG CGGAAATCCT GATTAAAGAC 
4 51 GGCTTCAAAG TGCTGCCTTA TTGCACCGAA GACCTGATTG CCTGCCGCCG 
501 CCTGCTCGAT GCGGGCTGTC AGGCGTTGAT GCCGTGGGCG GCTCCCATCG 
551 GCACGGGTTT GGGGGCGGTT CACGCCTATG CGCTCAAAAT CCTGCGCGAA 
601 CGCCTGCCCG ACACGCCGCT GATTATCGAC GCGGGCTTGG GTTTGCCTTC 
651 CCAAGCGGCA CAAGTGATGG AATGGGGTTT TGACGGCGTA TTGTTAAACA 
701 CCGCCGTTTC CCGCAGCGGC GACCCCGTCA ACATGGCGCG CGCCTTCGCA 
751 CTCGCCGTCG AATCCGGACG GCTGGCATTT GAAGCCGGGC CGGTCGAAGC 
801 GCGAACCAAA GCCCAAGCCA GCACGCCGAC AGTCGGACAA CCGTTTTGGC 
851 ATTCGGCGGA ATATTGA 

This corresponds to the amino acid sequence <SEQ ID 464; ORF 126.ng>: 

gl26 .pep 

1 MPSETPKARR RLSDGIASDN HTKESIMLTL YGETFPSRLL LGTAAYPTPE 

51 ILKQSVRTAR PAMITVSLRR TGCGGEAHGQ GFWSLLQETG VPVLPNTAGC 

101 QSVQEAVTTA QMAREVFETD WIKLELIGDD DTLQPDVFQL VEAAEILIKD 

151 GFKVLPYCTE DLIACRRLLD AGCQALMPWA APIGTGLGAV HAYALKI LRE 

201 RLPDTPLIID AGLGLPSQAA QVMEWGFDGV LLNTAVSRSG DPVNMARAFA 

251 LAVESGRLAF EAGPVEARTK AQASTPTVGQ PFWHSAEY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 465>; 

ml26.seq (partial) 

1 . . CACTATACAA AGGAACCCAT TATGCTCACC CTATACGGCG AAACTTTCCC 

51 CTCGCGGCTG CTGCTCGGCA CGGCTGCCTA CCCGACCCCC GAAATCCTCA 

101 AACAATCCAT CCAAACCGCC CAGCCTGCGA TGATTACCGT CTCGCTGCGC 

151 CGCGCGGGAA GCGGCGGCGA GGCGCACGGT CAGGGGTTTT GGTCGCTGCT 

201 TCAAGAAACC GGCGTTCCCG TCCTGCCGAA CACGGCAGGC TGCCAAAGCG 

251 TGCAGGAAGC GGTAACGACG GCGCAAATGG CGCGCGAAGT GTTTGAAACC 

301 GATTGGATAA AATTGGAACT CATCGGAGAT GACGACACCT TGCAGCCGGA 

3 51 TGTGTTCCAG CTTGTCGAAG CGGCGGAAAT CCTGATTAAA GACGGCTTCA 

401 AAGTGCTGCC TTATTGCACC GAAGACCTGA TTGCCTGCCG CCGCCTGCTC 

451 GACGCGGGCT GTCAGGCGTT GATGCCGTGG GCGGCCCCGA TCGGCACGGG 

501 TTTGGGCGCG GTTCACGCCT ACGCGTTGAA CGTCCTGCGC GAACGCCTGC 

551 CCGACACGCC GCTGATTATC GACGCGGGCT TGGGTTTGCC CTCACAGGCG 

601 GCACAAGTGA TGGAATGGGG CTTTGACGGC GTGCTTTTGA ATACTGCCGT 

651 TTCCCGCAGC GGCGATCCGG TCAATATGGC ACGCGCCTTC GCACTCGCCG 

701 TCGAATCCGG ACGGCTGGCA TTTGAAGCCG GACCGGTCGA AGCACGCGAC 

751 AAAGCGCAAG CCAGCACGCC GACAGTCGGA CAACCGTTTT GGCATTCGGC 

801 GGAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 466; ORF 126>: 

ml26 .pep (partial) 

1 . . HYTKEPIMLT LYGETFPSRL LLGTAAYPTP EILKQSIQTA QPAMITVSLR 

51 RAGSGGEAHG QGFWSLLQET GVPVLPNTAG CQSVQEAVTT AQMAREVFET 
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101 DWIKLELIGD DDTLQPDVFQ 

151 DAGCQALMPW AAPIGTGLGA 

201 AQVMEWGFDG VLLNTAVSRS 

251 KAQASTPTVG QPFWHSAEY* 



LVEAAEILIK DGFKVLPYCT EDLIACRRLL 
VHAYALNVLR ERLPDTPLI I DAGLGLPSQA 
GDPVNMARAF ALAVESGRLA FEAGPVEARD 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 126 shows 95.9% identity over a 269 aa overlap with a predicted ORF (ORF 126.ng) 
from N. gonorrhoeae: 

ml26/gl26 



10 20 30 . 40 

ml26 .pep HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQ 

-III M I I !! I I I I I I [ I I I | I M I I I I I M | | | ::| |: 
g 1 2 6 MPSETPKARRRLSDGI ASDNHTKES IMLTLYGETFPSRLLLGTAAYPTPE I LKQSVRTAR 

10 20 30 40 50 60 



50 60 70 80 90 100 

ml2 6 . pep PAMITVSLRRAGSGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

1 1 1 ! 1 1 1 1 1 ! = f I J ! i 1 1 1 f ! M 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 [ I j I j I f I M 1 1 1 f 1 1 E 1 1 1 

g 1 2 6 PAMI TVSLRRTGCGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

70 80 90 100 110 120 



110 120 130 140 150 160 

ml26 . pep WIKLELIGDDDTLQPDVFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWA 

i i i i 1 1 f 1 1 1 1 1 1 1 1 1 } i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 j 1 1 1 f 1 1 1 r 1 1 1 1 1 1 f 1 1 

g 1 2 6 • W I KLELI GDDDTLQPDVFQLVEAAE I LI KDGFKVLP YCTEDLI ACRRLLDAGCQALMPWA 

130 140 150 160 170 180 



170 180 190 200 210 220 

ml26 . pep APIGTGLGAVHAYALNVLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 

MM IM 1 1 1 Ml IMM MIMMIMM Mill MMM M M 1 1 MM IMIMM I 

gl2 6 API GTGLGAVHAYALKI LRERL PDT PL I IDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 

190 200 210 220 230 240 



230 240 250 260 270 

ml2 6 . pep DPVNMARAFALAVESGRLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

MM Mill Mill I MIMMMMM II M M M 1 1 1 1 1 1 M M 1 1 

gl2 6 DPVNMARAFALAVESGRLAFEAGPVEARTKAQASTPTVGQPFWHSAEYX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 467>: 

al26 . seq 

1 TTGTTAATCC ACTATACAAA GGAACCCATT ATGCTCACCC TGTACAGCGA 

51 AACTTTCCCT TCGCGGCTGC TGCTCGGCAC AGCCGCCTAC CCGACCCCTG 

101 AAATCCTCAA ACAATCCGTC CGAACCGCCC GGCCCGCGAT GATTACCGTC 

151 TCGCTGCGCC GCGCGGGATG CGGCGGCGAG GCGCACGGTC AGGGGTTTTG 

201 GTCGCTGCTT CAAGAAACCG GCGTTCCCGT CCTGCCGAAC ACGGCAGGCT 

251 GCCAAAGCGT GCAGGAAGCG GTAACGACGG CGCAAATGGC GCGCGAAGTG 

301 TTTGAAACCG ATTGGATTAA ACTCGAACTC ATCGGCGACG ACGACACCTT 

351 GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC GGCGGAAATC CTGATTAAAG 

4 01 ACGGCTTCAA AGTGCTGCCT TATTGCACCG AAGACCTGAT TGCCTGCCGC 

4 51 CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG ATGCCGTGGG CGGCCCCGAT 

501 CGGCACGGGT TTGGGCGCGG TTCACGCCTA CGCGTTGAAC GTCCTGCGCG 

551 AACGCCTGCC CGACACGCCG CTGATTATCG ACGCGGGCTT GGGTTTGCCC 

601 TCACAGGCGG CACAAGTGAT GGAATGGGGC TTTGACGGCG TGCTTTTGAA 

651 TACTGCCGTT TCCCGCAGCG GCGATCCGGT CAATATGGCA CGCGCCTTCG 

701 CACTCGCCGT CGAATCCGGA CGGCTGGCAT TTGAAGCCGG ACCGGTCGAA 

751 GCACGCGACA AAGCGCAAGC CAGCACGCCG ACAGTCGGAC AACCGTTTTG 

801 GCATTCGGCG GAATATTGA 



WO 99/57280 



PCT/US99/09346 



358 



This corresponds to the amino acid sequence <SEQ ID 468; ORF 126.a>: 

al2 6.pep 

1 LLIHYTKEPI MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV 

51 SLRRAGCGGE AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV 

101 FETDWIKLEL IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR 

151 RLLDAGCQAL MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP 

201 SQAAQVMEWG FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAG PVE 

251 ARDKAQASTP TVGQPFWHS A EY * 

ml26/al26 98.1% identity in 269 aa overlap 

10 20 30 40 50 

HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGE 
i f I I I I M I I i I : I I.I I I I I I I I I I I i i I I I I I I I I :: I I : I I I I I I I ! I I I I | | j 
LLIHYTKEPIMLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGE 
10 20 30 40 50 60 

60 70 80 90 100 110 

AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD 
I I I I I M I I ! I I I I M I I I I I I I II I I I I I I I I II I I I I I I I I | I | | M I I I I I I I I I I I 
AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD 
70 80 90 100 110 120 

120 130 140 150 160 170 

VFQLVEAAEI LI KDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 

I I I I I I I I I I I I I I I I I M I I II I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 
130 140 150 160 170 180 

180 190 200 210 220 230 

VLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG 
I I I I I I I I I I ! I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I II I 
VLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG 
190 200 210 220 230 240 

240 250 260 270 

RLAFEAG PVEARDKAQASTPTVGQPFWHSAEYX 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
RLAFEAG PVE AR DKAQAS T PT VGQPFWHS AE YX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 469>: 

gl26-l . seq 

1 ATGCTCACCC TGTACGGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 GGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCACGGGATG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAC GTGTTCCAAC TCGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ATGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCTCCCAT CGGCACGGGT TTGGGGGCGG TTCACGCCTA 

501 TGCGCTCAAA ATCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCT TCCCAAGCGG CACAAGTGAT GGAATGGGGT 

601 TTTGACGGCG TATTGTTAAA CACCGCCGTT TCCCGCAGCG GCGACCCCGT 

651 CAACATGGCG CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG GCCGGTCGAA GCGCGAACCA AAGCCCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 470; ORF 126-l.ng>: 

gl26-l .pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRTGCGGE 



ml26 .pep 
al26 

ml26.pep 
al26 

ml26 .pep 
al26 

ml26.pep 
a!26 

ml26.pep 
al26 
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51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALK ILRERLPDTP LI IDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARTKAQASTP 

251 TVGQPFWHSA EY* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47 1>: 

ml26-l . seq 

1 ATGCTCACCC TATACGGCGA AACTTTCCCC TCGCGGCTGC TGCTCGGCAC 

51 GGCTGCCTAC CCGACCCCCG AAATCCTCAA ACAATCCATC CAAACCGCCC 

101 AGCCTGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGAAG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGAGATG ACGACACCTT GCAGCCGGAT GTGTTCCAGC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

"7 51 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 5; ORF 126-1>: 

ml26-l .pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSI QTAQPAMITV SLRRAGSGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LI IDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 

ml26-l/gl26-l 96.9% identity in 262 aa overlap 

10 20 30 40 50 60 

ml26-l.pep MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL 
I Mil I1IIMII II llll !IM!Mlll::||:||||ltil M:| I I I i I j ! I I | | | j 
gl26-l MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGEAHGQGFWSLL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml2 6-l.pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
M I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I || | | | | | | | | | | | | | | j | | | | | | | | | | | 
gl26-l QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml26-l .pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I :: | I | | | | | | | 
gl26-l LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALKILRERLPDTP 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml26-l .pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | 
gl26-l LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

190 200 210 220 230 240 

250 260 
ml26-l .pep ARDKAQASTPTVGQPFWHSAEYX 

II I I I I I I I I I I I I I I I I I I I I 
gl26-l ARTKAQASTPTVGQPFWHSAEYX 

250 260 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 473>: 

al26-l.seq 

1 ATGCTCACCC TGTACAGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 AGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGATG CGGCGGCGAG 
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151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

?S1 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATTAA ACTCGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC 

151 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 474; ORF 126-1. a>: 

al26-l *P GP LysETFp SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRAGCGGE 
51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 
101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 
151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 
201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 
251 TVGQPFWHSA EY* 

al26-l/ml26-l 98.1% identity in 262 aa overlap 

10 20 30 40 50 60 

.19 6-1 nPD MLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGEAHGQGFWSLL 
al26 l.pep MLTLYS ,,,,,,,,,,, ,,,,,, s s ,, : ,,,, | | | , I ,, | | I I I I I I i I I I I I 

ml ?6 1 MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL 

10 20 30 40 50 60 

70 80 90 100 110 120 

al26-l Dep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
al26 l.pep QETfc ( ,,,,,,,,, M | , | ,[ | | | | | | | | I || I I I I 1 I I I I I I I I I M I 

ml? 6-1 qetgvPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 

70 80 90 100 110 120 



al26-l.pep 



130 140 150 160 170 180 

LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 

niiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiii'niiiiiiiiiiiiii 



ml 2 6-1 LIKDGFKVLP YCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 

130 140 150 160 170 180 

190 200 210 220 230 240 

_ n o 6— 1 DSD LI I DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
al26 l.pep ^^LG « « ( , ,,,,, ,,,,,, I I I I I I I I I 1 1 I I t 1 I I t 1 t I 

ml 26 1 LI I DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

m 190 200 210 220 230 240 

250 260 
a 12 6- l.pep ARDKAQASTPTVGQPFWHSAEYX 
I I I I I I I i I I I II I t I I i I I I I I 
ml2 6-l ARDKAQASTPTVGQPFWHSAEYX 
250 260 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 475>: 

gl27.seq ^^^^ GGAATATGTT GAACACTTGG CCCGATGCCG TCCCGATACG 

51 CGCGGAGGCG GCCGAATCCG TGGCGGCGGT CGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAGAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATT CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACAAAAGAA 

301 CTGATTATGT GTCTGTCGGG CAGTATTTTA aggtctGCCA CCCAGCAATA 

3 51 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

4 01 ACATCAATCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 
451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 
501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 
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5 51 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 TCAGCGGTAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGGCC GCGCGTTACC CGCGTACCGT ACGACGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCATC 

851 CCGCCGgctc cgAAACACTT TAA 

This corresponds to the amino acid sequence <SEQ ID 476; ORF 127.ng>: 

ql27 .pep 

1 MEIWNMLNTW PDAVPIRAEA AES VAAVAAL LLARALLLNI HFRRHPDFGI 

51 ESKRRFLVAS R NITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRY LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 477>: 

ml27 . seq 

1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCGCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGCGGT TGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

2 01 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 
251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

3 01 CTGATTATGT GTCTGTCGGG CAGTATTTTA AGGTCTGCCA CCCAGCAATA 
351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTGG 

4 01 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 
451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 
501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 
551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 
601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 
651 CCAACGGsAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 
701 CCGCCAGACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 
751 ATCATCGTCC GCTTCGCTTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 
8 01 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCACC 
851 CCGCCGGCTC CGAAACACTT TAA 

This corresponds to the amino acid sequence <SEQ ID 478; ORF 127>: 

ml27 .pep 

1 MEIWNMLDTW LGAVPIRAEA VES VAAVAAL LLARALLLNI HFKRHPDFGI 

51 ESKRRFLVAS R NITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRX LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 127 shows 97.9% identity over a 290 aa overlap with a predicted ORF (ORF 127.ng) 
from N. gonorrhoeae: 

ml27/gl27 



ml27 .pep 
gl27 



10 20 30 40 50 60 

ME I WNMLDTWLGAVP I RAEAVES VAAVAALLLARALLLNI HFKRHPDFGI ES KRRFLVAS 

IIIMIhll 1 1 1 1 1 1 1 1 » 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 hMIIIIIIIIIIIIMI 

ME IWNMLNTWPDAVP I RAEAAESVAAVAALLLARALLLN I HFRRHPDFGI ESKRRFLVAS 
10 20 30 40 50 60 



70 80 90 100 110 120 

RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAWVATKELIMCLSGSILRSATQQYSVG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 f 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

gl27 RKITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVWATKELIMCLSGSILRSATQQYSVG 



ml27 .pep 
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70 



80 



90 



100 



110 



120 



ml27 .pep 
gl27 

m!27 .pep 
gl27 

ml2 7 .pep 
gl27 



130 140 150 160 170 180 

DYIEINGLRGRWDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 

M 1 1 1 1 1| M MIMIItlMIIMM IMMII II II IMMM 1 1 1 1 II II i IMIII 

DYIEINGLRGRWDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 
130 140 150 160 170 180 

190 200 210 220 230 240 

VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT 

1 1 } 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 MilllllllllllllHM 

VI HTVE I PVP IHLDSDEAVCRLKAVLEPLCAPY I PAIQRYLENVQAEKLFI TPAARPRVT 
190 200 210 220 230 240 

250 260 270 280 290 

RVP YDDKAYRI I VRFAS P VS KRLE I QQ AVMDE FLRVQ YRLLNHP AGS ETLX 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 « 

RV P YDDKAYR 1 1 VRFAS PVS KRLE I QQAVMDE FLRVQ YRLLNHP AGS ETLX 
250 260 270 280 290 



The following partial DNA sequence was 

al27 .seq 

1 ATGGAAATAT GGAATATGTT 

51 TGCGGAGGCG GTCGAATCCG 

101 GCGCCCTTCT GTTGAATATC 

151 GAAAGCAAGC GGCGGTTTTT 

201 GGTGCTGTTT TCGCTGGCAT 

251 CTTTGTCGAT GTTTGCGGTG 

301 CTGATTATGT GTCTGTCGGG 

351 CTCGGTCGGC GACTATATCG 

4 01 ACATCAACCT GTTGAACACG 

4 51 GTCGGACAGC TTGCGGGAAC 

501 GAGCCACCCC GTGCGCCGCG 

551 CGGTCGAAAT CCCGGTTCCC 

601 CGTCTGAAAG CCGTACTCGA 

651 CCAACGGCAT TTGGAAAACG 

701 CCGCCAAACC GCGCGTTACC 

7 51 ATCATCGTCC GCTTCGCCTC 

801 GGCGGTTATG GACGAATTTT 

851 CCGCCGGCTC CGAAACACTT 



identified in N. meningitidis <SEQ ID 479>: 



GGACACTTGG 
TGGCGGTGGT 
CACTTCAAAC 
GGTTGCCAGC 
TTATCTGGTC 
GCGGCGGCGG 
CAGCATTTTA 
AAATCAACGG 
CTGATGATGC 
CACCGTTTCT 
ACAATATTTT 
ATCCATTTGG 
GCCCTTGTGC 
TGCAGGCGGA 
CGCGTGCCGT 
CCCCGTTTCA 
TGCGCGTACA 
TAA 



CTCGGTGCCG 
CGCGGCTTTG 
GGCATCCGGA 
CGCAATATAA 
GGCGCAAATC 
TCGTCGTGGC 
AGGTCTGCCA 
CCTGCGCGGG 
AGGTCGGTCC 
TTCCCCAACA 
GGGCGACTAC 
ATTCGGATGA 
GCGCCCTACA 
AAAACTGTTT 
ACGATGACAA 
AAGCGGCTGG 
ATACCGCCTG 



This corresponds to the amino acid sequence <SEQ ID 480; ORF 

al27 .pep 
1 
51 
101 
151 



MEIWNMLDTW LGAVPIRAEA VES VAVVAAL LLARALLLNI 
ESKRRFLVAS RN ITLLLVLF SLAFIWSA QI Q TLALSMFAV 



TCCCGATACG 
CTGCTGGCGC 
TTTCGGCATC 
CGCTGCTTTT 
CAAACGCTGG 
GACGAAGGAA 
CCCAGCAATA 
CGCGTGGTCG 
GAACCCCTTG 
GCCTGTTGTT 
GTCATCCATA 
AGCCGTATGC 
TCCCCGCCAT 
ATCACGCCCG 
GGCATACCGC 
AAATCCAACA 
TTAAATTACC 



127.a>: 

HFKRHPDFGI 
AAAVWATKE 



201 
251 



LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT 
VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP 
RLKAVLEPLC APYIPAIQRH LENVQAEKLF ITPAAKPRVT 
IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNYPAGSETL 



LMMQVGPNPL 
IHLDSDEAVC 
RVPYDDKAYR 



ml27/al27 98.6% identity in 290 aa overlap 

10 20 30 40 50 60 

ml27 pep ME I WNMLDT WLGAV PI RAE AVE SVAAVAALLLARALLLN I HFKRHPDFGI ESKRRFLVAS 

I | I M I M I M M I M I UMM M:M I I I 11 I I I I I I M M M M M I II ti I I I ill 
MEIWNMLDTWLGAVPIRAEAVESVAVVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS 
10 20 30 4 0 50 



al27 

ml27 .pep 
al27 



60 



70 80 90 100 110 120 

RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQQYSVG 

I I I I I 1 I I I I I I ! I I I I I I I I t I I I I I I M I I I I M 1 I I I I I I I I I i t I i I I i I I I I I I I 

RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQQYSVG 
70 80 90 100 110 120 
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130 140 150 160 170 180 

ml27 . pep DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 

i I I I I I I 1 1 I I I I I I I 1 1 1 1 I 1 1 I I I I I I I I I I I ! I II I | (l | | | | || | | | | | | | 

al27 DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml27 . pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I ! I! I: M I I 
al27 VIHTVEI PVPIHLDSDEAVCRLKAVLEPLCAPYI PAIQRHLENVQAEKLFITPAAKPRVT 

190 200 210 220 230 240 

250 260 270 280 290 

ml27 . pep RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX 
I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I : I | | | | | | | 
al27 RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDE FLRVQYRLLNYPAGSETLX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 48 1>: 

gl28.seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 
51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 
101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 
151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 
201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 
251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 
301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

4 01 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

5 01 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 GC CGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGccaaa cTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

10 51 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

12 01 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

13 01 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
1451 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 
1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 
1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 
1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 
1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG ' CCGAAGTCCt 
1851 CAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 
1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 
1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
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2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 482; ORP 128.ng>: 



ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 
KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 
QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 
KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 



gl28 .pep 




1 


MIDNALLHLG 


51 


NTVERLTGIT 


101 


GQDIELYNRF 


151 


ELAKLQTEGA 


201 


AAQSEGKTGY 


251 


KFDNTANIDR 


301 


ARRAKPYAEK 


351 


EVKKYFPVGK 


401 


I GGVYMDL YA 


451 


GKEARLSHDE 


501 


FMENFVWEYN 


551 


FALFDMMIYS 


601 


AGGYSAGYYS 


651 


AAESFKAFRG 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 483>: 

ml28 . seq (partial) 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 

1 TACGCCAGCG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 

51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 

101 AAmTCAAAAA ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 

151 TGGCACAAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG GCGAAmCCAT 

2 01 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG 
251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 

3 01 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 

3 51 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 
401 * CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 

4 51 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 

5 01 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 
551 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 
601 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 
701 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
751 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
951 nGCAGAATCC TTCAAAGCCT TCCGCGGCCG CGAACCGAGC ATAGACGCAC 

1001 TCTTGCGCCA CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 484; ORF 128>: 

ml28.pep (partial) 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA I KAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



// 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRFSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 
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301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 128 shows 91 .7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/gl28 

10 20 30 40 50 60 

gl28 . pep MI DNALLHLGEE PRFNQ I QTED I KPAVQTAI AEARGQI AAVKAQTHTGWANTVERLTGI T 

I III I II III III hi hi I II I I hi I I Ml II Nihil II I I III II II Mill 
ml 2 8 MTDNALLHLGEE PRFDQI KTEDI KPALQTAI AEAREQIAAI KAQTHTGWANTVE PLTG I T 

10 20 30 40 50 60 

70 80 90 100 110 120 

gl28 . pep ERVGRI WGWSHLNS WDTPELRAVYNELMPE I TVFFTE I GQDI ELYNRFKTI KNS PEFA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 i li 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 

ml28 ERVGR I WGVVSHLNCVADTPELRAVYNELMPE I TVFFTE I GQDI ELYNRFKTI KNS PEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 

gl28 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

lllllllllhl 
ml 2 8 TLS PAQKTKLNH 

130 

// 

340 350 360 

gl2 8 . pep YAGEKLREAKYAFSETEVKKYFPVGKVLAG 

Ihlllllllllllll Mill III II I 

ml2 8 Y AS E KLRE AKY AFS ETX VKKY F P VGX VLNG 

10 20 30 

370 380 390 400 410 420 

gl28 . pep LFAQIKKliYGIGFAEKTVPWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWMNDYK 

MM 1 1 1 1 M 1 1 H 1 1 1 1 1 1 1 1 1 1 1 1 II III MM II III II I III Ml Ml Ml Ml 

ml28 LFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK 

40 50 60 70 80 90 

430 440 450 460 470 480 

gl 2 8 . pep GRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEI LTLFHETGHGLHHLLTQVDELGV 

I I I I I : I I I I I I I I II I I II I I I I II h I I I I I I I I I I I II I II I II I M I I II I I I II 
ml 2 8 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILIIiFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 

490 500 510 520 530 540 

gl28 . pep SGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

MUM I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I II II III 1 1 Ml I II Ml MM III 

ml28 SGINGVXWDAVELPSQFMENFWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQXGMF 

160 170 180 190 200 210 

550 560 570 580 590 600 

gl28.pep LVRQMEFALFDMMI YSESDECRLKNWQQVLDSVRKEVAVI QPPE YNRFANSFGH I FAGGY 

III IIMMIMIIhll I i 1 1 1 1 i M 1 1 1 1 1 = 1 1 1 1 1 1 1 11 II 1 1 MINIM II 

ml2 8 XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY 

220 230 240 250 260 270 

610 620 630 640 650 660 

gl28 . pep SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQE I LAVGGSRSAAES FKAFRGREPS 

I h 1 1 1 1 1 1 1 1 1 h II I II I II 1 1 1 II N I II 1 1 1 1 1 1 1 1 1 1 1 h II 1 1 II 1 1 II M I 

ml28 SAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRGREPS 

280 290 300 310 320 330 
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670 679 
gl28.pep . IDALLRQSGFDNAAX 

I I I I I I : I I I I I h 
ml28 IDALLRHSGFDNAVX 

340 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 485>: 

a!28.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

8 01 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
8 51 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAG CGCGG AC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 486; ORP 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLN FLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
I I I I I t I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
al2 8 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 



70 80 90 .100 110 120 

ml28 . pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I I I II I I I I I : I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
al28 ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 



130 

ml28 . pep TLSPAQKTKLNH 

I I I I I I I I I I I 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



ml28.pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 



ml28.pep 

al28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

140 150 

ml 2 8 .pep YASEKLREAKYAFSETXVKKYFPVGX 

I I : I I I II I I I I I I I I I I I I I I I I 
al28 ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

.160 170 180 190 200 210 

ml 2 8 . pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
I I I I I I II I I I I I I I I I I I I I I I I I ! II I I I I I II I I : I I I I I I I I I I I I I I I I I I I I 
al28 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 .pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I I I I I I I I I II I I I I I II I I I 
al28 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

280 290 300 310 320 330 

ml28 . pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I 1 I I I I I I I II I I I I II I 
al28 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

340 350 360 370 380 390 

ml 2 8 . pep XGMFXVRQXEFALFDMMI YSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALS FGHI F 

Mi III I I I I I I I I I I I I I I I I I I I I I I I I M I I I I : I I I : : I I I I I I I I I I I I I I 
al28 RGMFLVRQME FALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNR FANS FGHI F 

550 560 570 580 590 600 
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400 410 420 430 440 450 

ml2 8 . pep AGGYS AAX YS YAWAE VLSADAYAAFEE S DDVAATGKR FWQE I LAVGXSRSGAE S FKAFRG 

MUM: MMI I I I I II I II I II I I M M I I ! I I II I Ml I I I I M I : I I I II M I I 

AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 



al28 



460 470 
ml28 . pep REPS I DALLRHSGFDNAVX 

I I I I I I I I I ! I I I I I I I : 
al28 REPSI DALLRHSG FDNAAX 

670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 487>: 

gl28-l. seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGG AC A AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

4 01 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

12 01 ATCG GCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAACGGCGTA AAA 



This corresponds to the amino acid sequence <SEQ ID 488; ORF 128-1. ng>: 

gl28-l.pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQMV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 489>: 

ml28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 
51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 
101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 
151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 
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201 


GGGCGTGGTG 


TCGCACCTCA 


251 


CCGTCTATAA 


CGAACTGATG 


301 


GGACAAGACA 


TCGAGCTGTA 


351 


CGAATTCGAC 


ACCCTCTCCC 


401 


TGCGCGATTT 


CGTCCTCAGC 


451 


GAACTGGCAA 


AACTGCAAAC 


501 


CCAAAACGTC 


CTAGACGCGA 


551 


CCGCACCGCT 


TGCCGGCATT 


601 


GCCGCGCAAA 


GCGAAAGCAA 


651 


ACACTACCTC 


GCCGTCATCC 


701 


AAATCTACCG 


CGCCTACGTT 


751 


AAATTCGACA 


ACACCGCCAA 


801 


AACCGCCAAA 


CTGCTCGGCT 


851 


CCAAAATGGC 


GGACACGCCC 


901 


GCCCGCCGCG 


CCAAACCCTA 


951 


CTTCGCCCGC 


GAAAGCCTGA 


1001 


GCTACGCCAG 


CGAAAAACTG 


1051 


GAAGTCAAAA 


AATACTTCCC 


1101 


CCAAATCAAA 


AAACTCTACG 


1151 


TCTGGCACAA 


AGACGTGCGC 


1201 


ATAGGCGGCG 


TTTATATGGA 


1251 


CGCGTGGATG 


AACGACTACA 


1301 


TGCAACTGCC 


CACCGCCTAC 


1351 


GGCAGGGAAG 


CCCGCCTGAG 


1401 


AAC CGG AC AC 


GGGCTGCACC 


1451 


TATCCGGCAT 


CAACGGCGTA 


1501 


T TT ATGG AAA 


ATTTCGTTTG 


1551 


CCACGAAGAA 


ACCGGCGTTC 


1601 


TCGCCGCCAA 


AAACTTCCAA 


1651 


TTCGCCCTCT 


TTGATATGAT 


1701 


GAAAAACTGG 


CAACAGGTTT 


1751 


TCCAGCCGCC 


CGAATACAAC 


1801 


GCAGGCGGCT 


ATTCCGCAGG 


1851 


GAGCGCGGAC 


GCATACGCCG 


1901 


CAGGCAAACG 


CTTTTGGCAG 


1951 


GCGGCAGAAT 


CCTTCAAAGC 


2001 


ACTCTTGCGC 


CACAGCGGTT 



ACTCCGTCGC CGACACGCCC GAACTGCGCG 
CCCGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAACTC AACCACGATC 
GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 
CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
CCGACGCGTT CGG CATTT AC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
AACAGGCTAC AAAATCGGCT TGCAGATTCC 
AATACGCCGA CAACCGCGAA CTGCGCGAAC 
ACCCGCGCCA GCGAACTTTC AGACGACGGC 
CATCGACCGC ACGCTCGCAA ACGCCCTGCA 
TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
GAACAAGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
ACCTCGCCGA TTTGCAACCG TGGGACTTGG 
CGCGAAGCCA AATACGCGTT CAGCGAAACC 
CGTCGGCAAA GTATTAAACG GACTGTTCGC 
GCATCGGATT TACCGAAAAA ACCGTCCCCG 
TATTTTGAAT TGCAACAAAA CGGCGAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGTTTTTCA GACGGCACGC 
CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 
CCACGACGAA ATCCTCATCC TCTTCCACGA 
ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
GAATGGGACG CGGTCGAACT GCCCAGCCAG 
GGAATACAAT GTCTTGGCAC AAATGTCAGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGCATGT TCCTCGTCCG GCAAATGGAG 
GATTTACAGC GAAGACGACG AAGGCCGTCT 
TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 
CGCTTCGCCT TGAGCTTCGG CCACATCTTC 
CTATTACAGC TACGCGTGGG CGGAAGTATT 
CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
CTTCCGCGGC CGCGAACCGA GCATAGACGC 
TCGACAACGC GGTCTGA 



This corresponds to the amino acid sequence <SEQ ID 490; ORF 128-1>: 

ml28-l.pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG . 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 
451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

5 01 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAV* 



m!28-l/gl28-l 94.5% identity in 4 91 aa overlap 



10 20 30 40 50 60 

gl28~l .pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 

I llllllllllllhlllllllllhllllllll II Ihlllllllllll I! Illl! 

ml28-l MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 



70 80 90 100 110 120 

gl28-l .pep ERVGRIWGWSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

i I III Ml 1 1 1 M 1 1 h I II 1 1 1 MM II MINIMI MM MINI HIM Ml I M 

ml28-l ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 
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130 140 150 160 170 180 

gl28-l .pep 
ml28-l 



gl28-l .pep 
ml28-l 



gl28-l .pep 
ml28-l 



gl28~l .pep 
ml28-l 



g!28-l .pep 
ml28-l 



gl28-l .pep 
ml28-l 



gl28-l .pep 
ml28-l 

490 500 510 520 530 540 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 49 1>: 

al28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

2 51 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

6 01 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 
651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

8 01 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 



TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

I I I I I I I I I I : I I I I I I I I I I I I I I I I = I I I I I I I I I I I I I I I I I I I I I I I I I I I | | M I 
TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 

190 200 210 220 230 240 

FDDAAP LAG I P EDALAMFAAAAQ S EG KTG YKI GLQ I PHYLAVI Q YAGNRELREQI YRA YV 

II 1 1! I Mill III llllillll Ihllllllllll III lllll II illllllllllll 

FDDAAPLAGIPEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQIYRAYV 
190 200 210 220 230 240 

250 260 270 280 290 300 

TRASELSNDGKFDNTANIDRTLENALKTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

Mllllhlll IIIIIMIIII llhlllllllllllllllimiMIMIIIIIMI 

TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKI^AELSI^TKMADTPEQVLNFLHDL 
250 260 270 280 290 300 

310 320 330 340 350 360 

ARRAKPYAEKDLAEVKAFAREHLGLADPQPWDLSYAGEKLREAKYAFSETEVKKYFPVGK 

III 1 1 1 1 1 1 II 1 1 1 1 1 1 II II hill IMIhlhlllllMIIIMIIIIIIIIIII 

ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

370 380 390 400 410 420 

VLAGLFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWM 

I I II MINI lllllhlll IIIIIIIIIIIIIIIIHIIIIIIIIIIIIII illil I 

VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAV7M 
370 380 390 400 410 420 

430 440 450 460 470 480 

NDYKGRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

III II lllh'lll MMIII UIIIIMMIillUII IMIIIIIIIMIIIII 

NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

490 
ELGVSG INGVK 
llllllllll: 

ELGVSGINGVEWDAVELPSQFMENFWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
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1251 CGCGTGG ATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

15 01 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 
1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 
1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 
1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 
1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 
1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 
18 51 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 492; ORF 128-1. a>: 

al28-l.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 

6 01 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 



ml28-l/al28-l 97.8% identity in 677 aa overlap 



10 20 30 40 50 60 

al2 8 - 1 . pep MTDNALLHLGEE PRFDQ I KTED I KPALQTAI AEAREQI AAI KAQTHTG WANTVE PLTG I T 

I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
ml 2 8 - 1 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 



70 80 90 100 110 120 

al28-l .pep ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

ii i : 1 1 ii i in ii i m i ii iihi I! i iii ii iii hum i ill i ii 1 1 1 Mill 1 1 

ml28-l ERVGRI WG WSHIiNS VADTPELRAVYNELMPE I TVFFTE IGQDI ELYNRFKTI KNS PEFD 

70 80 90 100 110 120 



130 140 150 160 170 180 

al2 8 - 1 . pep TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGI Y 

I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I II I II I I I I I 
ml2 8-1 TLS PAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFG I Y 

130 140 150 160 170 180 



190 200 210 220 230 240 

al28-l.pep FDD AA P LAG I P ED ALAM F AAAAQS EG KTG Y KI GLQ I PH YLA V I Q YADNRKLRE Q I YRA YV 

1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II II h 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 

ml 2 8 - 1 FDDAAPLAGI PEDALAMFAAAAQSESKTGYKI GLQ I PHYLAV I QYADNRELREQI YRA YV 

190 200 210 220 230 240 



250 260 270 280 290 300 

al2 8 - 1 . pep TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 2 8 - 1 TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVIjNFLHDL 
250 260 270 280 290 300 



310 320 330 340 350 360 

al28-l . pep ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
I I I I II I I I I I I I I I I I I I II I h I I I I I I I I I I I I = I I I I I I I I I I I I I I I I I I I I I I I 
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ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

370 380 390 400 410 420 

VLNGL F AQ I KKL YG IG F T E KT VP VWHKD VR YF E LQQNG E T I G G VYMD L Y AR E G KRGG A WM 

1 1 1 1 ii 1 1 ii 1 1 i i ii 1 1 1 ii 1 1 1 in mi 1 1 1 mi 1 1 1 nun i ii i ii i ii 1 1 1 1 

VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

430 440 450 460 470 480 

NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

1 1 1 1 1 1 1 Ml 1 1 1 1 II 1 1 III I II hi 1 1 Ihl 1 1 II I II 1 1 I III 1 1 1 1 1 1 III 1 1 1 1 

NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

490 500 510 520 530 540 

ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

1 1 1 1 II I II 1 1 1 1 1 II 1 1 1 IIMI III 1 1 IN 1 1 MM I II I II II 1 1 1 1 1 Mill 1 1 1 1 

ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

550 560 570 580 590 600 

RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 

i ii ii MMMMiii i liiiiiiiiiiiiiiiiiM nmimi nun mum 

RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
550 560 570 580 590 600 

610 620 630 640 650 660 

AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 

1 1 1 II I MM 1 1 1 Ml II III I II Ml I Ml I II MIMMI M II 1 1 MM I III III I 

AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 

670 679 
RE PSIDALLRHSGFDNAAX 

1 1 1 II M I M I II 1 1 1 M 

REPSIDALLRHSGFDNAVX 
670 

al28-l/ P44573 

sp|P44573|OPDA_HAEIN OLIGOPEPTIDASE A >gi | 1075082 | pir || C64055 oligopeptidase A (prlC) homolog 
- Haemophilus influenzae (strain Rd KW20) 

>gi | 1573174 (U32706) oligopeptidase A (prlC) [Haemophilus influenzae Rd] Length » 681 
Score = 591 bits (1507) , Expect = e-168 

Identities = 309/677 (45%), Positives = 415/677 (60%), Gaps = 4/677 (0%) 



Query : 


4 


NALLHLGEEPRFDQIKTEDIKPALQTXXXXXXXXXXXXXXXTHTGWANTVEPLTGITERV 


63 






N LL++ P F QIK E I+PA++ H W N + PLT +R+ 




Sbjct : 


5 


NPLLNIQGLPPFSQIKPEHIRPAVEKLIQDCRNTIEQVLKQPHFTWENFILPLTETNDRL 


64 


Query : 


64 


GRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFDTLS 


123 






R W VSHLNSV ++ ELR AY +P ++ + T +GQ LYN + +KNS EF S 




Sbjct : 


65 


NRAWSPVSHLNSVKNSTELREAYQTCLPLLSEYSTWVGQHKGLYNAYLALKNSAEFADYS 


124 


Query: 


124 


HAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIYFDD 


183 






AQK + + LRDF LSG L E+Q + + ++L+++FS NVLDAT + + + 




Sbjct : 


125 


IAQKKAIENSLRDFELSGIGLSEEKQQRYGEIVARLSELNSQFSNNVLDATMGWEKLIEN 


184 


Query: 


184 


AAPLAGI PEDALAMFAAAAQSEGKTGYKIGLQI PHYLAVIQYADNRKL.REQI YRAYVTRA 


243 






A LAG+PE AL +A+S+G GY+ L+IP YL V+ Y +NR LRE++YRAY TRA 




Sbjct: 


IBS 


EAELAGLPESALQAAQQSAESKGLKGYRFTLEIPSYLPVMTYCENRALREEMYRAYATRA 


244 


Query: 


244 


SELSDD - GKFDNTAN IDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDIiAR 


302 






SE + GK+DN+ ++ L ++ AKLLGF Y ELSLATKMA+ P+QVL+FL LA 




Sbjct: 


245 


SEQGPNAGKWDNSKVMEEILTLRVELAKLLGFNTYTELSLATKMAENPQQVLDFLDHLAE 


304 


Query: 


303 


RAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGKVL 


362 






RAKP EK+L E+K + + G+ +L PWD+G+ EK ++ YA ++ E++ YFP +V+ 




Sbjct: 


305 


RAKPQGEKELQELKGYCEKEFGVTELAPWDIGFYSEKQKQHLYAINDEELRPYFPENRVI 


364 



m!28-l 

al28-l .pep 
ml28-l 

al28-l .pep 
ml28-l 

al28-l .pep 
ml28-l 

al28-l .pep 
ml28-l 

al28-l .pep 
ml28-l 

al28-l .pep 
ml28-l 
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Query: 363 NG L F AQI K KLYG I G FTE - KT VP VWH KDVR YFE L - QQNG ET I GG VYMD L YAREG KRG G AWM 420 

+GLF IK+++ I E K V WHKDVR+F+L +N + G Y+DLYARE KRGGAWM 
Sbjct: 365 SG L F E LI KR I FN I RA VERKG VDTWHKD VRF FD L I D ENDQLRGS F YLD L YAREHKRG G AWM 424 

Query: 4 21 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEIXXXXXXXXXXXXXXXXQVD 4 80 

+D GR+R DG + + + P AYL CNF P+G K A +H+E+ Q+D 
Sbjct: 425 DDCIGRKRKLDGSIETPVAYLTCNFNAPIGNKPALFTHNEVTTLFHEFGHGIHHMLTQID 484 

Query: 481 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 540 

V+GINGV WDAVELPSQFMEN+ WE LA +S H ETG PLPKE ++L AKNFQ 
Sbjct: 485 VSDVAGINGVPWDAVELPSQFMENWCWEEEALAFISGHYETGEPLPKEKLTQLLKAKNFQ 544 

Query: 541 ' RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 600 

MF++RQ+EF +FD ++ D + L SV+ +VAV++ ++ R +SF HIF 

Sbjct: 545 AAMFI LRQLEFGI FDFRLHHTFDAEKTNQI LDTLKS VKSQVAVI KGVDWARAPHSFSH I F 604 

Query: 601 XXXXXXXXXXXXWAE VLS AD A Y AAF E E S DD V - AATG KR F WQ E I LA VGG S R S AAES FKAF R 659 

WAEVLSADAY+ FEE TGK F EIL GGS E FK FR 

Sbjct: 605 AGGYAAGYYSYLWAEVLSADAYSRFEEEGIFNPITGKSFLDEILTRGGSEEPMELFKRFR 664 

Query: 660 GREPS IDALLRHSGFDN 6 76 

GREP +DALLRH G N 
Sbjct: 665 GREPQLDALLRHKGIMN 681 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 493>: 



gl29 . seq 




1 


ATGCTTTCAC 


51 


TTCATTTGCG 


101 


ATCGTGCAGC 


151 


CCGACGGCGG 


201 


TCAGGCTGCG 


251 


TCCGGACAAA 


301 


ATGCGTTACT 


351 


GCGCGCCATC 


401 


AATCAAAATG 


451 


ACATATCGAG 


501 


AACTTGA 



This corresponds to the amino acid sequence <SEQ ID 494; ORF 129.ng>: 

gl2 9 .pep 

1 MLSPPRRKTA AHQSSRLSFA CGKNAACCRD QNQYRAASSP NRGLPRFPIT 

51 PTAAAVHPYP RFRHLPFQAA GIGAEQAAVE SCFIRTNALA VGKSGRPCQI 

101 MRYFG RVLSF VSGGLFLRAI RIC LGAWQTA AAVQSKCLAI SCRQASGCRP 

151 TYRAGFCLSD LAAFRPVT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 495>: 

ml29.seq (partial) 

1 . . TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCGGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

2 01 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 
251 GCAGATAGGC ATCCGGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

3 01 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 496; ORF 129>: 

ml2 9 .pep (partial) 

1 ..YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGRLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSAXQ MVAAVQSKCL AISCRXASGC CPTYXAGFCL 

101 SDLTAFRPVT * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 129 shows 79.1% identity over a 1 10 aa overlap with a predicted ORF (ORF 129.ng) 
from N. gonorrhoeae: 

ml29/gl29 

10 20 30 

ml29 -pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINT 

I Ih'llllllllhlhlhllM: h 
gl29 RDQNQYRAASSPNRGLPRFPITPTAAAVHPYPRFRHLPFQAAGIGAEQAAVESCFIRTNA 
30 40 50 60 70 80 

40 50 60 70 80 90 

ml29 . pep LWGKFGRLCQIMRYFGRVLFFVSGGLFLRVIPICLSAXQMVAAVQSKCLAISCRXASGC 

hill II I II I II Mil I I ! I I I M I I = I I I hi I H II I I II II I III I I | | 
gl29 LAVGKSGRPCQIMRYFGRVLSFVSGGLFLRAIRICLGAWQTAAAVQSKCLAISCRQASGC 

90 100 110 120 130 140 

100 110 
ml 2 9 . pep CPTYXAGFCLSDLTAFRPVTX 
Ml M t I I I I f : I I I I I I I 
g 1 2 9 RPT YRAGFCLSDLAAFRPVTX 

150 160 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 497>: 

al29.seq (partial) 

1 TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCAGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCTGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 498; ORF 129. a>: 

al29.pep (partial) 

1 YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGQLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSA*Q MVAAVQSKCL AISCR*ASWC CPTY*AGFCL 

101 SDLTAFRPVT * 

ml29/al29 98.2% identity in 1 1 0 aa overlap 

10 20 30 40 50 60 

ml 2 9. pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLVVGKFGRLCQIMRYFGRVLFFVSGGLFLR 
I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I: I I I I I II I I I | | || | | | I | | j | 
al2 9 YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLVVGKFGQLCQIMRYFGRVLFFVSGGLFLR 

10 20 30 40 50 60 

70 80 90 100 110 

ml29.pep VIPICLSAXQMVAAVQSKCLAISCRXASGCCPTYXAGFCLSDLTAFRPVTX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 
al2 9 VIPICLSAXQMVAAVQSKCLAISCRXASWCCPTYXAGFCLSDLTAFRPVTX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 499>: 

gl30.seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG TGGATCGTTC GGCGATGTCG ATGCCACTAC GGAAGCGGCA 

151 ACGCAGACCC GCATCCAGCC TGTCGGACAA TTGACGATGG GTGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT. TTTCGGCAAA ATCTGTATCC 
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251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

3 01 AACGGCGACT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 
351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGCAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACCTACAT GGCGAATAAA 
4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 
501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 
551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 
601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 
651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 
701 AAGGCAAAGA AACCTTGCAC AAACATGCCC TTGAAGGCTT TAACGCGATG 
751 CCGGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 
801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 500; ORF 130.ng>: 

gl30.pep 



1 MKQLRDNKAQ G5ALF TLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAADLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 501>: 

m!30.seq (partial) 



1 . . GGCGAACAGA TTTTCGGCAA AATCTGTATC CAATGCCACG CGGCGGACAG 

51 CAATGTGCCG AACGCTCCGA AACTGGAACA CAACGGCGAT TrGGCACCGC 

101 GTATCGgCAA GGCTTCGATA CCTTGTTCCA ACACGCGCTG AACGGCTTTA 

151 ACGCCATGCC TGCAAAAGGC GGTGCGGCAG ACCTGACCGA TCAGGAACTT 

201 AAACGGGCGA TTACTTACAT GGCGAACAAA AGCGGCGGTT CTTTCCCGAA 

251 TCCTGATGAG GCTGCGCCTG CCGACAATGC CGCTTCAGGA ACAGCTTCTG 

3 01 CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG CGAAGGCAGA AGACAAGGGT 

351 GCGGCAcCCC TGCGGTCGGC GTTGACGGTA AAAAAGTCTT CGAAGCAACC 

401 TGTCAGGTGT GCCACGGCGG TTCGATTCCC GGTATTCCCG GCATAGGCAA 

451 AAAAGACGAT TGGGCACCGC GTAT CAAAAA AGGCAAAGAA ACCTTGCACA 

501 AACACGCCCT TGAAGGCTTT AACGCGATGC CTGCCAAArG CGgCAATGCA 

551 GGTTTGAGCG ATGACGAAgT CAAAGCGGCT GTTGACTATA TGGCAAACCA 

601 AT CCGGTGC A AAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 502; ORF 130>: 

ml30.pep (partial) 

1 . . GEQIFGKICI QCHAADSNVP NAPKLEHNGD XAPRIQGFDT LFQHALNGFN 



51 AMPAKGGAAD LTDQELKRAI TYMANKSGGS FPNPDEAAPA DNAASGTASA 

101 PADSAAPAEA KAEDKGAAPA VGVDGKKVFE ATCQVCHGGS IPGIPGIGKK 

151 DDWAPRI KKG KETLHKHALE GFNAMPAKXG NAGLSDDEVK AAVDYMANQS 

201 GAKF* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 130 shows 98.1% identity over a 206 aa overlap with a predicted ORF (ORF 130.ng) 
from N. gonorrhoeae: 
ml30/gl30 



ml30 .pep 



10 20 30 

GEQI FGKI CI QCHAADSNVPNAPKLEHNGD 



gl30 




50 60 70 80 90 100 



ml 30 .pep 



40 50 60 70 80 89 

XAPRI - QGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAI TYMANKSGGS FPNPDEAAP 



gl30 




HO 120 130 140 150 160 



WO 99/57280 



PCT/US99/09346 



376 



90 100 110 120 130 140 

ml 3 0 . pep ADNAASGTASAPADSAAPAEAKAEDKGAA- PAVGVDGKKVFEATCQVCHGGS I PGIPGIG 

1 1 1 1 i M 1 1 ! 1 1 1 ! I II I !! 1 1 1 1 1 II I II MM II IMMM II M ; 1 1 1! 1 1 1, 

gl3 0 ADNAASGTASAPADSAAPAEAKAEDKGAAAPAVGVDGKKVFE AT CQVCHGGS I PGIPGIG 

170 180 190 200 210 220 



150 160 170 180 190 200 

ml3 0 . pep KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 

I II I M M M I M 1 1 M M M M M II M I M M I M M M II M I M I M I M M 

gl3 0 KKI)DWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 503>: 

al30 . seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG CGGCTCGTTC GGCGATGTCG ATGCCACTAC GGAAGCAGCA 

151 ACGCAGACCC GTATCCAGCC TGTCGGACAA TTGACGATGG GCGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 

251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGATT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGTAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACTTACAT GGCGAACAAA 

4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 

601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 

701 AAGGCAAAGA AACCTTGCAC AAACACGCCC TTGAAGGCTT TAACGCGATG 

751 CCTGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 504; ORF 130.a>: 

al30 . pep 

1 MKQLRDNKAQ GSAL FTLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAVDLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKV FEAT CQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

ml30/al30 97.6% identity in 206 aa overlap 

10 20 30 

GEQI FGKICIQCHAADSNVPNAPKLEHNGD 
I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
DATTEAATQTRIQPVGQLTMGDG I PVGERQGEQI FGKICIQCHAADSNVPNAPKLEHNGD 
50 60 70 80 90 100 



ml30 .pep 
al30 



40 50 60 70 80 89 

ml30 .pep XAPRI-QGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 
(III I I I I I I I I I I I I I I I I I I I I I I I : I I I I M I I II I I I I I II I I I I I | | | | | | | | 
al30 WAPRIAQGFDTLFQHALNGFNAMPAKGGAVDLTDQELKRAITYMANKSGGSFPNPDEAAP 
110 120 130 140 150 160 



90 100 110 120 130 140 

ml 30 .pep ADNAASGTASAPADSAAPAEAKAEDKGAA- PAVGVDGKKVFEATCQVCHGGSI PGIPGIG 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I i I I I I I | | | | | | | 
al30 ADNAASGTASAPADSAAPAEAKAEDKGAAAPAVGVDGKKVFEATCQVCHGGS I PGIPGIG 

170 180 190 200 210 220 



ml30 .pep 



150 160 170 180 190 200 

KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 



WO 99/57280 



PCT/US99/09346 



377 



I I I I I I ! I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I M I I 

al30 KKDDWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 505>: 

gl32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT ATTAATATTA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACCTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CCAACTTcct CAgccGCTCG AccGccGTTG CAGCAACAtt 

201 tttcttTGca acctgcAtgg gctatggTgt atattcacac CCACACGACA 

251 AAACACGGTT TGGACTtcag caacataCGA CAGACTCAGC AagcACCCAA 

3 01 ACCcgtAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACagtTTTT CAAATgccga caTGgtga 

This corresponds to the amino acid sequence <SEQ ID 506; ORF 132.ng>: 

gl32 .pep 

1 MEAFKTLIWI INIISALAV I VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QHTTDSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 507>: 

ml32 . seq (partial) 

1 ATGGAACCCT TCAAAACCTT AATTTGGATT GTTAATTTAA TTTCCGCTTT 

51 GGCCGTCTTC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGA. . . 

This corresponds to the amino acid sequence <SEQ ID 508; ORF 132>: 

ml32.pep (partial) 

1 MEPFKTLIWI VNLISALAV F VLVLLQHGKG ADAGATFG . . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 132 shows 89.5% identity over a 38 aa overlap with a predicted ORF (ORF 132.ng) 
from N. gonorrhoeae: 

ml32/gl32 



10 20 30 

ml32 .pep MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG 

II I I I I I I I : I : I I I I I I : I I I I I II I I I I II I I I I I 
gl32 MEAFKTLIWI INI ISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 509>: 

al32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT GTTAATATAA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CTAACTTCCT CAGCCGCTCG ACCGCCGTTG CAGCAACATT 

201 TTTCTTTGCA ACCTGCATGg GCTATGGTGT ATATTCACAC CCACACGACA 

251 AAACACGGTT TGGACTTCAG CAACGTACAA CAAACTCAGC AAGCACCCAA 

301 ACCCGTAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACAGTTTTT CAAATGCCGA CATGGTGA 

This corresponds to the amino acid sequence <SEQ ID 510; ORF 132.a>: 

al32 .pep 

1 MEAFKTLIWI VNIISALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QRTTNSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 



ml32/al32 92.1% identity in 38 aa overlap 
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10 20 30 

ml32 . pep MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG 
II I I M I I I I I : I II I I I : I II II II I I I I I I 1 I I I I 
al32 MEAFKTLIWIVNIISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 51 1>: 

gl34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCCGATGCGG GTAAAACCAC GCTGACCGAA AAACTGCTGC 

101 TGTTTTCGGG CGCGATTCAA AGCGCAGGCA CGGTGAAAGG TAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

2 01 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTACAAAGAC CACACCGTCA 
251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

3 01 GTTTTAACCG CAGTGGACAG CGCCTTGATG GTCATCGACG CGGCAAAAGG 

3 51 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC CTGCGCGATA 
401 CGCCGATTGT TACCTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCT 

4 51 TTGGAACTCT TGGACGAAGT GGAAGACATC CTGCAAATCC GCTGCGCGCC 
501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG ACATCATCAA AGGCATAAAC AATCCCGAAT TGGAACAACG 
651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAATTTAAT CTCGacgaAT TTCTCGccgG CGAACTCACG 
751 CCAGTGTTCT TCGGCTCTGC GATTAACAAC TTCGGCATTC AGGAAATCCT 
801 CAATTCATTG ATTGACTGGG CACCCGCACC GAAACCGCGC GACGCGACCA 
851 TGCGCATGGT CGGGCCGGAC GAGCCGAAAT TTTCCGGATT TATCTTTAAA 
901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATCG CCTTCTTGCG 
951 CGTCTGCTCC GGTAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTAG TAACCTTCAT GTCGCACGAC 

1051 CGCGAACTGG CGGAAGAAGC CTACGCCGGC GACATCATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTTACCGG CATCCCATTC TTCGCGCCCG AACTGTTCCG CAGCGTCCGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGTT TGCAACAACT 

12 51 CGGCGAAGAA GGTGCGGTTC AAGTATTCAA ACCGATGAGC GGCGCGGATT 
1301 TGATTTTGGG TGCGGTCGGC GTGTTGCAGT TTGAAGTCGT AACCTCACGC 

13 51 CTCGCCAACG AATACGGCGT GGAAGCCGTG TTCGACAGCG CATCCATCTG 
1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCGGAATTTG 

14 51 AAAAAGCCAA CGCAGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 
1501 TACCTCGCCC CCAACCGCGT GAATTTGGGG TTGACGCAAG AACGCTGGCC 
1551 GGACATCGTG TTCCACGAAA CGCGCGAACA TTCGGTCAAA CTCTAA 

This corresponds to the amino acid sequence <SEQ ID 512; ORF 134.ng>: 

gl34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVEDI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGIN NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATMRMVGPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELAEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

4 51 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 513>: 

ml34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCGGG CGCGATTCAG AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACTTCCGA CTGGATGGAA ATCGAGAAGC AGCGCGGCAT 
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201 TTCCGTGGCA TCAAGTGTGA TGCAGTTCGA TTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

3 01 GTTTTAACCG CCGTGGACAG CGCATTAATG GTCATCGACG CGGCAAAAGG 
351 CGTGGAAGCG CAAACCATCA AGCTCTTAAA CGTCTGCCGC CTGCGCGATA 

4 01 CACCGATTGT TACGTTTATG AACAAATACG ACCGCGAAGT GCGCGATTCC 
451 CTGGAACTTT TGGACGAAGT GGAAAACATT TTAAAAATCC GCTGCGCGCC 
501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA TGAAATTTAT CTCTTTGAAG CTGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCTGAAT TGGAACAACG 
651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAGTTTAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 
751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 
801 CAATTCATTG ATTGACTGGG CGCCCGCGCC GAAACCGCGC GACGCGACCG 
851 TACGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 
901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 
951 CGTCTGCTCC GGCAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TTACCTTCAT GTCGCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACATTATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTCACCGG CATCCCATTC TTCGCACCCG AACTGTTCCG CAGCGTACGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGCT TGCAACAGCT 

1251 CGGCGAAGAA GGCGCGGTGC AGGTGTTCAA ACCGATGAGC GGCGCGGATT 

1301 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

13 51 CTCGCCAACG AATACGGCGT AGAAGCCGTG TTCGACAGCG CATCCATCTG 

14 01 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCTGAATTTG 
14 51 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 
1501 TACCTCGCCC CCAACCGCGT GAATTTGGGA CTCACGCAAG AACGTTGGCC 
1551 GGACATCGTG TTCCACGAAA CACGCGAACA TTCGGTCAAA CTGTAA 

This corresponds to the amino acid sequence <SEQ ID 514; ORF 134>: 

ml34 -pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWME IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVENI LKIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATVRMVEPD EPKFSGFIFK 

3 01 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 
351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 
451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 
501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 134 shows 98.7% identity over a 531 aa overlap with a predicted ORF (ORF 134.ng) 
from TV. gonorrhoeae: 



10 20 30 40 50 60 

MSQE I LDQVRRRRTFAI I SHPDAGKTTLTEKLLLFSGAI QSAGTVKGKKTGKFATSDWME 
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MSQE I LDQVRRRRTFAI I SHPDAGKTTLTEKLLLFSGAI QSAGTVKGKKTGKFATSDWMD 
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ml 3 4 -pep 
gl34 

ml34 .pep 
gl34 

ml34 .pep 
gl34 

ml34 .pep 
gl34 

ml34 -pep 
gl34 

ml34 .pep 
gl34 

ml34 .pep 
gl34 



QTI KLLNVCRLRDTP I VTFMNKYDREVRDS LELLDE VEN I LKI RCAPVTWP I GMGKNFKG 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | : | | : | | | | | | | | | | | | | | | | | | 
QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVEDILQIRCAPVTWPIGMGKNFKG 
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200 210 220 230 240 

V YH I LNDE I YLFEAGGERLPHEFDI I KGI DNPELEQRFPLEI QQLRDE I ELVQAASNE FN 
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VYHI LNDEI YLFEAGGERLPHEFDI I KGINNPELEQRFPLEIQQLRDEIELVQAASNEFN 
190 200 210 220 230 
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260 270 280 290 300 

LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 

HIM II II I II III Ml MINIM 1 1 II Mill IIMIIIhlM 1 1 1 1 1 1 1 1 1 1 1 1 

LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATMRMVGPDEPKFSGFIFK 
250 260 270 280 290 300 

310 320 330 340 350 360 

IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 

MMMIMMMMIMIMMIIMIMMMM MMMMMI MMMMMMM I 

IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELAEEAYAG 
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DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

M M M I M MMMM MMMMMI MMMM MMMM M M MM Ml MM Ml 

DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 
370 380 390 400 410 " 420 

4 30 440 450 460 470 480 

GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 

1 1 1 1 1 r i f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r i f r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 j 

GAVQVFKPMSGADLI LGAVGVLQFEWTSRLANEYGVEAVFDSAS I WSARWVSCDDKKKL 
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AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

MMMMMMMMMMMMMMMMMIMMMMMMMMM 

AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
490 500 510 520 530 



identified iniV. meningitidis <SEQ ID 515>: 



The following partial DNA sequence was 

al34.seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA 

101 TGTTTTCAGG TGCGATTCAA AGCGCGGGTA CGGTAAAAGG 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTATAAAGAC 

251 ACCTTTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA 

301 GTTTTGACCG CCGTCGATAG TGCCTTGATG GTCATCGACG 

351 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC 

4 01 CGCCGATTGT TACGTTCATG AACAAATACG ACCGCGAAGT 

4 51 CTGGAATTGC TGGACGAAGT GGAAAACATC CTGCAAATCC 

501 CGTAACCTGG CCGATCGGCA TGGGCAAAAA CTTCAAAGGC 

551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCCGAAT 

651 CTTTCCGTTA GAAATACAGC AGTTGCGCGA CGAAATCGAA 

701 CGGCTTCCAA CGAGTTCAAT CTCGACGAAT TCCTCGCCGG 

7 51 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC 

801 CAATTCATTG ATTGAATGGG CGCCCGCGCC GAAACCACGC 

851 TGCGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG 



CGTTTGCCAT 
AAACTCTTGC 
CAAGAAAACC 
AGCGCGGCAT 
CACACCGTCA 
CACCTACCGC 
CGGCAAAAGG 
CTGCGCAATA 
GCGCGATTCC 
GCTGCGCGCC 
GTGTACCACA 
ACGCTTGCCG 
TGGAACAACG 
TTGGTGCAGG 
CGAACTCACG 
AGGAAATCCT 
GATGCGACCG 
TATCTTCAAA 
CCTTCTTGCG 
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951 CGTCTGCTCC GGCAAATTCG 

1001 TCAACCGCGA AATCGCCGCC 

1051 CGCGAGCTGG TTGAAGAAGC 

1101 CCACGGCAAC ATCCAAATCG 

1151 CGTTTACCGG CATCCCATTC 

1201 ATCAAAAACC CGCTGAAAAT 

1251 TGGCGAAGAA GGTGCGGTGC 

1301 TGATTTTGGG CGCGGTCGGC 

1351 CTTGCCAACG AATACGGCGT 

14 01 GTCGGCGCGC TGGGTATCGT 

14 51 AAAAAGCCAA CGCGGGCAAC 

1501 TACCTCGCCC CTAACCGCGT 

1551 GGACATCGTG TTCCACGAAA 



AGCGCGGCAT GAAAATGAAA CACCTGCGTA 
TCCAGCGTGG TAACCTTCAT GTCCCACGAC 
CTACGCCGGC GACATTATCG GTATCCCAAA 
GCGACAGCTT CTCCGAAGGC GAACAACTGA 
TTCGCGCCCG AACTGTTCCG CAGCGTTCGC 
CAAGCAACTG CAAAAAGGTT TGCAACAGCT 
AGGTGTTCAA ACCAATGAGC GGCGCGGATT 
GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 
GGAAGCCGTG TTCGACAACG CATCCATCTG 
GCGACGACAA GAAAAAACTG GCGGAATTTG 
CTCGCCATCG ACGCGGGCGG CAACCTCGCC 
GAATCTGGGA CTCACGCAAG AACGCTGGCC 
CGCGCGAGCA TTCGGTCAAA CTTTAA 



This corresponds to the amino acid sequence <SEQ ID 516; ORF 134.a>: 

al34 . pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM V I DAAKG VEA QTIKLLNVCR LRNTPIVTFM NKYDREVRDS 

151 LELLDEVENI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IEWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLTFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

4 51 LANEYGVEAV FDNASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

ml34/a!34 98.9% identity in 531 aa overlap 
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ml34 .pep QTIKLLNVCRLRDTPIVT FMNKYDREVRDSLELLDEVENILKIRCAPVTWPIGMGKN FKG 
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ml34 .pep LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFI FK 
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ml34 . pep IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 
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DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 517>: 

gl35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCG 

51 TTCGGAcgGC AGCCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCTG CAGGGTTCGG CGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

3 01 CTGCTCAGCC GTGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

3 51 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCGATTCCC ATCATCAATG 

4 01 AAAACGACAC GGTTTCGGTT GAGGAGTTGA AAATCGGCGA CAACGACACA 

4 51 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 

5 01 GACCGACATA GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 
551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

6 01 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 
651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 
701 CCTCACTCAA ACCCGATTCA TTGGCCGAAG CCGCCGAACA TCAGGCGGAC 
751 GGCTCGTTTT TCGTcccCcg tgCCAAAGGT TTGCGGACAC AGAAGCAATG 
801 GctggCGTTC TATTCcgaaa gcggGGgcag cgttTAtgtg gacgaaagtg 
851 cggaacacgc tTtgtccgaa caagggaaag cctgCTGA 

This corresponds to the amino acid sequence <SEQ ID 518; ORF 135.ng>: 

gl35.pep 

1 MKYKRIVFKV GTSSITRSDG SLSRGKIQTI TRQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAIP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKI KAA TIAAESGVPV YICSSLKPDS LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESGGSVYV DESAEHALSE QGKAC* 

The following partial DNA sequence was identified ixiN. meningitidis <SEQ ID 519>: 

ml35 , seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCTGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCGCAAATC 

3 01 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

3 51 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC ATCATCAATG 

401 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 
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5 01. GACCGACATA GACGGTCTTT ACACGGGCAA CCCGAACAGC AATCCCGATG 
551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

6 01 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 
651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 
701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CTGCCGAACA TCAGGCGGAC 
751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 
8 01 GCTGGCGTTC TATTCCGAAA GCCGGGGCAG CGTTTATGTG GACGAAGGTG 
851 CGGAACACGC TTTGTCCGAA CAGGGGAAAA GCCTGCTGAT GTCGGGCATT 
901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 
951 CAAGG CAACC AAACAGCCCC TGGGCAAAGG GCGCGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATCGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC GAAATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 520; ORF 135>: 

ml35 .pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TCQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDA LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGSVYV DEGAEHALSE QGKSLLMSGI 

3 01 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KSRKAKGVFI 

3 51 HRDDWISITP EIRLLLTEF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 135 shows 97.6% identity over a 294 aa overlap with a predicted ORF (ORF 135.ng) 
from TV. gonorrhoeae: 

ml35/gl35 

10 20 30 40 50 60 

ml35 . pep MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG 

MilllMIMIIIIhlMIIIIIIIMII I i I I 1 I | I I I | | i 1 j I I | f | 1 f I | I f j | 

gl35 MKYKRIVFKVGTSSITRSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml3 5 . pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

9 135 FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml3 5 . pep SVLLQRRAVP 1 1 NEND TVS VEELKIGDNDTLSAQVAAM I QADLLVLLTD I DGLYTGNPNS 

HUM I Mil II IMIIIIMI MIIMMI IIMMMI MMIiMm M II: 

9135 S VLLQRRAI P 1 1 NENDTVS VEELKI GDNDTLS AQVAAMI QADLLVLLTD I DGLYTGNPNS 

130 140 150 160 170 180 

1^0 200 210 220 230 240 

ml35 . pep NPDAVRLDKI EHINHE 1 1 EMAGGSGS ANGTGGMLTKI KAAT I AAESGVPVY I CS S LKPDA 

■ m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r i f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m f 1 1 r 1 1 1 1 1 1 f 1 1 1 1 1 1 i i = 

913 5 NPDAVRLDKI EHINHE 1 1 EMAGGSGSANGTGGMLTKI KAAT IAAESGVPVYI CSS LKPDS 

19 0 200 210 220 230 240 

250 260 270 280 290 300 

ml35 . pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 

MIIIIMIIIIIMIIIMIIIIIIIMIMII Mllllhllllllllll: 

gl3 5 LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESGGSVYVDESAEHALSEQGKACX 

250 260 270 280 290 



310 320 330 340 350 



360 
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ml35 . pep AG I EGH FSRMDTVTVYS KATKQPLGKGRVLFGS AAAEDLLKS RKAKGVF IHRDDW I S I TP 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 52 1>: 

al35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

301 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC ATCATCAATG 

4 01 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

4 51 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 

501 GACCGACATA GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CAT CG AAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACA GGCGGTATGC TGACTAAAAT 

651 CAAAGCGGCG ACGATTGCGA CCGAGTCCGG CGTACCGGTC TATATCTGTT ' 

701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CGGCAGATAA TCAGGCGGAC 

751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 

801 GCTGGCGTTC TATTCCGAAA GCAGGGGCGG CGTTTATGTG GACGAAGGTG 

851 CGGAACACGC TTTGTCCGAA CAGGGAAAAA GCCTGCTGAT GTCGGGCATT 

901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 

951 CAAGGCAACC AAACAGCCTT TGGGCAAAGG GCGAGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATTGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC GAAATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 522; ORF 135.a>: 

al35 . pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TRQLAALHHA GHE LVLVSSG 

51 AVAAG FGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TPI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIATESGVPV YICSSLKPDA LAEAADNQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGGVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KLRKAKGVFI 

351 HRDDWISITP EIRLLLTEF* 

ml35/al35 98.4% identity in 369 aa overlap 

10 20 30 40 50 60 

ml35.pep MKYKRI VFKVGT S S ITHSDGSLSRGKIQT I TCQLAALHHAGHELVLVSSG AVAAG FGALG 

I M I I I i ! I It I I I I I I I I I i I I I! I I f I I I I I I I I I I I I I | I I I | | | | j | | | | 

al35 MKYKRIVFKVGTSSITHSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml35 . pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I | | | | | | | j | | | 
al35 FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml35.pep SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDIDGLYTGN PNS 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I | | | | | | 1 | | | 
al35 SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml35.pep NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIT^AESGVPVYICSSLKPDA 
I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I : I I I I I | | | | | | | | | | | 
al35 NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIATESGVPVYICSSLKPDA 

190 200 210 220 230 240 
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250 260 270 280 290 300 

ml35.pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 
I MM:: I I! M M I II I I I II I I I II II I! M I M: I M M I ] II I I | tl I I i Ml | M 
al35 LAEAADNQADGSFFVPRAKGLRTQKQWLAFYSESRGGVYVDEGAEHALSEQGKSLLMSGI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml35.pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 
M I I I I I I I I I I I I I I I I I I I I I I I | I I I | | | | | | | | | | | | I f ! I i I I I | | | | | | | ] | | 

al35 AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKLRKAKGVFIHRDDWISITP 

310 320 330 340 350 360 

370 

ml35 . pep EIRLLLTEFX 
M M M M I I 
al35 EIRLLLTEFX 

370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 523>: 

gl36.seq 

1 ATGGAAATCC GGTTTCAGAC AGCATTTTTA CGTTTGGTTC AGatgaAAAC 

51 AAACGCTtca aTTCTtaccg caACACGCCT TGTATTTCCT GccgCTGCCG 

101 CACGGACAGG GATCGTTCCT GCCGgtTTTT TCCCCTTCCC TGCGGACGGT 

151 TTGCGGTTTG TTGATGACCG CCTGCCAGTA GCGGTAGATG TCtgccagcg 

201 cgTAAGGCag tTCGGAcgca agttccgcca gctcgccttc ggTGAATTGC 

251 AGgcggataa cgccgtttTC CTCTTCGTCg taaatgccgc ccactgccat 

301 cacgGGGTAA AACAGCTCTT CAAACGCTTC ATCATCGGCG GCTTCAAACC 

351 AATCGGTCGG CACAATGTCC AAACCGTAAA GATAGGCGTT GCACCAAGTG 

401 TAAAAATCGC TGCCGCCCTC GCCGTCGTCG TAGAGCCACA AATCGGGCAG 

451 CTTTTTATCC GACATCGCGG CGGTTGTTTC CATCGCCATT GCCAAAACCA 

501 GCCGTTCGAT TTCGGAACGT TCGGCGGCGG TAAATTGCGA TTCGTCGCCC 

551 AACACTTCGG GCAGCCAGTC GAGCGGTGCC AATTTGTCCG GCCCGCTCAA 

601 CAGCGCCGTC ATAAAACCTT GAACCTCGTC GCAACGCATC GTGTTGCCTT 

651 GTTCGCTTTT GGCATCCAAT AA 

This corresponds to the amino acid sequence <SEQ ID 524; ORF 136.ng>: 

gl3 6.pep 

1 MEIRFQTAFL RLVQMKTNAS ILTATRLVFP AAAARTGIVP AGFFPFPADG 

51 LRFVDDRLPV AVDVCQRVRQ FGRKFRQLAF GELQADNAVF LFWNAAHCH 

101 HGVKQLFKRF IIGGFKPIGR HNVQTV KIGV APSVKIAAAL AVW EPQIGQ 

151 LFIRHRGGCF HRHCQNQPFD FGTFGGGKLR FVAQHFGQPV ERCQFVRPAQ 

201 QRRHKTLNLV ATHRVALFAF GIQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 525>: 

ml36 . seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGTTTGTT GATGACTGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGCG CCGTTTTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

3 01 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 
351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

4 01 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

4 51 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

5 01 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 
551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 
601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 
651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CAAATGGGTT 
701 TTGCGCCCTA TTATCGCCGC AATGCCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 526; ORF 136>: 
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ml 3 6 .pep 

1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRFV DDCLPVAVDI 
51 . RQCIRQLGFQ FRQLAFCELQ TDSAVFLFW NTAQCHDGIK QLFKRFIIDG 
101 FKPIGRHNIQ TV KISIAPCV KIAAAVFV FI QPQIGQFFIR HRGGCFHRHC 
151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 
201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF QMGFAPYYRR NAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 136 shows 85.6% identity over a 209 aa overlap with a predicted ORF (ORF 136.ng) 
from N. gonorrhoeae: 

ml36/gl36 

10 20 30 40 

METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPV 

hllilllMIMM lllllllllll II 1 1 1 1 1 1 1 1 1 f I Ml 

MEIRFQTAFLRLVQMKTNASILTATRLVFPAAAARTGIVPAGFFPFPADGLRFVDDRLPV 
10 20 30 40 50 60 

50 60 70 80 90 100 

AVDIRQCIRQLGFQFRQLAFCELQTDSAVFLFWNTA'QCHDGIKQLFKRFIIDGFKPIGR 

Mh I =lhl Mill II llhh! Mill IhUI hi MM MM Ml MM 

AVDVCQRVRQFGRKFRQLAFGELQADNAVFLFWNAAHCHHGVKQLFKRFIIGGFKPIGR 
70 80 90 100 110 120 

110 120 130 140 150 160 

HNIQTVKISIAPCVKIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLR* 

1 1 : 1 1 M h M I 1 1 M I h I : M 1 1 1 h 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

HNVQTVKIGVAPSVKIAAALAVWEPQIGQLFIRHRGGCFHRHCQNQPFDFGTFGGGKLR 
130 140 150 160 170 180 

170 180 190 200 210 220 

FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIH 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQX 
190 200 210 220 

230 240 
HFPFQMGFAPYYRRNAVX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 527>: 

al36 . seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGCTTGTT GATGACCGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGTG CCGTTGTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

4 51 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 

651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CCAATGGGTT 

701 TTGCGCCCTA TTATAGTGGA TTAAATTTAA ATCAGGACAA GGCGACGAAG 

7 51 CCGCAGACAG TACAAATAGT ACGGCAAGGC GAGGCAACGC CGTACTGGTT 

801 TAAATTTAAT CCACTATATC GCCGCAATGC CGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 528; ORF 136.a>: 

a!36 . pep 



m!36 .pep 
gl36 

ml36 .pep 
gl36 

ml3 6 .pep 
gl36 

ml 3 6 .pep 
gl36 

m!36 . pep 
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1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRLV DDRLPVAVDI 

51 RQCIRQLGFQ FRQ LAFCELQ TDSAVVLFVV NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVK ISIAPCV KIAAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF PMGFAPYYSG LNLNQDKATK 

251 PQTVQIVRQG EATPYWFKFN PLYRRNAV* 



ml36/al36 98.3% identity in 238 aa overlap 

10 20 30 40 50 60 

ml36.pep METNAS I LTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPVAVDI RQCIRQLGFQ 

I I I I I I I I I I I I I I I II I t I I I I I I I I I I I I II I I I I I : I I I I M I I I I I I I I I M I I I 
al36 METNAS I LTATRLVFSAAAARTG I VPACFFAFPADGLRLVDDRLPVAVDI RQCIRQLGFQ 

10 20 30 40 50 60 



70 • 80 90 100 110 120 

ml36.pep FRQLAFCELQTDSAVFLFVVNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 
I I I I I I I I 1 I I I I I I I I i I I I I I I I I I I II I II I I I I I M I I I I I I | I | | | | | || | | | | 
al36 FRQLAFCELQTDSAVVLFVVNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

70 . 80 90 100 110 120 



130 140 150 160 170 180 

m!3 6.pep KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I | I | I 
al36 KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml36 . pep FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFQMGFAPYYRR 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I 
al3 6 FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFPMGFAPYYSG 

190 200 210 220 230 240 



ml 3 6. pep NAVX 

al3 6 LNLNQDKATKPQTVQIVRQGEATPYWFKFNPLYRRNAVX 

250 260 270 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 529>: 



gl37 . seq 












1 


ATGATTATCC 


ATCACcaaTT 


CGATCCCGTC 


CTCATCAGTA 


TCGGCCCGCT 


51 


TGCCGTCCGC 


TGGTATGCCT 


TAAGCTACAT 


CCTCGGATTT 


ATTCTTTTTA 


101 


CCTTTCTCGG 


CAGAAGGCGC 


ATCGCGCAAG 


GCTTGTCCGT 


TTTTACCAAA 


151 


GAATCGCTCG 


ACGACTTCCT 


GACATGGGGC 


ATTTTGGGCG 


TGATTTTGGG 


201 


CGGACGCTTG 


GGCTATGTCC 


TGTTTTACAA 


ATTCTCCGAC 


TACCTCGCCC 


251 


ATCCGCTTGA 


TATTTTCAAG 


GTATGGGAAG 


GCGGAATGTC 


GTTCCACGGC 


301 


GGCTTTTTGG 


GTGTAGTTAT 


TGCCATATGG 


TTGTTCAGCC 


GCAAGCACGG 


351 


CATCGGCTTC 


CTCAAACTGA 


TGGACACGGT 


CGCGCCGCTC 


GTTCCGCTGG 


401 


GTCTCGCTTC 


GGGACGTATC 


GGCAACTTTA 


TCAACGGCGA 


ACTTTGGGGA 


451 


CGCATTACCG 


ACATTAACGC 


ATTTTGGGCA 


ATGGGCTTCC 


CGCAAGCGCA 


501 


TTACGAAGAT 


GCCGAAGCCG 


CCGCGCACAA 


TCCGCTTTGG 


GCAGAATGGC 


551 


TGCAACAATA 


CGGTATGCTG 


CCGCGTCATC 


CCTCGCAGCT 


TTATCAGTTT 


601 


GCCCTTGAAG 


GCATCTGCCT 


GTTCGCCGTC 


GTTTGGCTGT 


TTTCCAAAAA 


651 


ACCGCGCCCG 


ACCGGGCAGA 


CTGCCGCGCT 


TTTTCTCGGC 


GGCTACGGCG 


701 


TGTTCCGCTT 


TATTGCCGAA 


TTTGCGCGCC 


AACCCGACGA 


CTATCTCGGG 


751 


CTGCTGACCT 


TGGGGCTGTC 


GATGGGGCAA 


TGGTTGAGCG 


TCCCGATGAT 


801 


TGTTTTGGGT 


ATCGTCGGCT 


TTGTCCGGTT 


CGGCATGAAA 


AAACAGCACT 


851 


GA 











This corresponds to the amino acid sequence <SEQ ID 530; ORF 137.ng>; 

gl37 .pep 
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1 MI IHHQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MSFHG 

101 GFLGWIAIW LF SRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RITDINAFWA MGFPQAHYED AEAAAHNPLW AEWLQQYGML PRHPSQ LYQF 

201 ALEGICLFAV WL FSKKPRP TGQTAALFLG GYGVFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 53 1>: 

ml37 . seq 

1 ATGATTACCC 

51 TGCCGTCCGC 

101 CCTTTCTCGG 

151 GAATCGCTCG 

201 CGGGCGTTTG 

251 ATCCGCTTGA 

3 01 GGCTTTTTGG 
351 CATCGGCTTC 

4 01 GTCTCGCTTC 
4 51 CGCGTTACCG 
501 TTACGAAGAT 
551 TGCAACAATA 
6 01 GCACTTGAAG 
651 ACAGCGGTCG 
701 TATTCCGCTT 
751 CTGCTGACCT 
801 TGTTTTGGGT 
851 GA 

This corresponds to the amino acid sequence <SEQ ID 532; ORF 137>: 

ml3 7.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MSFHG 

101 GFLGWIAIR LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED AEAAAHNPLW AEWLQQYGML PRHPS QLYQF 

201 ALEGICLFTV IWL FSKKQRS TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFGM K KQH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 137 shows 95.4% identity over a 283 aa overlap with a predicted ORF (ORF 137.ng) 
from N. gonorrhoeae: 

ml37/gl37 

10 20 30 40 50 60 

MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

II I I I I II I I I II I I I II II I I I I M I I I I II M I I I M I I I I II I I II I I I I I I M I 
MIIHHQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
10 20 30 40 50 60 

70 80 90 100 110 120 

I LGVI LGGRLGYVLFYKFSDYLAHPLDI FKVWEGGMSFHGGFLGWI AI RLFGRKHGI GF 

1 1 1 II IMIIIIII llllllll Mill II Mill II Ml MM i I II II Ihlllllll 

I LGVI LGGRLGYVLFYKFSDYLAHPLDI FKVWEGGMSFHGGFLGWI AIWLFSRKHGIGF 
70 80 90 100 110 120 

130 140 150 160 170 180 

L KLMDTV AP LV P LGLAS GR I GN F I NGE L WGRVTD I N AF WAMGF PQ AR YED AE AAAHN P L W 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I : I I I M I I I I I I I I h I I I I I I I I | | | | | 
LKLMDTVAPLVPLGLASGRIGNFINGELWGRITDINAFWAMGFPQAHYEDAEAAAHNPLW 
130 140 150 160 170 180 



ATCCCCAATT CGATCCCGTC 
TGGTATGCCC TAAGCTACAT 
CAGAAGGCGC ATCGCGCAAG 
ACGACTTCCT GACATGGGGC 
GGTTACGTCC TGTTTTACAA 
TATTTTCAAG GTATGGGAAG 
GTGTAGTTAT TGCCATACGG 
CTCAAACTGA TGGATACGGT 
GGGACGTATC GGCAACTTCA 
ACATCAACGC ATTTTGGGCA 
GCCGAAGCCG CCGCGCACAA 
CGGTATGCTG CCGCGTCATC 
GCATCTGCCT GTTCACCGTC 
ACCGGACAAG TCGCCTCGCT 
CATTGCCGAA TTCGCACGCC 
TGGGGCTGTC GATGGGGCAA 
ATCGTCGGCT TTGTCCGGTT 



CTTATCAGTA TCGGCCCGCT 
CCTCGGATTT ATTCTTTTTA 
GCTTGTCCGT TTTTACCAAA 
ATTTTGGGCG TAATTTTGGG 
GTTTTCCGAC TACCTCGCCC 
GCGGAATGTC GTTCCACGGC 
TTGTTCGGCC GCAAACACGG 
CGCACCGCTC GTTCCGCTGG 
TCAACGGCGA ACTTTGGGGA 
ATGGGCTTCC CGCAGGCGCG 
TCCGCTTTGG GCAGAATGGC 
CCTCGCAGCT TTATCAGTTT 
ATTTGGCTGT TCTCTAAAAA 
CTTCCTCGGC GGCTACGGCA 
AACCCGACGA CTATCTCGGG 
TGGTTGAGCG TCCCGATGAT 
CGGCATGAAA AAACAGCACT 



ml3 7.pep 
gl37 

ml3 7 .pep 
gl37 

ml37 .pep 
gl37 



ml37 .pep 



190 200 210 220 230 240 

AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 



WO 99/57280 



PCT/US99/09346 



389 



g!37 



Ml 1 1 III II I III M IMIMIi lllhhll MM I M M M I II II I M 1 1 1 1 M 

AEWLQQYGMLPRHPSQLYQFALEGICLFAWWLFSKKPRPTGQTAALFLGGYGVFRFIAE 
190 200 210 220 230 240 



250 260 270 280 

ml 3 7 . pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

M M I M I M 1 1 M M M M M M M M M M M M 1 1 M M M 

g i3 7 FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 533>: 



al37 . seq 
1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 



ATGATTACCC 
TGCCGTCCGC 
CCTTTCTCGG 
GAATCGCTCG 
CGGGCGTTTG 
ATCCGCTTGA 
GGCTTTTTGG 
CATCGGCTTC 
GTCTCGCTTC 
CGCGTTACCG 
TTACGAAGAC 
TGCAACAATA 
GCACTTGAAG 
ACAGCGGCCG 
TATTCCGCTT 
CTGCTGACCT 
TGTTTTGGGT 
GA 



ATCCCCAATT 
TGGTATGCCC 
CAGAAGGCGC 
ACGACTTCCT 
GGTTACGTCC 
TATTTTCAAG 
GTGTAGTTAT 
CTCAAACTGA 
GGGACGTATC 
ACATCAACGC 
CTCGAAGCCG 
CGGTATGCTG 
GCATCTGCCT 
ACCGGACAAG 
CATTGCCGAA 
TGGGGCTGTC 
ATCGTCGGCT 



CGACCCCGTC 
TAAGCTACAT 
ATCGCGCAAG 
GACATGGGGC 
TGTTTTACAA 
GTATGGGAAG 
TGCCATATGG 
TGGACACGGT 
GGCAACTTCA 
ATTTTGGGCA 
CCGCGCACAA 
CCGCGTCATC 
GTTCGCCGTC 
TCGCCTCACT 
TTTGCACGCC 
GATGGGGCAA 
TTGTCCGGTT 



CTTATCAGTA 
CCTCGGATTT 
GCTTGTCCGT 
ATTTTGGGCG 
GTTTTCCGAC 
GCGGAATGTC 
TTGTTCGGTC 
CGCACCGCTC 
TCAACGGCGA 
ATGGGCTTCC 
TCCGCTTTGG 
CCTCGCAGCT 
GTTTGGCTGT 
CTTCCTCGGC 
AACCCGACGA 
TGGTTGAGCG 
CGGCATGAAA 



TCGGCCCGCT 
ATTCTTTTTA 
TTTTACCAAA 
TAATTTTGGG 
TACCTCGCCC 
GTTCCACGGC 
GCAAACACGG 
GTTCCACTGG 
ACTTTGGGGA 
CGCAGGCGCG 
GCAGAATGGC 
TTATCAGTTT 
TCTCTAAAAA 
GGCTACGGCA 
CTATCTCGGG 
TCCCGATGAT 
AAACAGCACT 



This corresponds to the amino acid sequence <SEQ ID 534; ORF 137.a> 

al37.pep 

MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR 

ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK 

GFLGWIAIW LFGRKHGIGF LKLMDTVAPL VPLGLASGRI 



1 
51 
101 
151 
201 
251 



RVTDINAFWA MGFPQARYED LEAAAHNPLW AEWLQQYGML 
ALEGICLFAV VWL FSKKQRP TGQVASLFLG GYGIFRFIAE 
LLTLGLSMGQ WLSVPMIVLG IVGFVRFGMK KQH* 



ml37/al37 98.2% identity in 283 aa overlap 

10 20 



30 



40 



IAQGLSVFTK 
VWEGG MSFHG 
GNFINGELWG 
PRHP5Q LYQF 
FARQPDDYLG 



50 



60 



ml37 .pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I i I I I I I I I I 
a!37 MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml37 . pep ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIRLFGRKHGIGF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al37 ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGVVIAIWLFGRKHGIGF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml37 . pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 
I I I I I I I I I I I I M I I ! I I I M I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al37 LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDLEAAAHNPLW 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml37 .pep AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
I I I II I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al37 AEWLQQYGMLPRHPSQLYQFALEGICLFAVVWLFSKKQRPTGQVASLFLGGYGIFRFIAE 
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190 200 210- 220 230 240 

250 260 270 280 

ml 3 7 . pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al37 FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 535>: 

gl38 . seq 

1 ATGGAGTTTG AAAACATTAT TTCCGCCGCc gaCAAGGCGC GTATCCTTGC 

51 CGAAGCACTG CCTTACAtCC gccgGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AGTATGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GCGAATTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGAC GATGGATATT 

3 01 GTCGAAATGG TATTGGGCGG GCACGTCAAC AAGGAAATCG TGTCGATGAT 
351 TAACACATAT GGAGGGCACG CGGTCGGCGT GAGCGGGCGC GACGACCATT 

4 01 TCATTAAGGC GAAGAAACTT TTGGTCGATA CGCCCGAACA GAATAGCGTG 
451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 
501 AGGGCTGATA GAACGCGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 
551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT GGCAGGCAAA 
601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAAtatcgc 
651 cgGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC acgCCGAAAC 
701 GGATTGATGG GCTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 
751 AAAATCGCTT CTGCGGTCGA AGCcgccgtC aACGGTGTGA AAGCCACGCA 
801 CATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 
851 ATGCCGGTAT CGGGTCGATG ATTTTAGGCA GAGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 536; ORP 138.ng>: 

gl38 .pep 

1 MEFENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKETMDI 

101 VEMVLGGHW KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LVDTPEQNSV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDGLIA DGTLYGGMLP 

251 KI AS AVE AAV NGVKATHIID GRLP NALLLE IFTDAGIGSM IL GRGEDA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 537>: 

m!38 . seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

3 51 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

4 01 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 
451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 
501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 
551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 
601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 
651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 
701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 
751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGTGTGA AAGCCACGCA 
801 TATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 
851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 538; ORF 138>: 

ml38 .pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 
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51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDELIA DGTLYGGMLP 

2 51 KIASAVEAAV NGVKATHIID GRLPN ALLLE IFTDAGIGSM IL GGGEDA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 98.0% identity over a 298 aa overlap with a predicted ORF (ORF 138.ng) 
from K gonorrhoeae: 

ml38/gl38 

10 20 30 40 50 60 

MESENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 

II II 1 1 1 1 1 1 1 1 1 1 1 II II I ! I II 1 1 1 1 1 II II i M 1 1 M ! 1 1! M I M i 1 1 1 II 1 1 1 

MEFENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 
10 20 30 40 50 60 

70 80 90 100 110 120 

IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKEIVSMINTY 

IM M II II II I IMIIIMI lllllllll lllllhMIIIIIMM II III II IMII 

IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKETMDI VEMVLGGHVN KEIVSMINTY 
70 80 90 100 110 120 

130 140 150 160 170 180 

GGHAVGVSGRDDHFI KAKKLL IDTPEQNGVD I GQVGTVES I DTGLVKGLI ERGCI PWAP 

I I I M 1 1 II II 1 1 II 1 1 INI :| I! I Ihl 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II I M 1 1 M I II 1 1 

GGHAVGVSGRDDHFI KAKKLLVDTPEQNSVDI GQVGTVES IDTGLVKGLI ERGCI PVVAP 
130 140 150 160 170 180 

190 200 210 220 230 240 

VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNIAGVMDKTGNLLTKLTPKRIDELIA 

1 1 1 1 Ml I III IMIIIIMI IIIIIIIIIIIIIIMIIillll MM II lllli I III 

VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNI AGVMDKTGNLLTKLTPKRI DGL I A 
190 200 210 220 230 240 

250 260 270 280 290 299 

DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGGGEDAX 

M MM II Ml I MM llllllllll llllll II III Ml II li:i MM I L Mill 

DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGRGEDAX 
250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 539>: 

al38.seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

4 51 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGCGTGA AAGCCACGCA 

801 TAT CAT CG AC GGCAGGGTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 



ml38 .pep 
gl38 

ml38 .pep 
gl38 

ml3 8 .pep 
gl38 

ml3 8 .pep 
gl38 

m!38.pep 
gl38 
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This corresponds to the amino acid sequence <SEQ ID 540; ORF 138.a>: 

al38.pep 

MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 
RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 
VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 
DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 
LAEELNAEKL LMMTNIAGVM DKTGN LLTKL TPKRI DELIA DG TLYGKMT.P 
K I AS AVE AAV NGVKATHIID GRVPNALLLE IFTDAGIGSM ILGGGEDA* 



1 
51 
101 
151 
201 
251 



ml38/al38 99.7% identity in 298 aa overlap 

10 20 

ml38.pep 



al38 

ml38 .pep 
al38 

ml38 .pep 
al38 

ml38 .pep 
al38 

ml 3 8 .pep 
al38 



30 40 50 60 

MESENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 
l!1, llilliMIIIIIMIIIIIMII!MlllMIIIIMIIMIIIIMIIilMII 
MESENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDVVLLKLVG 
10 20 30 40 50 



60 



70 



80 



90 100 110 120 

IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKEIVSMINTY 
,,M 'IIIIIIIIIIIIIMiMIMM!IMIMIiMIM]MMMMMIMIMI 
IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKEIVSMINTY 
7 0 80 90 100 



110 



120 



130 



140 150 160 170 180 

GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCIPWAP 
M I I I I ' * I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | M | | | | | | | | | M | | , 

GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCIPWAP 
13 0 140 150 160 170 



180 



190 



200 210 220 230 240 

VGVGEKGEAFNINADLVAGKLAEELNAEKLLI^MTNIAGVMDKTGNLLTKLTPKRIDELIA 

IIIIIMIIIIIMIIIMIIIMIIMIIIIIIIIlliiiiiiiiiiiiiM, ,,),,, 

VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNIAGVMDKTGNLLTKLTPKRIDELIA 
190 200 210 220 230 240 



250 260 270 280 290 299 

DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFT DAGIGSMILGGGEDAX 

"MiiiiiiiiiiiiiiiiiiiiMiiiiiirni,,,,,,,,,,,, minimi 

DGTL YGGML PK I AS AVEAAVNGVKATH 1 1 DGRVPN ALLLE I FT DAG I G SM I LGGGE DAX 
250 260 270 280 290 



The following partial DNA sequence was identified in ^.gonorrhoeae <SEQ ID 541>- 

gl39.seq 

1 ATGCGAACCA CCTCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAggc ggcggcggag 

101 gcGGCACTTC TGCTCCCGAC TTTAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACGATAGC GGAATCAGCA GCAGTATCTT ACGCCGGTAT 

2 01 AAAAAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 
251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAAAGCCCC CCGAATCTGC 

3 01 ATACCGGAGA CTTTTCAAAC CCAAATGACC AATATTAAGA ATATGATCAA 

3 51 CCTCAAACCT GCAATTGAAG CAGGCTATAC AGGACGCGGG GTAGAGGTAG 

4 01 GTATCGTCGA TACAGGCGAA TCCGTCGGCA GCATATCCTT TCCCGAACTG 
451 TATGGCAGAA AAGAACACGG CTATAACGAA AATTACAAAA ACAAATTACA 

5 01 AAAACTATAC GGCGTATATG CGGAAGGAAG CGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 542; ORF 138.ng>: 
gl3 9.pep 

1 MRTTSTFPTK TFKPAAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATIAESA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKIKAPRIC 

101 IPETFQTQMT NIKNMINLKP AIEAGYTGRG VEVGIVDTGE SVGSISFPEL 

151 YGRKEHGYNE NYKNKLQKLY GVYAEGSA* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 543>: 

ml39 . seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGACTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGTACCGG TATCGGCAGC 

151 AACAGCAGAG CAACAACAGC GAAATCAGCA GCAGTATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTCCA AACCCAAATG ACGCATtACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTACG AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 544; ORF 138>: 

ml3 9 . pep 

1 MRTTPTFPTK TFKPTAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETFQTQM THYKNLINLK PAI EAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYEKLYGVY AEGSA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 92.2% identity over a 179 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae; 

ml39/gl39 

10 20 30 40 50 60 

ml3 9 .pep 
g!39 



ml3 9 .pep 
gl39 



ml 3 9 .pep 
gl39 

120 130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 545>: 

al39 . seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACAACAGC GAAATCAGCA GCAATATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTACA AACCCAAATG ACGCAT.ACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

4 51 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTAC . AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 546; ORF 139.a>: 

al39.pep 

1 MRTTPTFPTK TFKPAAMALA VATTLSA CLG GGGGGTSAPD FNAGGTGIGS 



MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

MM M M I M I ! H IN MMI 1 1 1 1 1 i MMM I MIM 1 1 ! II I ! II ! 1 1 1 MM 

MRTTSTFPTKTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATIAESA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

M ! 1 1 i 1 1 1 1 M t i 1 1 1 M I i 1 1 i 1 1 1 1 M I i II : 1 1 MMMMMMM MMMI 

AVSYAGI KNEMCKDRSMLCAGRDDVAVTDRDAKI KAP - RI CI PETFQTQMTNI KNMINLK 
70 80 90 100 110 

130 140 150 160 170 
PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENY EKLYGVYAEGSAX 

MMMMMMIMMMMMMMIMMMMMMMI MMMMMMI 

PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYKNKLQKLYGVYAEGSAX 
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51 NSRATTAKSA AISYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 
101 CIPETLQTQM THXKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 
151 LYGRKEHGYN ENYXKLYGVY AEGSA* 

ml39/al39 97.1% identity in 175 aa overlap 

10 20 30 40 50 60 

ml 3 9 . pep MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 
I I I I I I I I I I I I I I : I I I I I I II I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
al39 MRTTPTFPTKTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATT7VKSA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml39 . pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 
I : I I I I I t 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II : I I I I I I I II I I It 
al39 AISYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETLQTQMTHXKNLINLK 

70 80 90 100 110 120 

130 140 150 " 160 170 

ml39.pep PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYEKLYGVYAEGSAX 

1 1 1 ii i 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al 3 9 PAIEAGYTGRGVEVGI VDTGESVGS I S FPELYGRKEHG YNEN YXKLYGVYAEGSAX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 547>: 

gl40 . seq 

1 Atgtcggcac gCGGCAAGGG GGCAGgctat ctcAACAGTA CCGGACGACa 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 AAAATATCAA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCCG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

3 01 GAGCTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 
351 GGTCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCTACGG CGCACAACTT 

4 01 TCCGCACAGC GGCAGCCGTA CAGCATGCGA ATACCGCCGA CGGCGTACGc 
451 aTCTTcaaCA GTCTCGCCGC TAccgTCTAt GccgACAGTG CCGCCGCCCA 
501 TGccgATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 
551 ACAACGGTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 
601 ACGTGGGAAC AGGGCGGTGT CGAAGGCAAA ATGCGCGGCA GTACCCAAAC 
651 TATCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 
701 TGGGCATAGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 
751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGTGG GCGATATCGG 
801 CTATCTCAAA GGCCTGTTCT CctaCGGACG CTACAAAAAC AGCATCAGCC 
851 GCAGCACCGG TGCGGATGAA TATGCGGAAG GCAGCGTCAA CGGCACGCTG 
901 ATGCAGCTGG GCGCACTGGG TGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
951 AGATTTGACG GTTGAAGGCG GTCTGCGCCA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCagt GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAACTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTCTG CGACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CGCGGTAACG GGCGGCTTTA CCGGCGCGGC TGCAGCAACC 

1201 GGCAAGACGG GTGCACGCAA TATGCCGCAC ACCCGCCGGG TTGCCGGTCT 

1251 GGGGGTGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACACCGG TTCCAAACAG TACGGCAACC ACAGCGGACA AATCGGCGTA 

1351 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 548; ORF 140.ng>: 

gl40 .pep 

1 MSARGKGAGY LNSTGRHVPF LSAAKIGQDY SFFKNIKTDG GLLASLDSVE 

51 KTAGSEGDTP SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

101 ELDASESSAT PETVETAVAD RTDMPGIRLR RTTFRTAAAV QHANTADGVR 

151 IFNSLAATVY ADSAAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 

201 TWEQGGVEGK MRGSTQTIGI AAKTGENTTA AATLGIGRST WSENSANAKT 
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251 DSISLFAGIR HDVGDIGYLK GLFSYGRYKN SISRSTGADE YAEGSVNGTL 
3 01 MQLGALGGVN VPFAATGDLT VEGGLRHDLL KQDAFAEKGS ALGWSGNSLT 

3 51 EGTLVGLAGL KLSQPLSDKA VLSATAGVER DLNGRDYAVT GG FTGAAAAT 
401 GKTGARNMPH TRRVAGLGVD VEFGNGWNGL ARYSYTGSKQ YGNHSGQIGV 
451 GYRF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 549>: 

ml40 . seq 

1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT CTCAACAGTA CCGGACGACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGCCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGCAGCCGAC CGCACAGATA* TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

4 01 TCCGCGCAGC GG CAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 
451 ATCTTCAACA GTCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 
501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 
551 ACAACGGCAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 
6 01 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 
651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 
701 TGGGCATGGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 
751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 
801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 
851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 
901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACGC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGTCTGG TTGCCGGCCT 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

13 01 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

13 51 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 550; ORF 140>: 

ml4 0 .pep 

1 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE 
51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
101 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR 
151 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 
201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGRST WSENSANAKT 
251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 

3 01 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 

4 01 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 
451 GYRF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 140 shows 94.5% identity over a 454 aa overlap with a predicted ORF (ORF 140.ng) 
from N. gonorrhoeae: 

ml40/g!40 

10 20 30 40 50 60 

ml4 0 .pep MS ARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVE KTAGSEGDTL 

1 1 M 1 1 1 1 1 1 II ! 1 1 1 :| I II II 1 1 1 1 III II I I hi MM I II 1 1 1 1 1 1 i 1 1 1 1 1 II 

gl4 0 MSARGKGAGYLNSTGRHVPFLSAAKIGQDYSFFKNIKTDGGLLASLDSVEKTAGSEGDTP 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml4 0 . pep SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPE WETAAAD 

I M I M 1 1 1 1 1 1 1 ! 1 1 1 II M 1 1 1 1 1 1 1 II I M 1 1 1 1 1 II 1 1 1 1 1 M 1 1 II 1 1 1 1 hi I 

gl40 SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAVAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml4 0 . pep RTDMPGIRPYGATFRAAAAVQHANAADGVRI FNSLAATVYADSTAAHADMQGRRLKAVSD 

I I M I I I I -\ I I : I I I I I I I I » I I I I I I I I I I I I II I M I : I I I I I I I I I I I I I || | 
gl4 0 RTDMPGIRLRRTTFRTAAAVQHANTADGVRIFNSLAATVYADSAAAHADMQGRRLKAVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVI AQTQQDGGTWEQGGVEGKMRGSTQTVGI AAKTGENTTAAATLGMGRST 

MMMI M Mi IMMIMMIMI II IMMM IhIM I IMIMI II I MMMM 

gl40 GLDHNGTGLRVI AQTQQDGGTWEQGGVEGKMRGS TQT I G I AAKTGENTTAAATLG I GRS T 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml40 . pep WSENSANAKTDS I SLFAGI RHDAGDIGYLKGLFS YGRYKNS I SRSTGADEHAEGS VNGTL 

M i 1 1 1 II I II II III I M MMI IMMI 1 1 1 II M I M M II 1 1 1 M M I Ml MM I 

9140 WSENSANAKTDSliSLFAGIRHDVGDIGYLKGLFSYGRYKNSISRSTGADEYAEGSVNGTL 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 0 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 
I I I I I I I I I I I I I I I I I I I I I I I I I I : II I I I I I I I I I I I I I I I I II I I II I | | | | | | | | 
gl40 MQLGALGGVNVPFAATGDLTVEGGLRHDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml4 0 . pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 

MMIIIMM 1 1 1 IMMMM II M MM I M M M M I Ml I M I M llllhl 

gl40 KLSQPLSDKAVLSATAGVERDLNGRDYAVTGGFTGAAAATGKTGARNMPHTRRVAGLGVD 

370 380 390 400 410 420 



430 440 450 

ml4 0 . pep VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 

IMMMIMMMMMMMMIMMMIMI 

gl4 0 VEFGNGWNGLARYSYTGSKQYGNHSGQ I GVG YRFX 

430 440 450 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 55 1>: 

al40 . seq 

1 ATGTCGGCAG GCGGTAAGGG GGCAGGCTAT CTCAACCGTA CCGGACAACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCGGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GTAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGCCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

4 01 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

451 ATCTTCAACA ATCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGCTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATGGG ACACAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 

801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
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951 AGATTTGACG GTCGAAGGCG 

1001 CATTCGCCGA AAAAGGCAGT 

1051 GAAGGCACAC TGGTCGGACT 

1101 CGATAAAGCC GTCCTGTTTG 

1151 GACGCGACTA CACGGTAACG 

1201 GGCAAGACGG GGGCACGCAA 

1251 GGGCGCGGAT GTCGAATTCG 

1301 GCTACGCCGG TTCCAAACAG 

1351 GGCTACCGGT TCTGA 



GTCTGCGCTA CGACCTGCTC AAACAGGATG 
GCTTTGGGCT GGAGCGGCAA CAGCATCACT 
CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 
CAACGGCGGG CGTGGAACGC GACCTGAACG 
GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 
TATGCCGCAC ACCCGCCTGG TTGCCGGTCT 
GCAACGGCTG GAACGGCTTG GCACGTTACA 
TACGGCAACC ACAGCGGACG AGTCGGCGTA 



This corresponds to the amino acid sequence <SEQ ID 552; ORF 140.a>: 

LSAAKIGRDY SFFTNIETDG GLLASLDSVE 
RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
RTDMPGIRPY GATFRAAAAV QHANAADGVR 
QGRRLKAVSD GLDHNATGLR VIAQTQQDGG 
AAKTGENTTA AATLGMGHST WSENSANAKT 
GLFSYGRYKN SISRSTGADE HAEGSVNGTL 
VEGGLRYDLL KQDAFAEKGS ALGWSGNSIT 
VLFATAGVER DLNGRDYTVT GGFTGATAAT 
VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 

m 1 40/al 40 98 .2% identity in 454 aa overlap 

10 20 30 40 50 60 

ml 4 0. pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 
Ml I I I I I I I I I I : I I I I I I I I I I I : I I I I II I I I I I I I I I I I I I I 11 I I I I I II I I I 
al4 0 MSAGGKGAGYLNRTGQRVPFLSAAKIGRDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

10 20 30 40 50 60 



al4 0 .pep 






1 


MSAGGKGAGY 


LNRTGQRVPF 


51 


KTAGSEGDTL 


SYYVRRGNAA 


101 


ELDASESSAT 


PETVETAAAD 


151 


IFNNLAATVY 


ADSTAAHADM 


201 


TWEQGGVEGK 


MRGSTQTVGI 


251 


DSISLFAGIR 


HDAGDIGYLK 


301 


MQLGALGGVN 


VPFAATGDLT 


351 


EGTLVGLAGL 


KLSQPLSDKA 


401 


GKTGARNMPH 


TRLVAGLGAD 


451 


GYRF* 





70 . 80 90 100 110 120 

ml 4 0. pep SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASES SAT PETVETAAAD 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I- 1 | | | 
al4 0 SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml4 0 . pep RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNSLAATVYADSTAAHADMQGRRLKAVSD 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I | I I 
al4 0 RTDMPGIRPYGATFRAAAAVQHANAADGVRI FNNLAATVYADSTAAHADMQGRRLKAVS D 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRST 
I I I I I : I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | : | | 
a!40 GLDHNATGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGHST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml40.pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
I I I I i I i I I I I I I I I I I I I I I I I I i I I I I I M I I I I I I I I I I I I I I I I I I I I I I | | I I M 
al4 0 WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 40 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I | M | | | 
al40 MQLGALGGVN VPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSITEGTLVGLAGL 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 4 0 . pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I 
a!4 0 KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
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370 380 390 400 410 420 

430 440 450 

ml 4 0 . pep VEFGNGWNGLARYS YAGSKQYGNHSGRVGVGYRFX 

I I I I I I ! I I I I I I I I I I I I I ! I I I I I I I I I I I | | | 
al4 0 VEFGNGWNGLARYS YAGSKQYGNHSGRVGVGYRFX 

430 440 450 



The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 553>: 

gl41.seq 

1 atgagcttca aAAccgATGC CGAAACCGCC CAATCCTCCA CCATGCGCCC 
51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 
101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAGCTG 
151 CCGCAAAAAC AAGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 
2 01 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC 
251 GCCATATCGG CAAAGACTCT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 
301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ACGCGCAAGT 
351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGCGAC TTCCACGCCA 
401 TCGGTGCGGC GAATAACCTC CTCGCCGCCA TGCTCGACAA CCATATCTAC 
451 CAAGGTAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 
501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGTATGGGCA 

5 51 AGCCTGTtga cggCGTGATG CGtcccGACG GCTTCGACAT CACCGTCGCC 
601 TCCGAAGTGa tggcgGTATT CTGCCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTtt gGCAATATTC TCGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

701 TTTACGCCAA AGATTTGAAG GCACACGGCG CGATGGCGGC ATTGCTAAAA 

751 GATGCGATTA AGCCCAATTT GGTGCAAACC ATCGAAGGCA CTCCGGCCTT 

801 TGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTTA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG TGCGGAAAAA TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCAGT CGTCGTGGCG ACTGTCCGCG 

1001 CCCTGAAATA CAACGGCGGC GTGGAACGCG CCAACCTTGG TGAAGAAAAC 

1051 CTCGAAGCCT TGGCAAAAGG TTTGCCCAAC CTGTTGAAAC ACATTT CCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GCGGCGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA TGCCATCGAC AACCAACCTA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTCG ATTTCAGCGC 

1401 GGAAGCGTCT GCCGAAATCG CCTCGCTGGA AAAACTGGGC TTGGACAAAA 

1451 TGCCGATCTG CATGGCGAAA ACCCAATATT CATTGAGCGA CAACGCCAAA 

1501 CTCTTGGGCT GCCCCGAAGG CTTCCGCATC GCCGTACGCG GTATCACTGT 

1551 TTCCGCCGGC GCGGGCTTCA TCGTTGCGTT GTGCGGCAAT ATGATGAAAA 

1601 TGCCGGGCCT GCCGAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGAA 

1651 CACGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 554; ORF 14Lng>: 

gl41.pep 

1 MS FKTDAETA QSSTMRPIGE IAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVFCLA KDISDLKERF GNILVAYAKD GSPVYAKDLK AHGAMAALLK 

251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

3 01 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 
351 LEALAKGLPN LLKHISNLKN VFGLPVWAL NRFVSDSDAE LAMIEKACAE 
401 HGVEVSLTEV WGKGGAGGAD LARKWNAID NQPNNFGFAY DVELGIKDKI 

4 51 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 
501 LLGCPEGFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDE 
551 HGVIHGLF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 555>: 
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ml41 . seq 

1 ATGAGCTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 
51 GATTGGCGAA ATTGCCGCCA AGCTTGGTCT GAATGCCGAC AACATTGAGC 
101 CTTACGGTCA TTACAAGGCG AAAATCAATC CTGCCGAAGC GTTCAAACTG 
151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 
201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCGTTGC 
251 GCCACATCGG CAAAGATGCC GTGATTGCCC TGCGCGAACC TTCTCTGGGG 
301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 
351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 
4 01 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 
451 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 
501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGTA 
551 AACCCGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 
601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 
651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
701 TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 
751 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 
851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 
901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGAGA TCAAATGCCG 
951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 
1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 
1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 
1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 
1151 TGTCCGACGC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 
1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 
1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

13 01 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 
1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

14 01 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 
1451 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 
1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 
1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 
1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 556; ORF 141>: 

ml 41. pep 

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNAD NIEPYGHYKA KINPAEAFKL 
51 PQKQGRLILV TAINTPTPAGE GKTTVTIGLA DALRHIGKDA VIALREPSLG 
101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 
151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPDGFDITVA 
201 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK ANGAMAALLK 
251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 
301 GFGADLGAEK FCDIKCRLAG LKPDAAWVA TVRALKYNGG VERANLGEEN 
351 LDALEKGLPN LLKHISNLKN VFGLPWVAL NRFVSDADAE LAMIEKACAE 
401 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 
451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 
501 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 
• 551 EGVIHGLF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 141 shows 97.5% identity over a 558 aa overlap with a predicted ORF (ORF 141. ng) 
from N. gonorrhoeae: 

ml41/gl41 

10 20 30 40 50 60 

ml41 . pep MSFKTDAEIAQSSTMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

' 1 1 M M I MMIIIIIMMIIIIIhlMIIIMIIIIjllllMIMIIIIIIIM 

9141 MS FKTDAETAQS S TMRP IGE I AAKLGLNVDNI EP YGHYKAKI NPAEAF^PQKQGRLI LV 

10 20 30 40 50 60 



WU W5728U 



PCT/US99/09346 



400 



70 80 90 100 110 120 

ml 4 1 . pep TAINPTPAGEGKTTVTIGLADALRHIGKDAVIALREPSLGPVFGVKGGAAGGGYAQVLPM 

1 1 1 1 1 M 1 1 1 1 1 1 II I M II III Mill hi II 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 , 1 1 

gl41 TAINPTPAGEGKTTVTIGIiADALRHIGKDSVIALREPSLGPVFGVKGGAAGGGYAQVLPM 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml4 1 . pep EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRVVDMNDRQLRN ID 

1 1 1 1 1 Ml II 1 1 1 1 1 M I II 1 1 1 INI 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 i I II 1 1 1 1 1 1 i 1 1 

gl41 EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRVVDMNDRQLRNIID 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml41 . pep GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

1 1 1 ! 1 1 [ I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 = [ 1 1 1 1 1 !! 1 1 1 1 1 1 1 ! I f 1 1 

gl41 GMGKPVDGVMRPDGFDI TVASEVMAVFCLAKDI SDLKERFGNILVAYAKDGS PVYAKDLK 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml41 . pep ANGAMAAIiLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRIAKHLADYAVTEA 

hlllll! IMMIIIMIIIIIMIIMIMIIMIMIIIIIIMIIIMIIIIIMI 

gl4 1 AHGAMAALLKDAI KPNLVQTIEGTPAFVHGGPFANI AHGCNSVTATRLAKHLADYAVTEA 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml4 1 . pep GFGADLGAEKFCDIKCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 

1 1 II 1 1 1 1 II 1 1 II MM II M III I Ml II 1 1 1 1 1 1 1 1 II M 1 1 1 M 1 1 h 1 1 MMI 

gl4 1 GFGADLGAEKFCDIKCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLEALAKGLPN 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml41 . pep LLKHISNLKNVFGLPVWALNRFVSDADAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 

M I II IM II II II MMMMIMMIMM I IIMIIMI II III M 1 1 II Mill II 

gl41 LLKHISNLKt^FGLPVVVALNRFVSDSDAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml 4 1 . pep LARKWNAI ESQTNNFGFAYDVELG I KDKI RAI AQKV YGAEDVDFSAEAS AE I AS LEKLG 

111111111-1 1 1 IM MMM II II 1 1 1 M M I M I M I M II I M I II 1 1 M II M 

gl41 LARKVVNAIDNQPNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

430 440 450 460 470 480 



490 500 510 520 530 540 

ml4 1 . pep LDKMP I CMAKTQYSLSDNAKLLGCPEDFRI AVRGI TVSAGAGFI VALCGNMMKMPGLPKV 

II I M 1 1 MM I II i II MMM 1 1 1 M M M I II I M II I M Ml I II II I II I II 1 1 

gl41 LDKMPICMAKTQYSLSDNAKLLGCPEGFRIAVRGITVSAGAGFIVALCGN^KMPGLPKV 

490 500 510 520 530 540 



550 559 
ml 4 1 . pep PAAEKIDVDAEGVIHGLFX 

MMMMIMMIMM 

gl41 PAAEKIDVDEHGVIHGLFX 

550 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 557>: 

al41 . seq 

1 ATGAGTTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 



WO 99/57280 



FCT/US99/U9.540 



401 



201 


GGCGGGCGAA 


GGTAAAACCA 


251 


GCCATATCGG 


CAAAGACTCT 


301 


CCGGTGTTCG 


GCGTGAAAGG 


351 


TTTGCCGATG 


GAAGACATCA 


401 


TCGGTGCGGC 


AAATAATCTG 


451 


CAAGGCAACG 


AGTTGAACAT 


501 


GGTCGATATG 


AACGACCGCC 


551 


AGCCTGTTGA 


CGGCGTGATG 


601 


TCCGAAGTGA 


TGGCGGTATT 


651 


AGAGCGTTTG 


GGCAACATCC 


701 


TTTACGCCAA 


AGATTTGAAA 


751 


GATGCGATTA 


AGCCCAACTT 


801 


CGTACACGGC 


GGCCCGTTCG 


851 


CCGCAACCCG 


TCTGGCGAAA 


901 


GGCTTCGGCG 


CGGACTTGGG 


951 


CCTTGCCGGT 


TTGAAACCTG 


1001 


CGTTGAAATA 


TAACGGCGGC 


1051 


TTAGACGCTT 


TGGAAAAAGG 


1101 


CCTGAAAAAC 


GTATTCGGAC 


1151 


TGTCCGACTC 


CGATGCCGAG 


1201 


CACGGCGTTG 


AAGTTTCCCT 


1251 


CGGCGCGGAT 


TTGGCGCGCA 


1301 


ATAACTTCGG 


TTTCGCCTAC 


1351 


CGTGCGATTG 


CCCAAAAAGT 


1401 


GGAAGCGTCT 


GCCGAAATCG 


1451 


TGCCGATCTG 


CATGGCGAAA 


1501 


CTGTTGGGCT 


GCCCCGAAGA 


1551 


TTCCGCAGGC 


GCAGGTTTCA 


1601 


TGCCCGGCCT 


GCCCAAAGTT 


1651 


GAAGGCGTGA 


TTCACGGCTT 



CCGTAACCAT CGGTTTGGCG GACGCATTGC 
GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 
CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 
ACCTGCACTT CACCGGAGAT TTTCACGCCA 
CTTGCCGCGA TGCTCGACAA CCATATCTAC 
CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 
AGTTGCGCAA CATCATCGAC GGCATGGGCA 
CGTCCTGACG GTTTCGATAT TACCGTTGCT 
CTGTCTTGCC AAAGACATCA GCGATTTGAA 
TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
GCGAATGGCG CGATGGCGGC ATTGCTTAAA 
GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
CCAACATCGC CCACGGCTGC AACTCCGTAA 
CACCTTGCCG ATTACGCCGT AACCGAAGCA 
CGCGGAAAAA • TTCTGCGACA TCAAATGCCG 
ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 
GTGGAACGCG CCAACCTCGG CGAAGAAAAT 
TTTGCCCAAC CTGCTGAAAC ACATTTCCAA' 
TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 
TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 
GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 
AAGTCGTCAA CGCCATTGAA AGTCAAACCA 
GATGTCGAGT TGGGCATCAA AGACAAAATC 
GTACGGCGCG GAAGATGTTG ATTTCAGCGC 
CTTCACTGGA AAAACTGGGC TTGGACAAAA 
ACCCAATACT CTTTGAGCGA CAACGCCAAA 
CTTCCGCATC GCCGTGCGCG GCATCACCGT 
TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
CCGGCTGCCG AG AAAAT CG A TGTGGACGCA 
GTTCTGA 



This corresponds to the amino acid sequence <SEQ ID 558; ORF 141.a>: 

al41 .pep 

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAIN PT PAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRVVDM NDRQLRNIID GMGKPVDGVM RPDGFDITVA 

201 SEVMAVFCLA KDISDLKERL GNILVAYAKD GS PVYAKDLK ANGAMAALLK 

251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVVVA TVRALKYNGG VERANLGEEN 

351 LDALEKGLPN LLKHISNLKN VFGLPVWAL NRFVSDSDAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 

4 51 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 

551 EGVIHGLF* 

m!41/al41 99.5% identity in 558 aa overlap 

10 20 30 40 50 60 

ml 4 1 . pep MSFKTDAEIAQSSTMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 
I I I M I I I I I I I M I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I 
al41 MSFKTDAEIAQSSTMRPIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml41 . pep TAINPTPAGEGKTTVTIGLADALRHIGKDAVIALREPSLGPVFGVKGGAAGGGYAQVLPM 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I 
al41 TAINPTPAGEGKTTVTIGLADALRHIGKDSVIALREPSLGPVFGVKGGAAGGGYAQVL PM 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 41 .pep EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNI DPKRVLWRRVVDMNDRQLRNI ID 

I M I I I I I I M I I I I I I I I I I ! I I I I I I I | | | | | | | | | II [ I | I I | | | | | | | | | | | | I || 
a!41 EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRVVDMNDRQLRNIID 

130 140 150 160 170 180 



WO 99/57280 



PCT/US99/09346 



402 



190 200 210 220 230 240 

ml 41. pep GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 

al41 GMGKPVDGVMRPDGFDIT VAS EVMAV FCLAKD I S DLKERLGN I LVAYAKDG S P VYAKDLK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 4 1 . pep ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 
I I I M I I I I I I I I I (I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
al41 ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 1 . pep GFGADLGAEKFCDIKCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 

1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al 4 1 GFGADLGAEKFCDIKCRLAGLKPDAAVVVATVRALKYNGGVERANLGEENLDALEKGLPN 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 41 . pep LLKHISNLKNVFGLPVWALNRFVSDADAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 
I I I I I I M I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | 
al41 LLKHISNLKNVFGLPVWALNRFVSDSDAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml41 . pep LARKWNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | 
al41 LARKVVNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAE1ASLEKLG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml41 . pep LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I | | | | | M 
al41 LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 

490 500 510 520 530 540 



550 559 
ml 41 .pep PAAEKIDVDAEGVIHGLFX 
I I I I 1 I I I I I I I I I I I I I I 
al41 PAAEKIDVDAEGVIHGLFX 

550 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 559>: 

gl42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCTTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAAATATGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCAT ATTGATGCAG AGgCTGCCGT 

201 TTTCCGACAG GATcggaATG AttcgCGCAC TCCGGTTTAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

3 01 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC CGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCATCCCCCC ATGTGCAGCA GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 560; ORF 142.ng>: 

gl42 .pep 

1 MRADFMFADN MPVQVRQRAF YFKLSRFAAM PNMVGKPLFG RQAGQPGKMF 

51 GNILMFVRQH IDAEAAVFRQ DRNDSRTPVY AQHHGRRLVG NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN RPLYKNAAHK 

151 ASPHVQQF* 



WO 99/57280 



PCT/US99/09346 



403 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 56 1>: 

ml4 2.seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCG ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCGC 

351 AAGATGCCAT CG CAT CACGG AACGAAGTTT GAAAATTTTT CTGCAAATCC 

4 01 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCATCCCCcC ATGTGCAGCA GTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 562; ORF 142>: 

m!42.pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 
51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVG NRRDRRHCNA 
101 VTPCRTVCRD DMNACRARCH RITERSLKIF LQIRHFSPLN CPLYKNAAHK 
151 ASPHVQQF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 142 shows 93.7% identity over a 158 aa overlap with a predicted ORF (ORF 142.ng) 
from N. gonorrhoeae: 

ml42/gl42 



10 20 30 40 50 60 

ml 4 2 .pep MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 
MIIIIIIIIIMIIIII!:|||||||||!|::||||||||||||||||||||||||||: 
9142 MRADFMFADNMPVQVRQRAFYFKLSRFAAMPNMVGKPLFGRQAGQPGKMFGNILMFVRQH 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 2 - pep I DAE AAV FRQDRNDS RT P VD AQHHGRRLVGNRRDRRHCN AVT P CRTVCRDDMNACRARCH 

MIIIIIIIIMIIIIIII lllllll llllhl I III III Mill 1 1 1 1! II Ih || 

gl 4 2 IDAEAAVFRQDRNDSRTPVYAQHHGRRLVGNRRNRRHCNAVTPCRTVCRDDMNACRTGCH 

70 80 90 100 110 120 



130 140 150 159 

ml42 . pep RITERSLKIFLQIRHFSPLNCPLYKNAAHKASPHVQQFX 

IlllMII MM lillllllllllllllll 

gl42 RI TERS LKS FLQ I RH FS PLNRPL YKNAAHKAS PHVQQFX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 563>: 

al42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCCGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

4 01 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCACCCCCCA TGTGCAGCAG TTCTGATTCA AAAAGCCGTC GGTCGGACAT 

501 TTCCGCGCGT TACGGCGTAT TACGAGTTCA ACGCATCCTC GATTTTGGCA 

551 AGTTCTGCCA ACAGGTCTTT AAGCAGCAGC ATTTTCTCGC GGCCCAGCAC 

601 TTCCTCGATA GCGTCGTAAC GCTCGTCCAC TTCTTCGCCG ATTTCCTCAT 

651 ACAGCTTCTC GCCCTCGGCA GTCAGCTTCA GAAAAACACG TCGTTGGTCG 

701 TTGGAAGGTT TCAGGCGGAC AACCAAACCC GCTTTTTCAA GGCGGGTCAG 

751 GATACCGGTC AGGCTGGGGC GCAAAATGCA CGCCTGATTC GCCAAATCTT 



WO 99/57280 
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404 



801 GAAAGTCCAG CGTGCCGTTT TCCGCCAAAA GACGGATAAT CCGCCATTGC 
851 TGATCGGTAA TATTCGCCTG ATTCAGAATA GGCCTGAATT GGGTCATCAG 
901 GGCTTCCCTT GCCTGTATCA GACCGATATT GATAGACGCA TGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 564; ORF 142 a>- 

al42.pep " ' ~~" 4 " 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDVVGKPLFG RQAGQPGKMF 

51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVR NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN CPLYKNaS 

151 APPMCSSSDS KSRRSDISAR YGVLRVQRIL DFGKFCQQVF KQQHFLaS 

201 FLDSWTLVH FFADFLIOT.T. AT.^nr.n^ SLWGRFQAD NQTRFfSgS 

so" SSSS&i SSI™ RAVFRQKTDN ppll — IQNRPELGHQ 
m 1 42/al 42 96. 1 % identity in 1 53 aa overlap 

10 20 30 40 50 fin 

ml42.pep ^^FMFADNMPVQVRQRALYFKLSRFAAMPDVVGKPLFGRQAGQPGKMFGNILMFVRQR 

a!42 MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPG 

10 20 30 40 50 60 

70 80 90 100 lio i?n 

ml 4 2 . pep ^AEAAVFRQDRNDSRTPVDAQHHGRRLVGNRRDRRHCNAVTPCRTVCRDDMNACRARCH 
IINIIIIIIIIIIIIIIIIlllMiiii I I I : I I I I I I I II I I I I I I I I I I I I I ■ II 
al42 IDAEAAVFRQDRNDSRTPVDAQHHGRRLVRNRRNRRHCNAVTPCRTVCRDD^ACRTGCH 

70 80 90 100 HO 120 

130 140 150 159 
ml42.pep RITERSLKIFLQIRHFSPLNCPLYKNAAHKASPHVQQFX 
MM INI I II III I II IIMI IN | 

RITERSLKSFLQIRHFSPLNCPLYKNAAHKAPPMCSSSDSKSRRSDISARYGVLRVQRIL 
130 140 150 160 170 180 



al42 
al42 



DFGKFCQQVFKQQHFLAAQHFLDSWTLVHFFADFLIQLLALGSQLQKNTSLVVGRFQAD 
190 2 °0 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEO ID 565>- 

gl43.seq x 
1 ATGTTGAGCT TCGGCTATCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 
51 CTCGCAGATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 
101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTTCAGCCG 

151 ATAGTGgGCT ACTACTCAGA CCGCACTTGG AAGCCGCGCT TGGGCGGCCG 
201 CCGCCTGCCG TATCTGCTTT ACGGCACGCT GATTGCGGTC ATCGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCCTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTGGACG TGTCGTCGAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGATATG GTCAACGAGG 

401 AGCAGAAAAG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGAC 

451 GCGGTTGTGG CAGCGATTCT GCCGTTTGTG TTcgcgtata TCGGTTTGGC 

501 GAACACTGCC GAGAAAGGCG TTGTGCCACA AACCGTGGTC GTAGCATTCT 

551 ATGTGGGTGC GGCGTTACTG ATTATTACCA GTGCGTTCAC AATCTCCAAA 

601 GTCAAAGAAT ACGACCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGTT CGAACTCTTA AAAACCGCGC 

701 CTAAAGTGTT TTGGACGGTT ACTCCGGTAC AGTTTTTCTG CTGGTTCGCC 

751 TTCCGGTATA TGTGGACTTA CTCGGCAGGC GCGATTGCAG AAAACGTCTG 

801 GCACACTACC GATGCGTCTT CCGTAGGCCA TCAGGAGGCG GGCAACCGGT 

851 ACGGCGTTTT GGCGGCGGTG TAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 ATTCTGGCAA AAGTACCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GTTTCTTCTC TATCTTCTTC ATCTACAATC 

1001 AATACGCACT CATCCTGTCT TATATCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGGCC AACGCTTTGT CGGGCAAACA 

1101 CATGGATACT TATTTGGGCC TGTttaacgg ctctgtCTGT ATGCcgcaaa 

1151 tcgTcgctTC GctgttgAGT TTCGTGCTTT TCCCGATGCT GGGCGGCCAT 
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1201 CAGGCAACCA TGTTCTTGGT TGCAGGCGCA GTCTTGCTGC TGGGAGCCTT 
1251 CTCAGTCTGT CTGATTAAAG AGATCCACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 566; ORF 143.ng>: 

gl43.pep 

1 MLSFGYLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGYYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKSYAY GIQSFLANTD 

151 AWAAILPFV FAYIGLA NTA EKGVVPQT VV VAFYVGAALL IITSA FTISK 

201 VKEYDPETYA RYHG I DVAAN QEKANWFELL KTAPKVFWTV TPVQFFCWFA 

251 FRYMWTYSAG AIAENVWHTT DASSVGHQEA GNRYG VLAAV *SVAAVICSF 

301 ILAKVPNKYH KAG YFGCLAL GALGFFSIFF IYNQ YALILS YILIGIAWAG 

351 IITYPLTIVA NALSGKHMDT YLGLFN GSVC MPQIVASLLS FVL FPMLGGH 

4 01 QAT MFLVAGA VLLLGAFSVC L IKETHGGV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 567>: 

ml4 3.seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAAATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC AAATACGGGC 

4 51 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACGATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 

751 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCT TGTTTAACGG CTCTATCTGT ATGCCTCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 568; ORF 143>: 

ml 4 3 .pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

!51 AWAAILPFV FAYIGLA NTA EKGVVPQT VV VAFYVGAALL VITSA FTIFK 

201 VKEYDPETYA RYHG I DVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAG YFGCLAL GALGFFSVFF IGNQ YALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFN GSIC MPQIVASLLS FVL FPMLGGL 

4 01 QATM FLVGGV VLLLGAFSVF LIKETHGGV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml43 / gl43 93.9% identity in 429 aa overlap 

10 20 30 40 50 60 
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ml43.pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
I II I I : I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 
gl43 MLSFGYLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGYYSDRTW 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml43.pep KPRLGGRRLPYLLYGTLIAVIVMI LMPNSGS FGFGYASLAALS FGALMIALLDVSSNMAM 

I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl43 KPRLGGRRLPYLLYGTLIAVIVMI LMPNSGS FGFGYASLAALS FGALMIALLDVSSNMAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 3 . pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAWAAILPFVFAYIGLANTAEKGWPQTVV 
I I I I I I I I I II I I I I I : I I I I I I I I I I I I I I 11 I I I I I! I I I I I I I I I I I I I I I I I I I I 
gl4 3 QPFKMMVGDMVNEEQKSYAYGIQSFLANTDAWAAILPFVFAYIGLANTAEKGWPQTW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 3 . pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
I I I I I I I I I I : I I I I I I I I f I I I I I I I I I I I I I I II I ! I I I I I I I : I I I I I I II : I I I I 
gl4 3 VAFYVGAALLIITSAFTISKVKEYDPETYARYHGIDVAANQEKANWFELLKTAPKVFWTV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 4 3 . pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
I I I I I I I I I I : I I I I I I II I I I I I I I I I I I I I II I : i I I I I I I I I 1 I 1 I I I I I I I I I 
gl43 TPVQFFCWFAFRYMWTYSAGAIAENVWHTTDASSVGHQEAGNRYGVLAAVXSVAAVICSF 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 3 . pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 
: I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I I I I I I I I I I II I I I I I II : 
gl43 I L AKV PN K YHKAG Y FG C L ALG ALG FF S I FF I YNQ Y AL ILSYILIGI AW AG 1 1 T Y P LT I V A 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml4 3 . pep NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGVVLLLGAFSVF 

I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I I I I 
gl4 3 NALSGKHMDTYLGLFNGSVCMPQIVASLLSFVLFPMLGGHQATMFLVAGAVLLLGAFSVC 

370 380 390 400 410 420 

430 

ml 4 3. pep LIKETHGGVX 
I I I I I I I I I 
gl43 LIKEIHGGVX 

430 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 569>: 

al43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATCT TCCAGACGCT CGGTGCCGAT CCGCACAGCC 

101 TCGGCTGGTT CTTTATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

401 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGGC 

451 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACAATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 
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751 TTCCAATATA TGTGGACTTA 

801 GCACACCACC GATGCGTCTT 

851 ACGGCGTTTT GGCGGCGGTG 

901 GTATTGGCGA AAGTGCCGAA 

951 TTTGGCTTTG GGCGCGCTCG 

1001 AATACGCGCT GGTGTTGTCT 

1051 AT TAT C ACT T ATCCGCTGAC 

1101 TATGGGCACT TACTTGGGCC 

1151 TCGTCGCTTC GCTGTTGAGT 

1201 CAGGCCACTA TGTTCTTGGT 

1251 TTCCGTGTTC CTGATTAAAG 



CTCGGCAGGC GCGATTGCGG AAAACGTCTG 
CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 
CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 
TAAATACCAT AAGGCGGGTT ATTTCGGCTG 
GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 
TATACCTTAA TCGGCATCGC TTGGGCGGGC 
GATTGTGACC AACGCCTTGT CGGGCAAGCA 
TGTTTAACGG CTCTATCTGT ATGCCGCAAA 
TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 
AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 
AAACACACGG CGGGGTTTGA 



This corresponds to the amino acid sequence <SEQ ID 570; ORF 143.a>: 

al4 3 .pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHSLGW FFIL PPLAGMLVQP 

51 IVG HYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYNPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VIiAAV QSVAAVICSF 

301 VLAKVPNKYH KAGY FGCLAL GALGFFSVFF I GNQY ALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFNGSIC MPQ IVASLLS FVLFPMLGGL 

4 01 QATMF LVGGV VLLLGAfSVF LI KETHGGV* ' ' ~~ 

ml43/al43 99.5% identity in 429 aa overlap 

10 . 20 30 40 50 60 

ml43 . pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I : I I I I I I I I I I I I I I I I I M II I I I I I I 
al4 3 MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHSLGWFFILPPLAGMLVQPIVGHYSDRTW . 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml43.pep KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 
al43 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAW 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 3 . pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAWAAILPFVFAYIGLANTAEKGWPQTVV 
I I I I ! I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al4 3 QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAVVAAILPFVFAYIGLANTAEKGVVPQTVV 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml43 . pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al4 3 VAFYVGAALLVITSAFTIFKVKEYNPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml43 . pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
al4 3 TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 3. pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al4 3 VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 

310 320 330 340 350 360 



ml 4 3 .pep 



370 380 390 400 410 420 

NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 
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al43 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I i ! I I I I 

NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGVVLLLGAFSVF 
370 380 390 400 410 420 



430 



ml 4 3 .pep 



al43 



LIKETHGGVX 
I I I I I I I I I I 
LIKETHGGVX 



430 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 571>: 



1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGGGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGTGC GTCTTCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC CCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTGAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGTAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCcgtTT 

351 CAACGCGGTG GCGGCAGACG GccgacggTt atCCCAACGA TTTGGatatT 

401 TCctaccgCT TGGACGAGGA CGGCCGGCTT ACCGTtaccT ATCGCGCCAC 

4 51 CGCgctCGGC GACACGGTGT TCGACCCGAC GCTGCACATT TACTGGCGGC 

501 TGGACGCGGG CCTGCACGAT GCGGTTCTGC ATATTCCGCA GGGCGGACAT 

551 ATTCCGGCCG ATGCCGAAAA ACTGCCCGTC TTAACGGTTT CAGACGGCCT 

601 CGAAGTATTT GA 



This corresponds to the amino acid sequence <SEQ ID 572; ORF 144.ng>: 



1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT C VFVLDLGGIV QEFSVL ADGV 

51 RENPWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRRLSQR FGYFLPLGRG RPAYRYLSRH 

151 RARRHGVRPD AAHLLAAGRG PARCGSAYSA GRTYSGRCRK TARLNGFRRP 

201 RSI* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 573>: 



1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGTCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGATGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAAC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTGg 

4 01 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

451 CCGCTTGGAC GAGGACGACC GGCTTACCGT TAcCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATGCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 



This corresponds to the amino acid sequence <SEQ ID 574; ORF 144>: 



1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT R VCVLDLGGIV QEFSVL ADGV 

51 RENLWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLATVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYA 

201 GRCRKTARLN GFRRPRSI* 



gl44 . seq 



gl44 .pep 



ml4 4 . seq 



ml 4 4 .pep 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ml44 / gl44 91.3% identity in 218 aa overlap 

10 20 30 40 50 60 

ml4 4 .pep MS DT PATRDFGLI DGRAVTGYVLSNRRGTRVCVLDLGGIVQE FS VLADGVRENLWS FDD 

I I M I I I I I I I I I I I I I I I i I I I I I I I I I I | I | | | I I | | | | | | | | | | | | M I I I I I I 
gl4 4 MS DT PATRDFGLI DGRAVTGYVLSNRRGTCVFVLDLGGIVQEFSVLADGVRENPVVS FDD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 4 . pep AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | 
gl44 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 4 . pep AADGRS WLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

I I I I II M I I II I I I I I I I I I I I I I I I I ! I I I I I I | | I | | | | | | | 

gl4 4 AAD GRRLSQRFG — YFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 

190 200 210 219 

ml44 . pep AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
gl4 4 AAGRGPARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
170 180 190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 575>: 

al44.seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTG. 

4 01 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

4 51 CCGCTTGGAC GAGGACGACC GGCTTACCGT TACCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATTCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 576; ORF 144.a>: 

al44 .pep 

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLA DGV 

51 RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRS WLR SRLXTVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYS 

201 GRCRKTARLN GFRRPRSI* 

ml44/al44 99.1% identity in 218 aa overlap 

10 20 30 40 50 60 

ml4 4 . pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 
I I I I I I ! I I ! I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 
al4 4 MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLVVSFDD 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 4 4 . pep AAS YADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
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I I I I I I I I I 1 1 I I I 1 1 1 1 I I I I I I 1 1 I I 1 1 1 1 I I I I I 1 1 I i I I I I I I I I I I I I I I I I I I I 

al4 4 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 4 . pep AADGRS WLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

I i I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I i I I I 
al4 4 AADGRS WLRSRLXTVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 170 180 



190 200 210 219 

AAGRG PARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
AAGRG PARCGSAYSAGRTYSGRCRKTARLNGFRRPRS IX 
190 200 210 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 577>: 

gl46. seq 

1 ATGAAGCAAA TCCCCCTCCG CCTTCTCCAG GTCGTCATTG ACCACGACAA 

51 AGTCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAa ctTCCCGACT GTCCGTCCCG CGCcctTTGA GGCGCGCGGC - 

151 AAGCACGTCG AAAGAAGGCG GCAGGATAAA GATACCGACA GCTTCCGGCA 

201 GCGCGTTGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACG CCCTCCGTGC TTGTGCCGTA 

301 ATAGTTGCCG AATACGTCTG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGATTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGGCG CGTCGTATGC GACACGGAAA CGCGCAAACC 

4 51 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAA CCGATTGTAT CACAACGGAC ACCCTATTTC 

601 ATATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 



ml 4 4 . pep 
al44 



This corresponds to the amino acid sequence <SEQ ID 578; ORF 146.ng>: 

gl46.pep 

1 MKQIPLRLLQ WIDHDKVEQ YGLFDFMPCL RQPPLDNFPT VRPAPFEARG 

51 KHVERRRQDK DTDSFRQRVA NLRRALNVDF QNHVIACRRQ R IHALRACAV 

101 IVAEYVCV FQ KSLLRDKRFK LFFGNKVIMY AVCFAFTRRA RRMRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPYF 

201 IFADAHILPL LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 579>: 

ml4 6 . seq 

1 ATGGCGCAAA TCCTCCTCCG CTCGCGCCAA GTCGTCATTG ACCACGACAA 

51 AGTCAAACAA TACGGACTGC TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GGCGCGCGGC 

151 AAGTACGTCG AAAGAAGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCGTCGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC CTGTGCCGTA 

301 ATAGTTGCCA AATACGTCGG CGTATTCCAA AAAAGCTTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGGCG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAATGGAC ACCCAGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 580; ORF 146>: 

ml 4 6. pep 

1 MAQILLRSRQ WIDHDKVKQ YGLLDFMPCL RQPPLDNFPT VRPASVEARG 

51 KYVERRRQDK DADGFGQRVA NLRRALNVDF QNHVIACRRQ R IHTLRACAV 

101 IVAKYVGV FQ KSFLRDKRLK LFFGNKVIMY AVCFAFTRRA RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQWTPSF 

201 LFADAHILPL LF* 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml46 / gl46 90.1% identity in 212 aa overlap 

10 20 30 40 50 60 

ml 4 6 . pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
I II M I I I I I I I | | : | | | | : | | | 1 | | | | | | | | | | | | | | | | I I | | i : | | | | | | | 1 
g!4 6 MKQIPLRLLQWIDHDKVEQYGLFDFMPCLRQPPLDNFPTVRPAPFEARGKHVERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 6 . pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 
I • I : I H I I I I I { I i I || I I I | | } | | | | | I | | : | ( | | | { | I {; | | 11111:11111:1 
g!4 6 DTDSFRQRVANLRRALNVDFQNHVIACRRQRIHALRACAVIVAEYVCVFQKSLLRDKRFK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 6 . pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

I I I I I I I I I I I I I I I hlllMMMlMMIIMMIM I I I Mill Mill! 

g!4 6 LFFGNKVIMYAVCFAFTRRARRMRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 170 180 

190 200 210 

ml4 6.pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 

I I I I I I I I I I I I I I I II 1:11111 I 1 

gl4 6 GHIFYLYIFQPIVSQRTPYFIFADAHILPLLFX 

190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 581>: 

al46. seq 

1 ATGGCGCAAA TCCTCCTCCG CCCGCGCCAA GTCATCATTG ACCACGACAA 

51 AATCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GACGCGCAGC 

151 AAGCACATCG AAAGACGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCATCTCG AACCTGAGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAACCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC TTGTGCCGTA 

301 ATAGTTGCCG AACACGTCCG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

4 01 TCGCCTTCAC GCGGCGGACG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAACGGAC ACCCGGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 



This corresponds to the amino acid sequence <SEQ ID 582; ORF 146.a>: 

al4 6.pep 

1 MAQILLRPRQ VIIDHDKIEQ YGLFDFMPCL RQPPLDNFPT VRPASVETRS 

51 KHIERRRQDK DADGFGQRIS NLSRALNVDF QNHVITCRRQ RIHTLRACAV 

101 IVAEHVRVFQ KSLLRDKRLK LFFGNKVIMY AVCFAFTRRT RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPGF 

201 LFADAHILPL LF* 



ml46/al46 90.6% identity in 212 aa overlap 

10 20 30 40 50 60 

ml 4 6 . pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
Mil IN I M: I I I M:: j| ||:|| M II I M I I I I I I I! I II II I: | : |:: M I II I I 
a146 MAQILLRPRQVIIDHDKIEQYGLFDFMPCLRQPPLDNFPTVRPASVETRSKHIERRRQDK 

1° 20 30 40 50 60 

70 80 90 100 110 120 
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ml 4 6 . pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 

I | I I 1 I I I : : I I I I I I I I I I I I I I : I ! I I I I i I ! I I I I I I I I I I I I I I : I I I I I I I 
al4 6 DADGFGQRISNLSRALNVDFQNHVITCRRQRIHTLRACAVIVAEHVRVFQKSLLRDKRLK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 6 . pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

I | | | | I | I I I I I I I i I I I I : I I I I I I I I 11 i I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! 
al4 6 LFFGNKVIMYAVCFAFTRRTRRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 170 180 

190 200 210 

ml 4 6 . pep GHI FYLYI FQPIVSQWTPSFLFADAHI LPLLFX 

I I I I I I I I I I I I I I I I I : M t I I I I I I I I I I I 
a 14 6 GHI FYLYIFQPIVSQRTPGFLFADAHI LPLLFX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 583>: 

gl47 . seq (partial) 

1 . . ATGCGACGAG AAGCCAAAAT GGCACAAATC ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CGCCCAAGCG CATGAAACTG 

101 AGCAATCGGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCGACTT CGGGGCTGCT GCACACTTCG ACCGCCTCCG ACAAAATCAT 

201 CTCCGGCGAT ACTTTGCGCC AAAAAGCCGT CAACTTGGGC GACGCTTTGG 

251 ACGGCGTACC GGGCATCCAC GCTTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC GGGCAGACGG ATTAAAGTAT TGAACCATCA 

351 CGGCGAAACG GGCGATATGG CGGACTTTTC TCCCGATCAC GCCATTATGG 

401 TAGATACCGC CTTGTCGCAA CAGGTTGAAA TCCTGCGCGG GCCGGTTACG 

4 51 CTCTTGTACA GCTCGGgcaa tgtggccgGG GCTGGtcaat gttgccgatg 

501 gAAAAAtccc ccaaaaAAtg cc. . 

This corresponds to the amino acid sequence <SEQ ID 584; ORF 147.ng>: 

gl47.pep (partial) 

1 . . MRREAKMAQI TLKPIVLSIL LINTPLLAQA HETEQSVGLE TVSWGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDTALSQ QVEILRGPVT 

151 LLYSSGNVAG AGQCCRWKNP PKNA. . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 585>: 

ml47 .seq (partial) 

1 . . CCGCATAAAA CTGAGCAATC GGTGGATTTG GAAACGGTCA GCGTCGTCGG 

51 CAAAAGCCGT CCGCGCGCCA CGTCGGGGCT GTTGCACACT TCGACCGCCT 

101 CCGACAAAAT CATCTCCGGC GATACCTTGC GCCAAAAAGC CGTCAACTTG 

151 GGCGACGCTT TAGACGGCGT ACCGGGCATC CACGCTTCGC AATACGGCGG 

201 CGGCGCGTCT GCTCCCGTCA TTCGCGGTCA AACAGGCAGG CGGATTAAAG 

251 TGTTGAACCA TCACGGCGAA ACAGGCGATA TGGCGGATTT TTCGCCCGAT 

301 CACGCCATTA TGGTAGATAC CGCCTTGTCG CAACAGGTCG AAATCCTGCG 

351 CGGGCCGGTT ACGCTCTTGT ACAGCTCGGG CAATGTGGCG GGGCTGGTCG 

401 ATGTTGCCGA TGGCAAAATC CCCGAAAAAA TGCCTGAAAA CGGCGTATCG 

451 GGCGAACTCG GATTGCGTTT GAGCAGCGGC AATCTGGAAA AACTCACGTC 

501 CGGCGGCATC AATATCGGTT TGGGCAAAAA CTTTGTATTG CACACGGAAG 

551 GGCTGTACCG CAAATCGGGG GATTACGCCG TACCGCGTTA CCGCAATCTG 

601 AAACGCCTGC CCGACAGCCA CGCCGATTCG CAAACGGGCA GCATCGGGCT 

651 GTCTTGGGTT GGCGAAAAAG GTTTTATCGG CGTAGCGTAC AGCGACCGTC 

701 GCGACCAATA TGGTCTGCCT GCCCACAGCC ACGAATACGA TGATTGCCAC 

751 GCCGACATCA TCTGGCAAAA GAGCTTGATT AACAAACGCT ATTTACAGCT 

801 TTATCCGCAC CTGTTGACCG AAGAAGACAT CGATTACGAC AATCCGGGCT 

851 TGAGCTGCGG CTTCCACGAC GACGATAATG CACACGCACA CACCCACAGC 

901 GGCAGACCGT GGATAGACCT GCGCAACAAA CGCTACGAAC TCCGTGCCGA 

951 ATGGAAGCAA CCGTTCCCCG GTTTTGAAGC CCTGCGCGTA CACCTGAACC 

1001 GCAACGACTA CCGCCACGAC GAAAAAGCAG GCGATGCAGT CGAAAACTTT 

1051 TTTAACAACC AAACGCAAAA CGCCCGCATC GAGTTGCGCC ACCAACCCAT 
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1101 AGGTCGTCTG AAAGGCAGCT GGGGCGTGCA ATATTTACAA CAAAAATCCA 

1151 GTGCTTTATC TGCCATATCC GAAGCGGTTA AACAACCGAT GCTGCTTGAC 

1201 AACAAAGTGC AACATTACAG CTTTTTCGGT GTAGAACAGG CAAACTGGGA 

1251 CAACTTCACG CTTGAAGGAG GCGTACGCGT GGAAAAACAA AAAGCCTCCA 

1301 TTCAGTACGA CAAAGCATTG ATTGATCGGG AAAACTACTA CAACCACCCC 

1351 CTGCCCGACC TCGGCGCGCA CCGCCAAACC GCCCGCTCAT TCGCACTTTC 

1401 GGGCAACTGG TATTTCACGC CACAACACAA ACTCAGCCTG ACCGCCTCCC 

1451 ATCAGGAACG CCTGCCGTCA ACGCAAGAGC TGTACGCACA CGGCAAACAC 

1501 GTCGCCACCA ACACCTTTGA AGTCGGCAAC AAACACCTCA ACAAAGAGCG 

1551 TTCCAACAAT ATCGAACTCG CGCTGGGCTA CGAAGGCGAC CGCTGGCAAT 

1601 ACAATCTGGC ACTCTACCGC AACCGCTTCG GTAACTACAT TTACGCCCAA 

1651 ACCTTAAACG ACGGACGCGG CCCCAAATCC ATCGAAGACG ACAGCGAAAT 

1701 GAAGCTCGTG CGCTACAACC AATCCGGCGC CGACTTCTAC GGCGCGGAAG 

17 51 GCGAAATCTA CTTCAAACCG ACACCGCGCT ACCGCATCGG CGTTTCCGGC 

1801 GACTATGTAC GAGGCCGTCT GAAAAACCTG CCTTCCCTAC CCGGCAGAGA 

1851 AGATGCCTAC GGCAACCGTC CTTTCATCGC ACAGGACGAC CAAAATGCCC 

1901 CCCGTGTTCC GGCTGCGCGC CTCGGCTTCC ACCTGAAAGC CTCGCTGACC 

1951 GACCGTATCG ATGCCAATTT GGACTACTAC CGCGTGTTCG CCCAAAACAA 

2001 ACTCGCCCGC TACGAAACGC GCACGCCCGG ACACCATATG CTCAACCTCG 

2051 GCGCAAACTA CCGCCGCAAT ACGCGCTATG GCGAGTGGAA TTGGTACGTC 

2101 AAAGCCGACA ACCTGCTCAA CCAATCCGTT TACGCCCACA GCAGCTTTCT 

2151 CTCTGATACG CCGCAAATGG GCCGCAGCTT TACCGGCGGC GTGAACGTGA 

2201 AGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 586; ORF 147>: 

ml47.pap (partial) 

1 . . PHKTEQSVDL ETVSVVGKSR PRATSGLLHT STASDKIISG DTLRQKAVNL 

51 GDALDGVPGI HASQYGGGAS APVIRGQTGR RIKVLNHHGE TGDMADFSPD 

101 HAIMVDTALS QQVEILRGPV TLLYSSGNVA GLVDVADGKI PEKMPENGVS 

151 GELGLRLSSG NLEKLTSGGI NIGLGKNFVL HTEGLYRKSG DYAVPRYRNL 

201 KRLPDSHADS QTGSIGLSWV GEKGFIGVAY SDRRDQYGLP AHSHEYDDCH 

251 ADIIWQKSLI NKRYLQLYPH LLTEEDIDYD NPGLSCGFHD DDNAHAHTHS 

301 GRPWIDLRNK RYELRAEWKQ PFPGFEALRV HLNRNDYRHD EKAGDAVENF 

351 FNNQTQNARI ELRHQPIGRL KGSWGVQYLQ QKSSALSAIS EAVKQPMLLD 

401 NKVQHYSFFG VEQANWDNFT LEGGVRVEKQ KASIQYDKAL IDRENYYNHP 

451 LPDLGAHRQT ARSFALSGNW YFTPQHKLSL TASHQERLPS TQELYAHGKH 

501 VATNTFEVGN KHLNKERSNN IELALGYEGD RWQYNLALYR NRFGNYIYAQ 

551 TLNDGRGPKS IEDDSEMKLV RYNQSGADFY GAEGEIYFKP TPRYRIGVSG 

601 DYVRGRLKNL PSLPGREDAY GNRPFIAQDD QNAPRVPAAR LGFHLKASLT 

651 DRIDANLDYY RVFAQNKLAR YETRTPGHHM LNLGANYRRN TRYGEWNWYV 

701 KADNLLNQSV YAHSSFLSDT PQMGRSFTGG VNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml47 / gl47 92.3% identity in 142 aa overlap 

10 20 30 

ml 4 7 .pep PHKTEQSVDLETVSWGKSRPRATSGLLHTS 

1:11111 I I I I I I I I I I I I I I I I II I I | I 
9147 MRREAKMAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSVVGKSRPRATSGLLHTS 

10 20 30 40 50 60 

40 50 60 70 80 90 

ml47 .pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
M II I I I II I I I I I I I I I I I I I I I I I | | | | | | | | | | | | M | | | I | | | | | | | | | | | | | | j | 
g!47 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 

70 80 90 100 110 120 

100 110 120 130 140 150 

ml 4 7 .pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 

M I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I : I I I 
gl47 GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGAGQCCRWKNPPKNA 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 587>: 

al47 . seq 

1 ATGCGACGAG AAGCCAAAAT GGCACAAACT ACACTCAAAC CCATTGTTTT 
51 ■ ATCAATTCTT TTAATCAACA CACCCCTCCT CTCCCAAGCG CATGGAACTG 

101 AGCAATCAGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCCACTT CGGGGCTGCT GCACACTTCT ACCGCCTCCG ACAAAATCAT 

201 CAGCGGCGAC ACCTTGCGAC AAAAAGCCGT CAACTTGGGT GATGCTTTAG 

251 ACGGCGTACC GGGCATTCAT GCCTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC AGGCAGACGG ATTAAAGTGT TGAACCATCA 

351 CGGCGAAACG GGCGACATGG CGGACTTCTC TCCAGACCAT G CAAT CAT GG 

4 01 TGGACAGCGC CTTGTCGCAA CAGGTCGAAA TCCTGCGCGG TCCGGTTACG 

451 CTCTTGTACA GCTCGGGCAA TGTGGCGGGG CTGGTCGATG TTGCCGATGG 

501 CAAAATCCCC GAAAAAATGC CTGAAAACGG CGTATCGGGC GAACTCGGAT 

551 TGCGTTTGAG CAGCGGCAAT CTGGAAAAAC TCACGTCCGG CGGCATCAAT 

601 ATCGGTTTGG GCAAAAACTT TGTATTGCAC ACGGAAGGGC TGTACCGCAA 

651 ATCGGGGGAT TACGCCGTAC CGCGTTACCG CAATCTGAAA CGCCTGCCCG 

701 ACAGCCACGC CGATTCGCAA ACGGGCAGCA TCGGGCTGTC TTGGGTTGGC 

751 GAAAAAGGCT TTATCGGCGC AG C AT AC AG C GACCGTCGCG ACCAATATGG 

801 TCTGCCTGCC CACAGCCACG AATACGATGA TTGCCACGCC GACATCATCT 

851 GGCAAAAGAG TTTGATTAAC AAACGCTATT TGCAGCTTTA TCCGCACCTG 

901 TTGACCGAAG AAGACATCGA TTACGACAAT CCGGGCTTGA GCTGCGGCTT 

951 TCACGACGAC GATGATGCAC ACGCCCATGC CCACAACGGC AAACCTTGGA 

1001 TAGACCTGCG CAACAAACGC TACGAACTCC GCGCCGAATG GAAGCAACCG 

1051 TTCCCCGGTT TTGAAGCCCT GCGCGTACAC CTGAACCGCA ACG ACT AC CG 

1101 CCACGACGAA AAAGCAGGCG ATGCAGTAGA AAACTTTTTT AACAACCAAA 

1151 CGCAAAACGC CCGTATCGAG TTGCGCCACC AACCCATAGG CCGTCTGAAA 

1201 GGCAGCTGGG GCGTGCAATA TTTGGGACAA AAATCCAGTG CTTTATCTGC 

1251 CACATCCGAA GCGGTCAAAC AACCGATGCT GCTTGACAAT AAAGTGCAAC 

1301 ATTACAGCTT TTTCGGTGTA GAACAGGCAA ACTGGGACAA CTTCACGCTT 

1351 GAAGGCGGCG TACGCGTGGA AAAACAAAAA GCCTCCATCC GCTACGACAA 

1401 AGCATTGATT GATCGGGAAA ACTACTACAA CCATCCCCTG CCCGACCTCG 

14 51 GCGCGCACCG CCAAACCGCC CGCTCATTCG CACTTTCGGG CAACTGGTAT 

1501 TTCACGCCAC AACACAAACT CAGCCTGACC GCCTCCCATC AGGAACGCCT 

1551 GCCGTCAACG CAAGAGCTGT ACGCACACGG CAAACACGTC GCCACCAACA 

1601 CCTTTGAAGT CGGCAACAAA CACCTCAACA AAGAGCGTTC CAACAATATC 

1651 GAACTCGCGC TGGGCTACGA AGGCGACCGC TGGCAATACA ATCTGGCACT 

1701 CTACCGCAAC CGCTTCGGCA ACTACATTTA CGCCCAAACC TTAAACGACG 

1751 GACGCGGCCC CAAATCCATC GAAGACGACA GCGAAATGAA GCTCGTGCGC 

1801 TACAACCAAT CCGGTGCGGA CTTCTACGGC GCGGAAGGCG AAATCTACTT 

1851 CAAACCGACA CCGCGCTACC GCATCGGCGT TTCCGGCGAC TATGTACGAG 

1901 GCCGTCTGAA AAACCTGCCT TCCCTACCCG GCAGGGAAGA CGCCTACGGC 

1951 AACCGCCCAC TCATTGCCCA AGCCGACCAA AACGCCCCTC GCGTTCCGGC 

2001 TGCGCGCCTC GGCGTCCACC TGAAAGCCTC GCTGACCGAC CGCATCGATG 

2051 CCAATTTGGA CTACTACCGC GTGTTCGCCC AAAACAAACT CGCCCGCTAC 

2101 GAAACGCGCA CGCCCGGACA CCATATGCTC AACCTCGGCG CAAACTACCG 

2151 CCG CAAT ACG CGCTATGGCG AGTGGAATTG GTACGTCAAA GCCGACAACC 

2201 TGCTCAACCA ATCCGTTTAC GCCCACAGCA GCTTCCTCTC TGATACGCCG 

2251 CAAATGGGCC GCAGCTTTAC CGGCGGCGTG AACGTGAAGT TTTAA 

This corresponds to the amino acid sequence <SEQ ID 588; ORP 147.a>: 

al47 . pep 

1 MRREAKMAQT TLKPIVLSIL LINTPLLSQA HG TEQSVGLE TVSVVGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDSALSQ QVEILRGPVT 

151 LLYSSGNVAG LVDVADGKIP EKMPENGVSG ELGLRLSSGN LEKLTSGGIN 

201 IGLGKNFVLH TEGLYRKSGD YAVPRYRNLK RLPDSHADSQ TGSIGLSWVG 

251 EKGFIGAAYS DRRDQYGLPA HSHEYDDCHA DIIWQKSLIN KRYLQLYPHL 

301 LTEEDIDYDN PGLSCGFHDD DDAHAHAHNG KPWIDLRNKR YELRAEWKQP 

351 FPGFEALRVH LNRNDYRHDE KAGDAVENFF NNQTQNARIE LRHQPIGRLK 

4 01 .GSWGVQYLGQ KSSALSATSE AVKQPMLLDN KVQHYSFFGV EQANWDNFTL 

4 51 EGGVRVEKQK ASIRYDKALI DRENYYNHPL PDLGAHRQTA RSFALSGNWY 

501 FTPQHKLSLT ASHQERLPST QELYAHGKHV ATNTFEVGNK HLNKERSNNI 

551 ELALGYEGDR WQYNLALYRN RFGNYIYAQT LNDGRGPKSI EDDSEMKLVR 
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601 YNQSGADFYG AEGEIYFKPT PRYRIGVSGD YVRGRLKNLP SLPGREDAYG 

651 NRPLIAQADQ NAPRVPAARL GVHLKASLTD RIDANLDYYR VFAQNKLARY 

701 ETRTPGHHML NLGANYRRNT RYGEWNWYVK ADNLLNQSVY AHSSFLSDTP 

751 QMGRSFTGGV NVKF* 

m!47/al47 98.1% identity in 734 aa overlap 

10 20 30 

ml 4 7 . pep PHKTEQS VDLETVS WGKSRPRATSGLLHTS 

I MM! M M M I M M M M M M M I 
al47 MRREAKMAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVS WGKSRPRATSGLLHTS 

10 20 30 40 50 60 

40 . 50 60 70 80 90 

ml47 . pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
III INI I II IIIIIIIMI llllllllllllll MIIIMII llllll Mill llllll 
al47 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 

70 80 90 100 110 120 

100 110 120 130 140 150 

ml47 . pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
II I II I I I I M I I I Ml I II I I I I I I I II II I I I I II I II I I I I I I I ! II I M I I II I II 
al4 7 GDMADFSPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 

130 140 150 160 170 180 

160 170 180 190 200 210 

ml 4 7. pep ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 

I M I M I M M M M M I M M I I I I I I M I M I I M M M M M II I I M M M I M II 
al 4 7 ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 

190 200 210 220 230 240 

220 230 240 250 260 270 

ml47.pep TGSIGLSWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 

II I II I I I I I II I M I : II I II I M II I II II I I I I I I I II II II M I I II I I II II M I 
a!47 TGSIGLSWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 

250 260 270 280 290 300 

280 290 300 310 320 330 

ml47 .pep LTEEDIDYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVH 
M I ! I I I I II I I I II II I II I : I M M M M I I II I I I i M M I I 1 I I I II M I M II II 
al47 LTEEDIDYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVH 

310 320 330 340 350 360 

340 350 360 370 380 390 

ml 4 7 . pep LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISE 
I I I I I M I II I II II II M II II I M I II II I I I II II I I I I II II M II I II I I I I I 
al47 LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSE 

370 380 390 400 410 420 

400 410 420 430 440 450 

ml47 . pep AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPL 

I I I M I II II II I I II II I II I II I II I I II II I II M I I II M II II I II I M II I II I 
al47 AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPL 

430 440 450 460 470 480. 

460 470 480 490 500 510 

ml 4 7 . pep PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 

II I I I I I I I II II I II I I M M M M II I II II I I II II I I I II ! II I II II M I M I I I 
al4 7 PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 

490 500 510 520 530 540 

520 530 540 550 560 570 

ml 4 7 . pep HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 
I I I I I I I I I II M M M I M II I I i I M I I M M I M I! M M II I I i I I I I I I I M M I 
al47 HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 
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550 560 570 580 590 600 

580 590 600 610 620 630 

ml 4 7 . pep YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQ 

I I ! I I I I I I I I I I I I I I I I I I I. I I I I I I I I ! I I | | I | j | | | | M | | | | : | | | i| 

al4 7 YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQ 

610 620 630 640 650 660 

640 650 660 670 680 690 

ml4 7 . pep NAPRVPAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 
I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M | I | | | | | | | | | | ! | | | | | | | | | | 
al4 7 NAPRVPAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 

670 680 690 700 710 720 

700 710 720 730 

ml 4 7 . pep RYGEWNW YVKADNLLNQS V YAHS S FLS DT PQMGRS FTGGVNVKFX 

I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I i I I | I I I I I 
a 1 4 7 RYGEWNW YVKADNLLNQS V YAHS S FLS DT PQMGR S FTGGVNVKFX 

730 740 750 760 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 589>: 

gl4 8.seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGctgg ttcaTCCCGA 

51 AgctATgagt gtcggcgCGC TTGccgAcaa AATCCGCAAA AtcgaAAact 

101 gGCCGCAAAA AGgcaTCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGT 

151 GCGGAATACT TCCGCCTTTT GGTCGATTTG CTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGCTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAaCGtcg gctTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTAcg cgcTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGa tgccgTCAAA CCCGGTTCGC 

4 01 GCGTCCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC AATGCTTGCC 

4 51 GGGCTGGAAC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAgccgccgC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGCGCAAGTG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGCAT GAAAGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 590; ORF 148.ng>: 

gl48 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK PGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 591 >: 

ml48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGCGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTTCAAAGC 

151 GCGGAATACT TCCGCCTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGATTGCCA CGGGCGGCAC GATGCTTGCC 

4 51 GGACTGGAAC TGATCCGCAA ACTCGGCGGA GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 592; ORF 148>: 

ml48 . pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 
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101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LIATGGTMLA 
151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml48 / gl48 99.0% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8 . pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 
I I I I I I I I M I I II I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | j | | | | | | | j | | | , 
g!4 8 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 8. pep LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

MMIIIIIIIMIIII I I I I I I i I I I I I I I I I 1 I I I I I I i i I I I I I I I f I I I I 

gl 4 8 LVYRYMDQKIDIVAGLDARGFTIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 8. pep AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILE FTDLQGGKNI 

I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I i II I I I I I I I I I i I I I I I I I I M I 
g!4 8 AVEIHTDAVKPGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 

130 140 150 160 170 ^ 180 

190 200 
ml 4 8 . pep RASGAPLFTLLQNEGCMKGX 
1 I 11 I I I I I I I I I I I I I I I I 
gl 4 8 RASGAPLFTLLQNEGCMKGX 

190 200 

The following partial DNA sequence was identified in JV. meningitidis <SEQ ID 593>: 

al48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGTGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGC 

151 GCGGAATACT TCCGACTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC GATGCTTGCC 

4 51 GGACTGGAGC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 594; ORF 148. a>: 

al4 8.pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

ml48/al48 99.5% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8 . pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQS AEYFRLLVDL 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
al4 8 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQS AEYFRLLVDL 

10 20 30 40 50 60 



7 ° 80 90 100 110 120 
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ml 4 8 .pep 
a!48 



LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
J 1 1 1 1 1 1 1 1 ' 1 1 I I ' I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | || | | | |f | | Ml | 

LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
70 8 0 90 100 110 120 

130 140 150 160 170 180 

ml 4 8 . pep AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILE FTDLQGGKNI 

1 1 1 1 1 ' 1 1 1 1 1 1 ' I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I 
AVE IHTDAVKLGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILE FTDLQGGKNI 
130 140 150 160 170 180 



al48 



190 200 
ml 4 8 . pep RASGAPLFTLLQNEGCMKGX 
I M I I I I I II I I I I I I I I I I 
a 1 4 8 RASGAPLFTLLQNEGCMKGX 

190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 595>- 

gl4 9.seq ~ " 

1 ATGTTGATTG ACAACAATGT CCGCCATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAATTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGGTAC GACAAAGCAT TGATTGATCG AGAAAACTAC 

151 TACAACCAGC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 GTTCGCACTT TCGGGCAACT GGTATTTCAC GCCACACCAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAa cgCCTGCCGT CAACGCaagA actGtACgca 

301 cacggcAAGC ACGtcgccac CAACACCTTT GAagtcggca acaaACACCT 

351 CAACAAAGaG CgttccaacA atatcgaACT CGCGCTGGgc tAcaaaggcg 

4 01 accGCTGGCA ATACAATCTG GCAGCCTACC GCAACCGAtT CGGCAACTAC 

451 ATTTACGCCC AAACCTTAaa cgacggacgC GGCCCCAAAT CCATCgaaga 

501 cgacagcgaA ATGaagcTCG TGCGCTACAA CCAATCCGGT GCCGACTTCT 

551 ACGgcgcggA aggcgaaatc tACTTcaaaC CGAcACCGCG CTACCGCATC 

601 GGTGTTTCCG GCGACTatgt acgaggccgT CTGAAAAACC TGCCGTCCCT 

651 ACCCGGCAGG gaagatccCT AcggcAAACG TCccttcaTC GCACAAGCCG 

701 ACCAAAACGC CCCCCGCATT ccggctGCGC GCCTCGGCTT CCACCTGAAA 

751 ACCTCGCTAA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGTACGCCC GGACACCATA 

851 TGCTCAACCT CGGTGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCtc aACcaatCcg tTTACGCCCa 

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAt gGGCCGCAGC TTtgccgGCg 

1001 gcgtaAACGT GaAGTTttaA " ' 

This corresponds to the amino acid sequence <SEQ ID 596* ORF 149 ne>- 
gl49.pep ' 6 * 

1 MLIDNNVRHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNQPLPDLGA HRQTARSFAL SGNWYFTPHH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YKGDRWQYNL AAYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDPYGKRPFI AQADQNAPRI PAARLGFHLK 

251 TSLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FAGGVNVKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 597> 

ml49.seq " 

1 ATGCTGCTTG ACAACAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG AGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATTCAGTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCACC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

401 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGTAACTAC 
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4 51 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGC GCCGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGA GAAGATGCCT ACGGCAACCG TCCTTTCATC GCACAGGACG 

701 ACCAAAATGC CCCCCGTGTT CCGGCTGCGC GCCTCGGCTT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

8 01 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTT CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 598; ORF 149>: 

ml49.pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIQY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

2 01 GVSGDYVRGR LKNLPSLPGR EDAYGNRPFI AQDDQNAPRV PAARLGFHLK 
251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

3 01 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 149 shows 95.9% identity over a 339 aa overlap with a predicted ORF (ORF 149.ng) 
from N. gonorrhoeae: 

m!49/gl49 



10 20 30 40 50 60 

ml 4 9 . pep MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGA 

I I : I h h I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | : I I II t I I I I I M h I I I I I I I 
gl49 MLIDNNVRHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNQPLPDLGA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml4 9 . pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! M 1 1 1 1 1 1 1 1 II 1 1 1 M I M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

gl4 9 HRQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml49 .pep RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

II II I IMMM I MMM I I M 1 1 IM MM MM Ml II Ml M 1 1 1 1 ! M M I 

gl49 RSNNIELALGYKGDRWQYNLT^AYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml49 .pep • ADFYGAEGEI YFKPTPRYRI GVSGDYVRGRLKNLPSLPGREDAYGNRPFI AQDDQNAPRV 

II I II I II I II 1 1 II III II I MM MINIMI MM I II I IhllMII I II II M 

gl49 ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml49 . pep PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGAN YRRNTRYGEW 

III II I II IMM IMIIII IMIIMIMIIM IMM 1 1 1 1 II III II 1 1 1 1 II MM 

gl4 9 PAARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

250 260 270 280 290 300 



ml4 9 .pep 



310 320 330 340 

NW YVKADNLLNQS VYAHS S FLSDTPQMGRS FTGGVNVKFX 

II M 1 1 1 1 1 1 II 1 1 M 1 1 II MIMI Ml I M Mill II I 
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gl4 9 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFAGGVNVKFX 

310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 599>: 

al4 9 . seq 

1 ATGCTGCTTG ACAATAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGCTAC GACAAAGCAT T GAT T GAT CG GGAAAACTAC 

151 TACAACCATC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

401 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGCAACTAC 

451 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGT GCGGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGG GAAGACGCCT ACGGCAACCG CCCACTCATT GCCCAAGCCG 

701 ACCAAAACGC CCCTCGCGTT CCGGCTGCGC GCCTCGGCGT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGCAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAAT CCG TTTACGCCCA 

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 600; ORF 149.a>: 

al49 .pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSG DYVRGR LKNLPSLPGR EDAYGNRPLI AQADQNAPRV PAARLGVHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

ml49/al49 98.8% identity in 339 aa overlap 

10 20 30 40 50 60 

ml4 9.pep MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I t : I I I I I I I I I I I I I I I I I I I I I 
al4 9 MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 9 . pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I 
al4 9 HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9. -pep RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al4 9 RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 9. pep ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRV 

1 I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I) I I I I I : I I I I I I I I | I 
al4 9 ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRV 

190 200 210 220 230 240 



ml 4 9. pep 



250 260 270 280 290 300 

PAARLG FHLKAS LT DRI DANLDY YRVFAQNKLARYETRT PGHHMLNLGAN YRRNTRYGEW 
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l l I I l l 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al4 9 PAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

250 260 270 280 290 300 

310 320 330 340 

ml 4 9 . pep NW YVKADNLLNQS V YAHS S FLS DT PQMGRS FTGGVNVKFX 

I I I II I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
al4 9 NWYVKADNLLNQSVYAHSS FLS DTPQMGRS FTGGVNVKFX 

310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 60 1>: 

gl49-l . aeq 

1 ATGGCACAAA TCACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCGAC TTCGGGGCTG 

151 CTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACTTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT GGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACGGGCAGAC GGATTAAAGT ATTGAACCAT CACGGCGAAA CGGGCGATAT 

351 GGCGGACTTT TCTCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTTGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGAAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGaagccgG ATTGCGTTTG AGCAGCGGCA 

551 ATTTAGAAAA ACTGACATCC GCAGGCATCA ATATCGGACT GGGCAAAAAC 

601 TTCGTGCTGC ATACCGAAGG CTTGTACCGC AAATCGGGCG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAT GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTGG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCGCTAC GGCCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATCA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACT TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG GCGACGGTGC 

951 ACACGCACAC ACCCACAACG GCAAACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAGC CATTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ATCTGAACCG CAATGACTAC CACCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTCT TCAACAACAA AACACACAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG CGCGCTTTCC GCCATTCCCG AAACCGTCCA 

1251 ACAACCGATG TTGATTGACA ACAATGTCCG CCATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAATTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGGTACGAC AAAGCATTGA TTGATCGAGA 

1401 AAACTACTAC AACCAGCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCGTT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACACCACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAACT 

1551 GTACGCACAC GGCAAGCACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA GCCTACCGCA ACCGATTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGT GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CGTCCCTACC CGGCAGGGAA GATCCCTACG GCAAACGTCC CTTCATCGCA 

1951 CAAGCCGACC AAAACGCCCC CCGCATTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAACC TCGCTAACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG TACGCCCGGA 

2101 CACCATATGC TCAACCTCGG TGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TAAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 602; ORF 149-l.ng>: 

gl49-l .pep 

1 MAQITLKPIV LSILLINTPL LAQA HETEQS VGLETVSWG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRI KVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGEAGLRL SSGNLEKLTS AGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDRY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDGDGAHAH THNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY HHDEKAGDAV ENFFNNKTHN ARIELRHQPI GRLKGSWGVQ 

401 YLGQKSSALS AIPETVQQPM LIDNNVRHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NQPLPDLGAH RQTARS FALS GNWYFTPHHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA AYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DPYGKRPFIA 

651 QADQNAPRIP AARLGFHLKT SLTDRI DANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 603>: 

ml49-l . soq 

1 ATGGCACAAA • CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGATTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC GTCGGGGCTG 

151 TTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACCTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT AGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCGTCTG CTCCCGTCAT TCGCGGTCAA 

301 ACAGGCAGGC GGATTAAAGT GTTGAACCAT CACGGCGAAA CAGGCGATAT 

351 GGCGGATTTT TCGCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTCGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG TTTTATCGGC 

751 GTAGCGTACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGCTTGATTA 

851 ACAAACGCTA TTTACAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG ACGATAATGC 

951 ACACGCACAC ACCCACAGCG GCAGACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGTGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCG 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTC GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGCATCG 

1151 AGTTGCGCCA CCAACCCATA GGTCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTACAAC AAAAATCCAG TGCTTTATCT GCCATATCCG AAGCGGTTAA 

1251 ACAACCGATG CTGCTTGACA ACAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGAGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT TCAGTACGAC AAAGCATTGA TTGATCGGGA 

1401 AAACTACTAC AACCACCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 TAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGCGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGAGAA GATGCCTACG GCAACCGTCC TTTCATCGCA 

1951 CAGGACGACC AAAATGCCCC CCGTGTTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTTCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 604; ORF 149-1>: 

ml49-l.pep 

1 MAQTTLKPIV LSILLINTPL LAQA HETEQS VDLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 VAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDNAHAH THSGRPWIDL RNKRYELRAE WKQPFPGFEA 

351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

401 YLQQKSSALS AISEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIQYD KALIDRENYY NHPLPDLGAH RQTARS FALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 
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551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPFIA 

651 QDDQNAPRVP AARLGFHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

ml49-l/gl49-l 96.2% identity in 758 aa overlap 

10 20 30 40 50 60 

ml4 9-l.pep MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 
III I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl49-l MAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSVVGKSRPRATSGLLHTSTASDKI 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml49-l.pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 1 1 ! 1 1 I I I I I I I I I I I I I I I I I I I I I I I I 

gl49-l ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 9-l .pep SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I I I I I I I I I I I I I II I I I I II I I I 1 1 I I I I I I I I I I I I I 1 I I I 1 I I I I I I II I I I INI 

gl4 9-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGEAGLRL 
130 140 150 160 170 180 

190 200 210 220 230 240 

ral49-l.pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I I I I I I I I I I : I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I 
gl4 9-l SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml49-l.pep SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
I I I I I I I I I I : I I I I I I I : I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
gl49-l SWVGEKGFIGAAYSDRRDRYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 9 - 1 . pep DYDN PGLSCGFHDDDNAH AHTHSGRPWI DLRNKRYELRAEWKQP FPGFEALRVHLNRNDY 
I I I I I I I I I I I I I I : II I I I I : I : I I I I I I I II I I I I I I I II I I I I I I I I I I I I II M I 
gl 4 9 - 1 DYDNPGLSCGFHDGDG AHAHTHNGKPWI DLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 9-1. pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
: I I I I I I I I I I II I I I : I : I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : I : I I I 
gl49-l HHDEKAGDAVENFFNNKTHNARIELRHQPIGRLKGSWGVQYLGQKSSALSAIPETVQQPM 

370 • 380 390 400 410 420 

430 440 450 460 470 480 

ml 4 9 - 1 . pep LLDNKVQH YS FFGVEQANWDNFTLEGG VRVEKQKAS IQYDKALI DRENY YNH PLPDLGAH 
I : I I : I : I I I I I I I II I I I I I I I I I I I I I I I I II I I I : I I I I I I I I I I I I I : I I I I I I II 
g 1 4 9 - 1 LI DNNVRHYS FFGVEQANWDNFTLEGGVRVEKQKAS I RYDKALI DRENY YNQPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml4 9-l.pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
I I I I I I I I I II II I I I I : I I II I I I I I I II i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl49-l RQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 49-1. pep SNN I ELALG YEGDRWQYNL ALYRNRFGN YI YAQTLNDGRGPKS I EDDSEMKLVRYNQSGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 
g 1 4 9 - 1 SNN I ELALGY EGDRWQYNLAAYRNRFGN YI YAQTLNDGRGPKS I EDDSEMKLVRYNQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml4 9-l .pep DFYGAEGEIYFKPTPRYRI GVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I ICIIIIII I I I I I I : I 
gl4 9-l DFYGAEGEIYFKPTPRYRI GVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRIP 

610 620 630 640 650 660 



FCT/usyy/uy34o 
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670 680 690 700 710 720 

ml49-l .pep AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 
I I I I I I I I I : I I ! I I I I I I I I I I I I I 1 I I I I I I I I 1 I I I I I I I I I I I I I I 1 I I I I I 11 I I 
gl4 9-l AARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 

670 680 690 700 710 720 

730 740 750 759 

ml4 9-l.pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

1 1 1 1 1 1 1 II i 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gl49-l W YVKADNLLNQS V YAHS S FLS DT PQMGRS FTGGVNVKFX 

730 740 750 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 605>: 

al4 9-l . seq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCTCCCAAG CGCATGGAAC TGAGCAATCA GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC TTCGGGGCTG 

151 CTGCACACTT CTACCGCCTC CGACAAAATC ATCAGCGGCG ACACCTTGCG 

201 ACAAAAAGCC GTCAACTTGG GTGATGCTTT AGACGGCGTA CCGGGCATTC 

251 ATGCCTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACAGGCAGAC GGATTAAAGT GTTGAACCAT CACGGCGAAA CGGGCGACAT 

351 GGCGGACTTC TCTCCAGACC ATGCAATCAT GGTGGACAGC GCCTTGTCGC 

4 01 AACAGGTCGA AATCCTGCGC GGTCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATTA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTTCACGACG ACGATGATGC 

951 ACACGCCCAT GCCCACAACG GCAAACCTTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG TGCTTTATCT GCCACATCCG AAGCGGTCAA 

1251 ACAACCGATG CTGCTTGACA ATAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGCTACGAC AAAGCATTGA TTGATCGGGA 

14 01 AAACTACTAC AACCATCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

14 51 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCG 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGGGAA GACGCCTACG GCAACCGCCC ACTCATTGCC 

1951 CAAGCCGACC AAAACGCCCC TCGCGTTCCG GCTGCGCGCC TCGGCGTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGCATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 606; ORF 149-1. a>: 

al49-l.pep 

1 MAQTTLKPIV LSILLINTPL LSQA HGTEQS VGLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDS ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDDAHAH AHNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

401 YLGQKSSALS ATSEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPLIA 

651 QADQNAPRVP AARLGVHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

al49-l/ml49-l 98.0% identity in 758 aa overlap 

10 20 30 40 50 60 

al4 9-l .pep MAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVSVVGKSRPRATSGLLHTSTASDKI 
I I I I I I I I I I I I I I I I I I I I i : I I I I ! I I I I I I I I I I I ! I I I 1 I I I I i ! I I I I I I I I I 
ml4 9-l MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 

10 20 . 30 40 50 60 

70 80 90 100 110 120 

al4 9-l .pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml4 9-l ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

al49-l.pep S PDHAIMVDSALSQQVEI LRGPVTLLYSSGNVAGLVDVADGKI PEKMPENGVSGELGLRL 
I I I I I I I I I : I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I 
ml4 9-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKI PEKMPENGVSGELGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

al4 9-l.pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
ml4 9-l SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

al4 9-l.pep SWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
I I I I I I I I I I : I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml4 9-l SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

a 14 9-1 .pep DYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I I I ! I I I I I I I I I I I : I I I I : I : I : I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I ! I I I 
ml 4 9 - 1 DYDNPGLSCGFH DDDNAH AHTHSGRPWI DLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

a 1 4 9 - 1 . pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKS SALS ATSEAVKQPM 
I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I i 
ml4 9-l RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

al4 9-l .pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGAH 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I i I I I I I M I I I I I I I 
ml4 9-l LLDNKVQHYS FFGVEQANWDNFTLEGGVRVEKQKAS I QYDKAL I DRENYYNHPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

al49-l.pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f i i 1 1 r 1 1 1 1 j 1 1 1 1 1 1 r 1 1 1 1 1 1 i 1 1 1 1 1 

ml4 9-l RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
490 500 510 520 530 540 

550 560 570 580 590 600 

al49-l.pep SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

I M M M I I ! I I ! I I I I I I I I I I I I I i I I II I I I I ) I I I I I I I I I t I ! I I I I II 1 I I I i i 

ml4 9-l SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
550 560 570 580 590 600 



610 620 630 640 650 660 
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al49-l.pep DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRVP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I t I : I II I I I I I I I I 
ml4 9-l DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
610 620 630 640 650 660 

670 680 690 700 710 720 

al4 9-l .pep AARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 
I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I II I I I I I I i I I I I I I I I I I I I I I 
ml49-l AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 
•670 680 690 700 710 720 

730 740 750 759 

al4 9-l .pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I i I 
ml4 9-l WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

730 740 750 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 607>: 

gl50.seq (partial) 

1 . . TACTGCAAGG CAGACCCCTT TCCCGCCGCG CTGCTGGCCA ATCAGAAAAT 

51 CACCGCCCGC CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA 

101 GCGGTTCGGA TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT 

151 GACAACGATC CGGCACTGGT CGGGGAAATC CTAGACCTGC TCGGCATCAA 

2 01 TCCGGCAACG GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG 
251 CACTGTTATC CCATTTCGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA 
301 GGCTATGCCA CGTTCGCCGA TAATGACGAA CTCGACCGTA TTGCTGCCGA 
351 CAACGCCGTT TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGGTGTGC 
4 01 TGCACCGCTT CCCGGCAAAA CTGACGGCGG AACAATTCGC CGGCCTGCTG 
451 CGCCCGCTTG CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGC 
501 GGGGGACGAA GTGCACCTGA CCGTCGGCGC AGTGCGTTTC GAACACGAAG 
551 GGCGCGCCAG GGCGGGCGGC GCATCGGGTT TCTTTGCCGA CCGGCTGGAA 
601 GAGGACGGCA CGGTGCGCGT GTTTGCGGAA CGCAACGACG GCTTCAGGCT 
651 GCCCGAAGAC AGCCGCAAGC CGATTGTGAT GATCGGCTCC GGTACCGGCG 
701 TCGCACCGTT CCGCGCCTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA 
751 GGCAGAAACT GGCTGATTTT CGGCAATCCG CATTTTGCCG CCGACTTCCT 
801 CTATCAGACC GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT 
851 ATGACTTCGC CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC 
901 AAAATCCGCG AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC 
951 GCATATCTAT GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GAAGTGGAAG 

1001 CCGCCTTGCT GGATGTGATT ATCGGGGCAG GGCATTCGGA CGAAGACGGC 
1051 GCAGAAGGAT ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA 
1101 TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 608; ORF 150.ng>: 

gl50 .pep (partial) 

1 . . YCKADPFPAA LLANQKITAR QSDKDVRHIE IDLSGSDLHY LPGDALGVWF 

51 DNDPALVGEI LDLLGINPAT EIQAGGKTLP VASALLSHFE LTQNTPAFVK 

101 GYATFADNDE LDRIAADNAV LQGFVQSTPI AGVLHRFPAK LTAEQFAGLL 

151 RPLAPRLYSI SSSQAEAGDE VHLTVGAVRF EHEGRARAGG ASGFFADRLE 

201 EDGTVRVFAE RNDGFRLPED SRKPIVMIGS GTGVAPFRAF VQQRAAENAE 

251 GRNWLIFGNP HFAADFLYQT EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD 

3 01 KIREQAEGLW QWLQEGAHIY VCGDAAKMAK EVEAALLDVI IGAGHSDEDG 
3 51 AEGYLDMLRE EKRYQRDVY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 609>: 

mlSO.seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT CTGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGGCGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

2 51 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACATCG CCGGCGAACG CCGCCTGCTG CTGGTTACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGAAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 
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451 GGCGACAGTT CCTATCCGAA TTTCTGTCAG GCAGGTAAAG ATTTCGACCG 

5 01 GCGTTTTGAA GAATTGGGCG CAAAACGGCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TACCGCCTCC GCAAACGCCT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGCCCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCCGGCAACG 

9 01 GAAATACAGG CGGGCGGAAA GATGATGCCG GTTGCGCGCG CACTTTCATC 

951 TCATTTCGAA CTCACGCAAA ACACTCCGGC TTTCGTCAAA GGCTATGCCG 

1001 CGTTCGCCCA TTATGAAGAA CTCGATAAAA TCATTGCCGA TAACGCCGTT 

1051 TTGCAGGATT TCGTGCAAAA CACGCCTATT GTCGATGTGC TGCACCGCTT 

1101 CCCGGCAAGC CTGACGGCAG AACAATTCAT CCGTTTACTG CGTCCGCTTG 

1151 CACCCCGTTT GTATTCGATT TCTTCAGCAC AGGCGGAAGT GGGCGATGAA 

12 01 GTGCATTTAA CTGTCGGCGT GGTTCGTTTT GAACACGAAG GCCGCGCCAG 

12 51 AACGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

13 01 CGGTGCGCGT" GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 
1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

14 51 GGCTGATTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 
1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGGT ACGATTTCGC 

15 51 CTGGTCCCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 

16 01 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 
1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 610; ORF 150>: 

ml50 .pep 

1 MQNTNPPLPP LPPEITQLLS GLDAAQWAWL SG YAW AKAGN GASAGLPALQ 

51 TALPAAEPFS VTVLSASQTG NAKSVADKAA DSLEAAGIQV SRAELKDYKA 

101 KNIAGERRLL LVTSTQGEGE PPKEAWLHK LLNGKKAPKL DKLQFAVLGL 

151 GDSSYPNFCQ AGKDFDRRFE ELGAKRLLER VDADLDFTAS ANAWTDNIAA 

201 LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKAAPFPAA LLANQKITAR 

251 QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVRE I LDLLGIDPAT 

301 EIQAGGKMMP VARALSSHFE LTQNTPAFVK GYAAFAHYEE LDKIIADNAV 

351 LQDFVQNTPI VDVLHRFPAS LTAEQFIRLL RPLAPRLYSI SSAQAEVGDE 

4 01 VHLTVGWRF EHEGRARTGG ASGFLADRLE EDGTVRVFVE RNDGFRLPED 

451 SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLIFGNP HFARDFLYQT 

501 EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW QWLQEGAHIY 

551 VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE EKRYQRDVY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 
ORF 150 shows 91.3% identity over a 369 aa overlap with a predicted ORF (ORF 150.ng) 
from TV. gonorrhoeae: 

ml50/gl50 

210 220 230 240 250 260 

LLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAALLANQKITARQSDKDVRHIE 

MM I M M M M M M M M M M M M 
YCKADPFPAALLANQKITARQSDKDVRHIE 
10 20 30 

270 280 290 300 310 320 

IDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPATEIQAGGKMMPVARALSSHFE 
M M M I M M M M M M M M M M II I I I I I h I I I I I I I I I I MM M MM 
IDLSGSDLHYLPGDALGVWFDNDPALVGEILDLLGINPATEIQAGGKTLPVASALLSHFE 
40 50 60 70 80 90 

330 340 350 360 370 380 

LTQNT PAFVKG YAAFAH YEELDK 1 1 ADNAVLQD FVQNT P I VDVLHR FPAS LTAEQFIRLL 



ml50 .pep 
gl50 

n\150 .pep 
gl50 

ml50 . pep 
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I I I I I I I I I I I I I : I I : I I I : I I I I 1 I II I I I : I I I : I II I I I I : I I I I I I I I 
gl50 LTQNTPAFVKGYATFADNDELDRIAADNAVLQGFVQSTPIAGVLHRFPAKLTAEQFAGLL 

100 110 120 130 140 150 

390 400 410 420 430 440 

ml50 . pep RPLAPRLYSISSAQAEVGDEVHLTVGWRFEHEGRARTGGASGFLADRLEEDGTVRVFVE 
I I I I I I I I I I I I : I I i : I I I I I I I I I : I I I I I I I I I I: I I I I I I : I I I I I I I I I I I I I : I 
g 1 5 0 RPLAPRLYS I S S SQAEAGDE VHLT VGAVRFEHEGRARAGG ASG FFADRLEE DGTVRVFAE 

160 170 180 190 200 210 

450 460 470 480 490 500 

ml50 . pep RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGKNWLIFGNPHFARDFLYQT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
gl50 RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGRNWLIFGNPHFAADFLYQT 

220 230 240 250 260 270 

510 520 530 540 550 560 

ml50 . pep EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 
I I 11 I I I I I I I I I I I I I I I I I I I I I I M I I! I II I II II I II I I I I I I I M I I I I I I I I I 
gl50 EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 

280 290 300 310 320 330 

570 580 590 600 

ml 50 . pep DVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
: I I I I I I I I I I I I I I 11:111 I I I I I I I II I 1 I I I I I I 
gl50 EVEAALLDVIIGAGHSDEDGAEGYLDMLREEKRYQRDVYX 

340 350 360 370 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 61 1>: 

al50 . seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT ATGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGACGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACAT CG CCGGCGAACG CCGCCTGCTG CTGGTTACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGGAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 

4 51 GGCGACAGCT CCTATCCGAA TTTCTGCCGG GCGGGCAAAG ATTTCGACAA 

501 ACGTTTTGAA GAATTGGGCG CAAAACGCCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TGCCGCCGCC GCAGACGGAT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGACCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTG.GT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCAGGCAACG 

901 GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG CACTGTTATC 

951 CCATTTTGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA GGCTATGCCC 

1001 CGTTCGCCGA TGATGACGAA CTCGACCGTA TTGCTGCCGA CAACGCCGTT 

1051 TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGATGTGC TGCACCGCTT 

1101 CCCGGCAAAA CTGACAGCGG AACAATTCGC CGGCCTACTG CGCCCGCTTG 

1151 CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGT GGGGGACGAA 

1201 GTGCACCTGA CCGTCGGCGC GGTGCGTTTC GAACACGAAG GGCGCGCCAG 

1251 GGCGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

1301 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

14 51 GGCTGTTTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 

1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT ACGATTTCGC 

1551 CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 

1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 

1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
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17 01 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 



This corresponds to the amino acid sequence <SEQ ID 612; ORF 

al50. pep 

MQNTNPPLPP MPPEITQLLS GLDAAQWAWL SGYAWAKAGN 
TALPTAEPFS VTVLSASQTG 
KNIAGERRLL LVTSTQGEGE 
GDSSYPNFCR AGKDFDKRFE 
LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR 
QSDKDVRHIE IDLSGSDLHY 
EIQAGGKTLP VASALLSHFE 

LQGFVQSTPI ADVLHRFPAK LTAEQFAGLL RPLAPRLYSI 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



NAKSVADKAA 
PPEEAWLHK 



DSLEAAGIQV 
LLNGKKAPKL 



ELGAKRLLER VDADLDFAAA 
YCKADPFPAA 
LPGDALGVWF DNDPALVREI 
LTQNTPAFVK GYAPFADDDE 



VHLTVGAVRF EHEGRARAGG ASGFLADRLE EDGTVRVFVE 
SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLFFGNP 



EWQQFAKDGF LHRYDFAWSR 
VCGDAAKMAK DVEAALLDVI 



DQEEKIYVQD KIREQAEGLW 
IGAGHLDEEG AEEYLDMLRE 



150.a>: 

GASAGLPALQ 
SRAELKDYKA 
DKLQFAVLGL 
ADGWTDNIAA 
LLANQKITAR 
LDLLGIDQAT 
LDRIAADNAV 
SSSQAEVGDE 
RNDGFRLPED 
HFARDFLYQT 
QWLQEGAHIY 
EKRYQRDVY* 



m!50/al50 



94.8% identity in 599 aa overlap 
10 

ml50 .pep 



20 



30 



40 



50 



60 



MQNTNPPLPPLPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPAAEPFS 

I 1 I ■ I I 1 I 1 1 - ■ 1 I I I 1 I I I I I t t 1 1 I I J f I f MM li III I I I II M:|| I ! I 

a 150 MQNTNPPLPPMPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPTAEPFS 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 50 . pep VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 

iMiiiiiii milium iiimi i i 1 1 ! 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 

al50 VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml50.pep PPKEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCQAGKDFDRRFEELGAKRLLER 

MsMIM Itlllt IMM! f I I I MMim I I ill: IIIIIIM 

alSO PPEEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCRAGKDFDKRFEELGAKRLLER 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 50 . pep VDADLDFTASANAWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAA 

Ml Ml m : i : : I I I i [ I I I I I I! I I I I I t ! I | | | | | I I I I I I I I I I I Mill 

al50 VDADLDFAAAADGWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKADPFPAA 

190 200 210 220 230 240 

250 260 270 280 290 300 

mlSO . pep LLANQKITARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPAT 

1 1 m mm i ii ii ii iiiiiiiiiiii 1 1 ii inn 1 1 1 1 ii in iii mi ii u 

al50 LLANQKITARQSDKDVRHI EI DLSGSDLHYLPGDALGVWFDNDPALVREI LDLLGIDQAT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml50 . pep EIQAGGKMMPVARALSSHFELTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPI 
MMIM MM II lllllllllllllllll II MUM MMMI I I I M II 
al50 EIQAGGKTLPVASALLSHFELTQNTPAFVKGYAPFADDDELDRIAADNAVLQGFVQSTPI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml50 . pep VDVLHRFPASLTAEQFIRLLRPLAPRLYSISSAQAEVGDEVHLTVGVVRFEHEGRARTGG 
MMMMIMMMI M t I M I M II I I I M I I I I i Ml M I I M M I I II I M M I 
al50 ADVLHRFPAKLTAEQFAGLLRPLAPRLYSISSSQAEVGDEVHLTVGAVRFEHEGRARAGG 
370 380 390 400 410 420 

430 440 450 460 470 480 

ml50 . pep ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 
I I Ml I I I I I I I I II I I I I M M I Ml I II I I Ml I I I I III II I I I Ml Ml I II Ml I 
al50 ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml50 . pep GKNWLIFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
I M I I M I II II I I I I I I I I I | I Ml | | | I I I I! I I I I I I M I I I I I I I | | IN ! | | | 1 | 
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al50 GKNWLFFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
490 500 510 520 530 540 

550 560 570 580 590 600 

ml50.pep QWLQEGAHIYVCGDAAKMAKDVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
I I 1 I t I I I I I I I I I I I 1 1 I I I I I I I II I II I I I I I I I I I I I I 1 I I ! I I I I I I I I I I I I I I 
al50 QWLQEGAHI YVCGDAAKMAKDVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 

550 560 570 580 590 600 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 613>: 

gl51 . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCACACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTG 

2 01 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 
251 TGGAGCGCGT TTTGGGGATG GTGGATTGCG TCGTCTTGTT GGTGGACGCA 
301 CAGGAAGGTC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

3 51 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAACCGTCCG 

4 01 CCCGTCCGAG CTGGGTTATC GACCAGACTT TCGAGTTGTT CGACAACTTG 

4 51 GGTGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGTTT 
501 GAGCGGCTTT GCCAAGCTGG AAGAAAccga CGAGAGCAGC GATATGCGCC 

5 51 CGCtgttcgA CACCATCCTA AAATACAcgc ctgCACCGAG CGGCAGCGCG 
601 GACGAGCCGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 
651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 
701 AAACCGTTGC CGTGATGAAC CACGAGCAGC AAATCGCCCA AGGCCGCATC 
751 AACCAGCTTT TGGGTTTCAA AGGCTTGGAA CGCGTGCCGC TTGAAGAAGC 
801 CGAAGCCGGC GACATTGTGA TTATTTCCGG TATCGAAGAC ATCGGCATCG 
851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 
901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTAAACA CCAGCCCGCT 
951 CGCAGGTACA GAAGGCAAAT TCGTGACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTTGCCC TGCGCGTGGA AGACACCGCC 

1051 GatgCCGACG TGTTCCGCGT ATCcgGGCGC GGCGAACTGC ACCTGACGAT 

1101 TTTGCTGGAA AATATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAGC 

1151 CGCGCGTCGT GTACCGAGAC ATCGACGGTC AAAAATGCGA ACCTTATGAA 

1201 AACCTGACTG TGGACGTACc cgacgacAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACCAA TATGGAAAGC GACGGCAACG 

13 01 GacgCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 
1351 CAAGGCGAAT TCATGACCCT GACGCGCGGC GTCGGGCTGA TGAgccacGT 

14 01 GTTcgacgac tacgcgcccg tcaAACCCGA TATGCCCGGC CGCCACAACG 
14 51 GCGTactggt GtcccaAGAG CAGGGCGAGG CGGTTGCTTA CGCCTTGTGG 
1501 AATCTTGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 
1551 CGAAGGTATG ATTATCGGCA TCCACAGCCG CGACAACGAT TTGGTGGTCA 
1601 ACCCGCTCAA AGGCAAAAAA CTCAC CAATA TCCGTGCCAG CGGTACCGAC 
1651 GAAGCGGTGC GCCTGACCAC GCCGATCAAA CTGAcgcTGG AAGGCGCGGT 
1701 CGAGTTTATC GACGATGACG AGCTGGTGGA AATCACGCCG CAAtccatCC 
1751 gcctgcgcat gcgttacctG AGCGaattgg aacgccgccg tcaTTTTAAA 
1801 AagctgGATT AA 

This corresponds to the amino acid sequence <SEQ ID 614; ORF 151.ng>: 

gl51 .pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGCHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESS DMRPLFDTIL KYTPAPSGSA 

2 01 DEPLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HEQQIAQGRI 
251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

3 01 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 
3 51 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 
401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 
451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
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501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRMRYL SELERRRHFK 

601 KLD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 61 5>: 

ml51.seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGCGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAGCCGTCCG 

4 01 CTCGTCCGAG CTGGGTTATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGGTT 

501 GAGCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 

701 AAACCGTTGC CGTCATGAAC CACGATCAGC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTGGAA CGCGTGCCGC TTGAAGAAGC . 

801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATCGAAGAC ATCGGTATCG 

851 GCGTAACCAT C AC CGACAAA GACAATCCCA AAGGCCTACC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGCT 

951 GGCGGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGATGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

13 01 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 
1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCCGGC CGCCACAACG 
14 51 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 
1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 
1551 CGAAGGCATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 
1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 
1651 GAAGCCGTTC GCCTGACCAC GCCAATCAAG CTGACGCTGG AAGGTGCGGT 
1701 TGAGTTTATC GACGATGACG AACTCGTTGA AATCACGCCG CAATCCATCC 
1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCACTTTAAA 
1801 AAGCTGGATT GA 

This corresponds to the amino acid sequence <SEQ ID 616; ORF 151>: 

ml 51 .pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 

601 KLD* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 151 shows 99.2% identity over a 603 aa overlap with a predicted ORF (ORF 151.ng) 
from N. gonorrhoeae: 

ml51/gl51 

10 20 30 40 50 60 

mlSl.pep MKQIRN I AI I AHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSN DLEKERG IT I LAKNT 

I I I I I I I I I I I I M I I i 1 I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I ! I M I | | I I I I 

gl51 MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml51 . pep AIDYEGYHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 
I I I I I I I I I I I I I II I 1.1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 
gl51 AIDYEGCHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 51 . pep KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! : 
gl51 KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml51 . pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 

I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I [ I I I I I 

gl51 DMRPLFDTILKYTPAPSGSADEPLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 

190 200 210 220 230 240 

250 260 270 280 290 300* 

ml 51 .pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 
I : I I I I I I I I 1 I I I I I I I I I I 1 I I 1 I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
gl51 HEQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml51 . pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I | | | | 
gl51 VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml51 . pep GELHLTILLENMRREGYELAVGKPRWYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 
I I I I I 1 ! I I I I I I I I I I I I I I 1 I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl51 GELHLTILLENMRREGYELAVGKPRWYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 

370 380 390 400 410 420 

430 440 450 460 470 480 

mlSl.pep RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I II I I 
gl51 RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml51 .pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLWNPLKGKK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl51 RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLVVNPLKGKK 

490 500 510 520 530 540 

550 560 570 580 590 600 

mlSl.pep LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 
I I I M I I I I I I I II I I I I I I I I II 1 I I I I I I I I I I I | | I I I I I I I I I I 1 I I I I I I I I I I 
gl51 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRMRYLSELERRRHFK 

550 560 570 580 590 600 
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mlSl.pep KLDX 
(Ml 

gl51 KLDX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 617>: 

al51.seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGAGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA TAAAATCGAC AAACCGTCCG 

4 01 CCCGTCCGAG CTGGGTCATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

451 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTATG CTTCCGGTCT 

501 GTCCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGTATC AAGCCCGGTC 

701 AAGTTGTTGC CGTCATGAAC CACGATCAAC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTAGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATTGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GAT GG ACT TT ATGGTCAACA CCAGCCCGTT 

951 GG CAGGT AC G GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGACGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGCTTC 

1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCTGGC CGCCACAACG 

14 51 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCGATTAAG CTGACGCTGG AAGGTGCGGT 

1701 CGAGTTTATC GACGATGATG AGCTGGTAGA AATCACGCCG CAATCCATCC 

1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCATTTCAAA 

1801 AAGCTAGATT GA 

This corresponds to the amino acid sequence <SEQ ID 618; ORF 151.a>: 

al51 .pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQVVAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRVVYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 

601 KLD* 

ml51/al51 99.8% identity in 603 aa overlap 

10 20 30 40 50 60 
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mlSl.pep MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 
I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I 1 I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I 
al51 MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 5 1 . pep AIDYEGYHINI VDTPGHADFGGEVERVLGMVDCVVLLVDAQEGPMPQTRFVTKKALALGL 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I 1 1 I I I 

al51 AIDYEGYHINI VDTPGHADFGGEVERVLGMVDCVVLLVDAQEGPMPQTRFVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

mlSl.pep KPIVVINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 

1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al51 ' KPIVVINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 

130 140 150 160 170 180 

190 200 210 220 230 240 

mlSl.pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
al51 DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQVVAVMN 

190 200 210 220 230 240 

250 260 270 280 290 300 

mlSl.pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 
I I I I I I I I I I i I I I I I I I I I I! I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I 

al51 HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVI ISGIEDIGIGVTITDKDNPKGLPMLS 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 51 . pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
al51 VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 5 1 . pep GELHLTILLENMRREGYELAVGKPRVVYRDI DGQKCEPYENLTVDVPDDNQGAVMEELGR 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
al51 GELHLTILLENMRREGYELAVGKPRVVYRDI DGQKCEPYENLTVDVPDDNQGAVMEELGR 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 5 1 . pep RRGELTNMESDGNGRTRLEYHI PARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

I M I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al51 RRGELTNMESDGNGRTRLEYHI PARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml51 . pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLVVNPLKGKK 
I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I 
al51 RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLVVNPLKGKK 

490 500 510 520 530 540 

550 560 570 580 590 600 

mlSl.pep LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 

I I I I I I I I I I I I I I M I I I 1 1 I I I I I I I I I I 1 1 I I I I I I 1 1 I I I I I I I I I I I I I I I I I 1 I 

al51 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 
550 560 570 580 590 600 



mlSl.pep KLDX 
I I I I 

al51 KLDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 619>: 

gl52 . seq 

1 ATGAAAAaca aAACCaaagt ctgGGacttc cCcacccgcc ttTTCCactG 

51 GctgcttgCC gCATCCctgc CCTTTATGTG gtatagCGCA AAAGCCGGCG 

101 GcgataTGCT GcaatgGCAC ACGCGCGTCG GGCTGCTCGT CCTTTTCCTG 
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151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAgcgATA CCGCCCGTTT 

2 01 CTCccgTtTC GTCCGAGGTT GGGCAGGTAT ACGCGGCTAT CTGAAAAAcg 
251 gCATTCCCGA ACAtatcCAG CCCGGACACA ACCCCTTGGG CGCACTgatg 

3 01 gtcGTTGCGC TTTTGgccgc cgtcTCATTT CAagtcggcA CGGGGCTTTT 
351 Tgccgccaat gaaaacacct tcagcaCCAa cggctacctc aaccatttgg 
401 tttccgaaca tacgGGCAGC CTTATACGGA AAATCCACCT CAACTTTTTC 

4 51 AAGCTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGCCG TCGCCGCATA 
501 CCGCATATTC AAAAAGAAAA ACCTCGTCCG CCCGATGATA ACCGGCTTCA 
551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 
6 01 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 
651 GTCCTGA 



This corresponds to the amino acid sequence <SEQ ID 620; ORF 152.ng>: 



1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLLVLFL 

51 LVFRLCWGI W GSDTARFSRF VRGWAGIRGY LKNGIPEHIQ PGHNPLGALM 

101 WALLAAVSF QV GTGLFAAN ENTFSTNGYL NHLVSEHTGS LIRKIHLNFF 

151 KLLAVFSAVH IAAVAA YRIF KKKNLVRPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVAALA AAAILLL S* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 62 1>: 



1 ATGAAAAACA AAACCAAAGT CTGGGACCTC CCCACCCGCC TTTTCCACTG 

51 GCTGCTTGCC GCGTCCCTGC CCTTTATGTG GTATAGCGCG AAAGCCGGCG 

101 GCGATATGCT GCAATGGCAC ACGCGCGTCG GGCTGTTCGT CCTTTTCCTG 

151 CTCGTATTTC GCCTCTGCTG GGGCATTTGG GGCAGCGATA CCGCCCGTTT 

2 01 TTCCCGTTTC GTCCAAGGCT GGGCAGGCAT ACGCGGCTAT CTGAAAAACG 
251 GTATTCCCGA ACACATCCAG CCCGGACACA ACCCCTTGGG CGCACTGATG 
301 GTCGTTGCGC TTTTGGCCGC CGTGTCCTTC CAAGTCGGCA CCGGGCTTTT 

3 51 TGCCGCCGAT GAAAACACCT TCAGCACCAA CGGCTACCTC AACCATTTGG 

4 01 TTTCCGAACA TACGGGCAGC CTTATGCGGA AAATCCACCT CAACTTTTTC 

4 51 AAGCTGCTCG CCGTTTTTTC TGCAATCCAC ATCGCCGCCG TCGCCGCATA 

5 01 CCGCGTATTC AAAAAGAAAA ACCTCATCCT CCCGATGATA ACCGGCTTCA 
551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCAGGCAA AGCCGCGCTT 
601 GCCGCCGCAT TATCGGTTGC CTCGCTTGCC GCAGCCGCCA TCCTGCTCCT 
651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 622; ORF 152>: 

ml52.pep 

1 MKNKTKVWDL PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLFVLFL 

51 LVFRLCWGI W GSDTARFSRF VQGWAGIRGY LKNGIPEHIQ PGHNP LGALM 

101 WALLAAVSF QV GTGLFAAD ENTFSTNGYL NHLVSEHTGS LMRKI HLNFF 

151 KLLAVFSAIH IAAVAA YRVF KKKNLILPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVASLA AAAILLL S* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from Kgonorrhoeae 

ORF 152 shows 95.4% identity over a 218 aa overlap with a predicted ORF (ORF 152.ng) 
from N. gonorrhoeae: 

ml52/gl52 



gl52 .pep 



ml52 . seq 



ml52 .pep 



10 20 30 40 50 60 

MKNKTKVWDLPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWGIW 



gl52 




10 20 30 40 50 60 



m!52 .pep 



70 80 90 100 110 120 

GSDTARFSRFVQGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAD 



gl52 




70 80 90 100 110 120 
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130 140 150 160 170 180 

ml52 . pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAIHI AAVAAYRVFKKKNLILPMI 

I I I I I I I I I I I I I I I I I I I I I - I I I I I I I I I I I I I I I I : I I I I I I I I I : I I I I I I ' III 
g i52 ENTFSTNGYLNHLVSEHTGSLIRKIHLNFFKLLAVFSAVHIAAVAAYRIFKKKNLVRPMI 

130 140 150 160 170 180 

190 200 210 219 

ml5 2 .pep TGFKY I EGKTS I RFAGKAALAAALS VASLAAAAI LLLSX 

I I I I I I I I II I I I I I I I I I I I I I I I I I * I I I I I I I I I I I 
gl 5 2 TGFKYI EGKTS I RFAGKAALAAALSVAALAAAAI LLLSX 

190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 623>: 

al52.seq 

1 ATGAAAAACA AAACCAAAGT CTGGGACTTC CCCACCCGCC TTTTCCACTG 

51 GCTGCTTGCC GCATCCCTAC CCTTTATGTG GTATAGCGCG AAAACCGGCG 

101 GCGATATGCT GCAATGGCAC ACGCGCGTCG GGCTGTTTAT CCTTTTCCTG 

151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAGCGATA CCGCCCGTTT 

201 CTCCCGTTTC GTCCGCGGAT GGTCGGGTAT CAGAGAGTAT ATGAAAAACG 

251 GTATTCCCGA ACACGTCCAA CCCGGACACA ACCCCTTGGG CGCACTGATG 

301 GTCGTTGCGC TTTTGGCCGC CGTGTCGTTC CAAGTCGGCA CAGGGCTTTT 

351 TGCCGCCGAT GTAAACACCT TCAGCACCAA CGGCTACCTC AACCATTTGG 

401 TTTCCGAACA TACGGGCAGC CTTATGCGGA AAATCCATCT CAACTTTTTC 

4 51 AAACTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGNCG TCGCCGCATA 

501 CCGCGTGTTC AAAAAGAAAA ACCTCGTCCT CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 624; ORF 152.a>: 

al52.pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KTGGDMLQWH TR VGLFILFL 

51 LVFRLCWGI W GSDTARFSRF VRGWSGIREY MKNGIPEHVQ PGHNPLGALM 

101 VVALLAAVSF QV GTGLFAAD VNTFSTNGYL NHLVSEHTGS LMRKIHLNFF 

151 KLLAVFSAVH IAXVA AYRVF KKKNLVLPMI TGFKYIEGKT SIRFAGKAAL 

201 AAAL S VAAL A AAAILLLS* 



ml 52/al 52 94.0% identity in 21 8 aa overlap 

10 20 30 40 50 60 

ml52 . pep MKNKTKVWDLPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWGIW 
I I I I I I I I I : I i I I I 1 I I I I I I I II I I I I I I : I I I I I I I I I ! I I I I : II M I 1 I I I I I I I 
al52 MKNKTKVWDFPTRLFHWLLAASLPFMWYSAKTGGDMLQWHTRVGLFILFLLVFRLCWGIW 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml52 . pep GSDTARFSRFVQGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAD 

I I I i I ! I I I 1 I : I I : I I I I : I I I I I I I : I I > I i I I I I I I I I I I I I I I I t I I i I I I I I I I 
al52 GSDTARFSRFVRGWSGIREYMKNGIPEHVQPGHNPLGALMVVALLAAVSFQVGTGLFAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml52 . pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAIHIAAVAAYRVFKKKNLILPMI 
I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I : I I I I I I I I I I I M I I : II I I 
a 152 VNTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAVHIAXVAAYRVFKKKNLVLPMI 

130 140 150 160 170 180 



190 200 210 219 

ml52 . pep TGFKY I EGKTS I RFAGKAALAAALSVASLAAAAI LLLSX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I 
al52 TGFKYIEGKTS IRFAGKAALAAALSVAALAAAAI LLLSX 

190 200 210 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 625>: 

gl53.seq 



1 


atggggtttg 


LLL awiy La L 


na r•^^t*at•3t"r , 






51 


/">c/~t ft f~ f 

yycatccyLL. 


n t- 1~ t* Pa n f" P. P 


v-uu/iort l y a. l 


a car 1 r»t" era t* Pi 


GTGTTtCagg 


i ni 

JLU X 






GTGATGTTTG 


TP. n t" a a PTT T 


cGGCGcgcCG 


151 


1 Itlbl ILL 


TP.PTP.PTP.TP. 


CPTGTATGTP 


TATP.PCGCGC 


TGATACGGAA 


OAT 


apap^pp.tat 


PPTPPGCTGC 


GTTTGGCAAC 


GCGTGTGATG 


GTGCGCTTGA 


OCT 


pp.p zifinpn at 


p. ATGG TGG AT 


GTGTTTTTTG 


TTTCCACTCT 


GGTGGCGTAT 


7 fi 1 


ATPAAP.PTCT 


CGTCTGTGGC 


AAAGGTTCGC 


TTCGGGCCGG 


CGTTTTATCT 


351 


PATP.TTPP.PPr 


CTGTCGGTTA 


TGCTGATTCG 


GACTTCGGTA 


TCGGTTCCCC 


401 


AGCATTGGGT 


GTATTTCCAA 


ATCGGGCGGC 


TGACGGGGAA 


TAATGCGGTT 


451 


CAGACGGCAT 


CGGAAGGCAA 


AACCTGTTGC 


AGCCGCTGCC 


TGTATTTccg 


501 


cgacAGTgcc 


gaatccCCCT 


GCGGGGTGTg 


cgGCGcggaA 


CTgtacggcg 


551 


gacggccgaa 


aagtCTGAGt 


atttCgtCGG 


CGTTTCTgac 


ggcggcggTT 


601 


GTTTTGTATT 


TCCctgCcaa 


TATCctgccg 


attaTGAttt 


cgtccAATCc 


651 


tgccgccacg 


GAGGcCAACA 


CCATCTTTAG 


CGGCATCGCT 


TATATGTGGG 


701 


ACGagggcgA 


CAGGCTGATT 


GCGGCGGTTA 


TTTTCAGCGC 


GAGTATTTTG 


751 


GTGCCGGTGC 


TGAAGATTGC 


GGCAATGTCG 


GTTTTGATTG 


CGGCGGCACG 


801 


GTTCGCTTTG 


CCGGCGGGCG 


CAAAGAAATT 


GTCGCACCTC 


tacCGCATCA 


851 


CCGAAGCGGT 


CGGCCGCTGG 


TCGATGATTG 


ATATTTTTGT 


GATTATTATT 


901 


TTGATGTGTT 


CGTTCCacaC 


TTATGCCGCG 


CGCGTCATTC 


CGGGCAGTGC 


951 


GGCAGTCTAT 


TTCTGCCTGG 


TCGTGATTTT 


GACGATGCTG 


TCCGCCTATT 


1001 


ATTTCGACCC 


GCGCCTGCTT 


TGGGACAAAC 


GCGCTTCAGA 


CGGCATTGCT 


1051 


TTCAACGAAA 


CGGAAAAATA 


TGACTGA 






This corresponds to the amino acid sequence <SEQ ID 626; ORF 153.ng>: 


gl53 .pep 












1 


MGFAYSMTYI 


EVGIPEAASV 


LSLPEMMRLM 


VFQDYGFLAE 


VMFVLTFGAP 


51 


VLFLLLCLYV 


YAALIRKQAY 


PALRLATRVM 


VRL.RQAMMVD 


VFFVSTLVAY 


101 


IKLSSVAKVR 


FGPAFYLMFA 


LSVMLIRTSV 


SVPQHWVYFQ 


IGRLTGNNAV 


151 


QTASEGKTCC 


SRCLYFRDSA 


ESPCGVCGAE 


LYGGRPKSLS 


ISSAFLTAAV 


201 


VLYFPANILP 


IMISSNPAAT 


EANTIFSGIA 


YMWDEGDRLI 


AAVIFSASIL 


251 


VPVLKIAAMS 


VLIAAARFAL 


PAGAKKLSHL 


YRITEAVGRW 


SMIDIFVIII 


301 


LMCSFHTYAA 


RVIPGSAAVY 


FCLWILTML 


SAYYFDPRLL 


WDKRASDGIA 


351 


FNETEKYD* 











The following partial DNA sequence was identified in N. meningitidis <SEQ ID 627>: 

ml53 .seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACTTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGGCCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCAGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

401 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

451 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

701 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 

851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCACAC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 628; ORF 153>: 

ml53 -pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLCLYV YAALIRKQAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 
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101 IKLSSVAEVR FGP AFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISSA FLTAAV 

201 ILYFPANILP I MISSNPAAT EVNTILNGIA YMWDEGDRLI AAVIFSASIL 

251 VPVLKIAAMS VLIAS ARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

301 LMCSFHTYAA R VIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

3 51 FNETEKHD* 

mi53 / gl53 96.1% identity in 358 aa overlap 

10 20 30 40 50 60 

MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 

hllhlMIMMI MIIMIIIIIMIMIIIIIIllllllllllllllllllllll 

MG F A YS MT Y I E VG I P E AAS VLS L P EMMRLMVFQD YG FLA E VM F V LTFG A P VL F LLLC L YV 
10 20 30 40 50 60 

70 80 90 100 110 120 

YAAL I RKQAYPALRLATR VMVRLRQAMMVDVFFVSTLVAYI KLSSVAEVRFG PAFYLMFA 

III II I II I Mill 1 1 1 i I Mil Mill II Ml II III 1 1 1 1 M 1 1 hi I II llllll II 

YAAL I RKQAY PALRLATRVMVRLRQAMMVDVFF VS TLVAYI KLS S VAK VR FG PAFYLMFA 
70 80 90 100 110 120 



ml53 .pep 
gl53 

ml53 .pep 
gl53 



130 140 150 160 170 180 

ml53 .pep LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 

1 1 1 II I M 1 1 II II I II I II 1 1 1 1 1 h M I i 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 II II II 1 1 1 II 

gl53 LSVMLIRTSVSVPQHWVYFQIGRLTGNNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml53 . pep LYRRRPKSLS I SSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 

II I I I I I I II I I I I I I I h I I I I I I I I I I I I I I I I I II I : II h : I I I I I I I I I II I I 
gl53 LYGGRPKSLSISSAFLTAAWLYFPANILPIMISSNPAATEANTIFSGIAYMWDEGDRLI 
190 200 210 220 230 240 



250 260 270 280 290 300 

ml 5 3 . pep AAVI FSAS I LVPVLKI AAMSVLI ASARFALPTGAKKLSHLYRITEAVGRWSMIDI FVI 1 1 

1 1 II I II 1 1 1 II II 1 1 1 1 1 1 1 1 M M M II I M I M II 1 1 M II M 1 1 II II II I II I II 

gl53 AAVI FSAS I LVPVLKI AAMSVLIAAARFALPAGAKKLSHLYRITEAVGRWSMIDI FVI 1 1 

250 260 270 280 290 300 



310 320 330 340 350 359 

ml53 . pep LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 

II 1 1 I 1 1 II 1 1 M I II II I I 1 1 II 1 1 1 1 1 II M II I II M M II 1 1 1 1 II 1 1 M I h 1 1 

gl 5 3 LMCS FHTYAARVI PGS AAVYFCL WI LTMLS AYYFDPRLLWDKRAS DG I AFNETEKYDX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 629>: 

al53 . seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACCTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGATCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCGGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

4 01 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

4 51 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

7 01 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 
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8 51 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCACAC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 



This corresponds to the amino acid sequence <SEQ ID 630; ORF 153.a>: 

al53 . pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLCLYV YAALIRKQAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 

101 IKLSSVAEVR FGS AFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISSAFLTAAV 

201 ILYFPANILP I MISSNPAAT EVNTILNGIA YMWDEGDRL I AAVIFSASIL 

251 VPVLKI AAMS VLIASARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

301 LMCSFHTYAA R VIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

351 FNETEKHD* 



ml53/al53 99.7% identity in 358 aa overlap 

10 20 

ml53 .pep 



al53 

ml 5 3 .pep 
a!53 

ml53 .pep 
al53 

ml53 . pep 
al53 

ml53 . pep 
al53 

ml 5 3 . pep 
al53 



30 40 50 60 

MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 
I N I I I I I I I I I I II I I I I ! ! I I I i I | I I | | | | | | | | | | | | | J | | | | | | | | | | |||,| | | 

MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 
10 20 30 40 50 60 

70 80 * 90 100 110 120 

YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGPAFYLMFA 

I M I I I I I I I I I I I M I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | 

YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGSAFYLMFA 
70 80 90 100 110 120 

130 140 150 160 170 180 

LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
I I I N I I i I 1 I I I i I I I I I I I I I 1 1 I I I I I I I I i I I I I I I I M I I I I I I I I I I I I I I I I I 

LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
130 140 150 160 170 180 

190 200 210 220 230 240 

LYRRRPKSLS IS SAFLTAAVILYFPAN I LPIMISSNPAATEVNTILNGIAYMWDEGDRLI 

IN I I i f I I I I ! I I I I I I I | H | | | | | | | | | | | I I I I I I I I ( I I i ! I i i I I I I I I I I I I I 

L YRRR PKS L S I S S AFLTAAV I L Y FP AN ILPIMISSN PAATE VNT I LNG I AYMW DEG DRL I 
190 200 210 220 230 240 

250 260 270 280 290 300 

AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII 

I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | I I II M I I I II I I I I I I I I I M I I 

AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII 
250 260 270 280 290 300 

310 320 330 340 350 359 

LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 
N I I I I M I I I I II I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | j 1 | 

LMCSFHTYAARVIPGSAAVYFCLVVILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 
310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 63 1 >: 

gl54 . seq 



1 ATGACTGACA 

51 CAAAAACAAC 

101 TGATTGCCGG 

151 GTGGTTACGC 

201 GGTCATTAAG 

251 TGCGCGACGA 

301 GTATCCGGCC 

351 TATCGACCAA 



ACAGCCCTCC 
accttcctCT 
CGGCTGGCTT 
TCTTGATGGA 
GTATTGAGCA 
CCAAAAAGGC 
TCATCCGCAG 
AGCGGcgtAA 



TCCAAACGGA 
CCGCCGTCTG 
TGGGTTAAGG 
CAGCGCGGAA 
TCGATGTCGG 
GTGGAAGTTA 
CGATACCCAG 
CCGGTTTGGG 



CACGCTCAAG 
GCTGGTCCCG 
AAATCCGCAA 
GGCATCGAAG 
ACGCGTTACC 
CTGCCCAACT 
TTTTGGGTGG 
TACGCTGCTT 



CACGCGTCCG 
CTGATCGCGC 
CAGGGGGCCT 
TCAACAATAC 
CGAATCAAAC 
CAATGCGGAC 
TCAAGCCGCG 
TCGGGTTCGT 
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401 


ACATCGCTTT 


TACACCCGGC 


451 


GTGCAGGACA 


TTCCGCCCGT 


501 


GAATTTGATT 


GGTAAAAACG 


551 


TGTATGAAAA 


CTTTATGGTC 


601 


TCCGACCAAA 


GCGTGCATTA 


651 


ACTGATTCAT 


TCCGCCAGCC 


701 


AAACCACAGG 


CAGCGGCATC 


751 


CTGTCAGGCG 


CGATTTCATT 


801 


CGTCAAAAGC 


GAGGACAGCT 


851 


CCAACCTGCC 


TGACGACCGC 


901 


TCCGTGCGCG 


GACTGACCGT 


951 


TGTcggCATG 


GTTTCCGATG 


1001 


ACCtgtTTGA 


aaacggctgg 


1051 


CGTTTGGAAA 


TCAATGCCGA 


1101 


ATTCCAGACG 


G C CTTAAACA 


1151 


ACCTGCTGAC 


CGGCGGCAAA 


1201 


TCGCCCAAGC 


TGCGACCGCA 


1251 


CACACGGGGC 


GGCGGTTTGG 


1301 


TGGACaaatT 


CAACAATCTG 


1351 


GGCTCGCTCG 


CCGAACTCAA 


1401 


AAGCTCCATT 


GacaAACTGG 


1451 


ACGAACTGAA 


CCAAACTCTG 


1501 


TCGcctCAAT 


CGCCTATCTa 


1551 


GG AC AAAACC 


TTAAAagacg 


1601 


aacCCaaCgc 


actGATTTtc 


1651 


GGAAGCCGAT 


AA 



AAAAGCGGCG AGGCAAAAGA CGTGTTCCAA 
TACCGCCATC GGGCAAAgcg GGCTGCGCTT 
AccgCATCCT CAACGTcaaC AGCCCTGTTT 
GGGCAAATCG AAAGCGCGCA TTTCGAcccG 
CACCATCTTC ATCCAAAGCC CCAACGACAA 
GTTTTTGGCT GGAAAGCGGC ATCAATATCG 
AAACTCAATT CCGCCCCTCT GCCTGCCCTG 
TGATTCGCCG AAAACCAAAA ACAGTAAAAA 
TCACGCTTTA CGACAGCCGC AGCGAAATCG 
TCGCTGTACT ACACCGCGTT TTTCAAACAA 
cggTTCGCCT GTcgaATACA AAGGGCtgaA 
TCCCTTATTT TGACCGCAAt gacagCCTGC 
aTTcccGtac gCATCCGCAT cgagccTTCC 
CGAGCAAAGC AAAGAGCATT GGAAACAACA 
AAGGCCTGAC CGCCACCATC TCCAGCAACA 
ATGATTGAGT TGAACGATCA GCCTTCCGCC 
TACCGTTTAT GCAGGCGATA CCGTCATCGC 
ATGACTTGCA GGTCAAATTG GCGGATTTGC 
CCATTggata aAACCGTTGC CGAATTGAAC 
GTCCGCACTC AAATCCGCCA ATGCCGCCCT 
TCGgcaaTCC GCAGACGCAA AACATCCCGA 
AAAGAGTTGC GCATAACCCT GCAAGGCGTA 
cgGAgacgta caAAATAcgc tgCaAAGTTT 
TtcaACCCGT CATTAACACT TTGAaAGAAa 
aacaACAGCA GCAAAGAccc tATCCCGAAA 



This corresponds to the amino acid sequence <SEQ ID 632; ORF 154.ng>: 



gl54 .pep 






1 


MTDNSPPPNG 


HAQARVRKNN 


51 


WTLLMDSAE 


GIEVNNTVIK 


101 


VSGLIRSDTQ 


FWWKPRIDQ 


151 


VQDIPPVTAI 


GQSGLRLNLI 


201 


SDQSVHYTIF 


IQSPNDKLIH 


251 


LSGAISFDSP 


KTKNSKNVKS 


301 


SVRGLTVGSP 


VEYKGLNVGM 


351 


RLEINADEQS 


KEHWKQQFQT 


401 


SPKLRPHTVY 


AGDTVIATRG 


451 


GSLAELKSAL 


KSANAALSSI 


501 


SPQSPIYGDV 


QNTLQSLDKT 


551 


GSR* 





SASRFWLESG INIETTGSGI KLNSAPLPAL 
EDSFTLYDSR SEIANLPDDR SLYYTAFFKQ 
VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 
ALNKGLTATI SSNNLLTGGK MIELNDQPSA 
GGLDDLQVKL ADLLDKFNNL PLDKTVAELN 
DKLVGNPQTQ NIPNELNQTL KELRITLQGV 



The following partial DNA sequence was identified in N. meningitidis <SEQ DD 633>: 

ml54.seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCCCAAG CACGCGTCCG 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 

201 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGCGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 

401 ACATCGCCTT TACACCCGGC AAAAGCGACG AGGCAAAAGA CGTGTTCCAA 

451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCCTGTTT 

551 TGTATGAAAA TTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

6 51 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT GGAAAGCGGC ATCAATATCG 

701 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 

751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AG CG AAGTCG 

851 CCAACCTGCC TGACGACCGC TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GCCTGACCGT CGGTTCGCCC GTCGAGTACA AAGGGCTGAA 

951 TGTCGGCGTG GTTTCCGACG TTCCTTATTT CGACCGCAAC GACAGCCTGC 

10 01 ACCTGTTTGA AAACGGCTGG ATACCCGTAC GCATCCGCAT TGAACCTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 

1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGAAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCA 
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12 01 TCACCTAAGC TGCGACCGCA TACCGTTTAT GCAGGCGATA CCGTTATCGC 

1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AGACGGTTGC CGAATTGAAC 

1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTGCCA ATGCCGCCCT 

1401 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA 

14 51 ACGAACTGAA CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA 

1501 TCGCCGCAAT CGCCTATCTA CGGCGACGTA CAAAATACGC TGCAAAGTTT 

1551 GGACAAAACT TTAAAAGACG TTCAACCCGT GATTAATACT TT GAAAG AAA 

1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA 

1651 GGAAGCCGAT AA 



This corresponds to the amino acid sequence <SEQ ID 634 ORF 154.a>: 

ml54.pep 

1 MTDNSPPPNG HAQARVRKNN T FLSAVWLVP LIALIAGG WL WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 

151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 

4 01 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 

451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 

551 GSR* 



ml54 / g!54 97.8% identity in 553 aa overlap 



10 20 30 40 50 60 

ml 54 . pep MTDNSPPPNGHAQARVRKNNTFLSAWLVPLIALIAGGWLWKEIRNRGPVVTLLMDSAE 

I II II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t II I I I I I I I I I I I I I 
g!54 MTDNSPPPNGHAQARTOKNNTFLSAWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml54 . pep GIEVmTVIKVLSIDVGRV^RIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWVVKPRIDQ 

I I I II I I i I I I I I I II I i I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
gl54 GIEV1JNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWVVKPRIDQ 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml54 . pep SGV^GLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRI^LIGKNDRILNVN 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I II 
gl54 SGVTGLGTLLSGSYIAFTPGKSGEAKDVFQVQDIPPVTAIGQSGLRIiNLIGKNDRILNVN 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml54 . pep SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 

I I | I I I I I II I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl54 SPVLYENFMVGQIESAHFDPSDQSVHYTIFIQSPNDKIilHSASRFWLESGINIETTGSGI 
190 200 210 220 230 240 



250 260 270 280 290 300 

ml54 .pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I I I I I 
gl54 KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEIANLPDDRSLYYTAFFKQ 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml54 .pep S VRG LT VG S P VE YKGLNVG WSD V P Y FDRND S LHLFENGWI PVR I R I E PS RLE I NAD E Q S 

I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl54 SVRGLTVGSPVEYKGLNVGMVSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml54 . pep KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 

I I II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 1 I I M I h I 
gl54 KEHWKQQFQT ALNKGLTAT I SSNNLLTGGKMIELNDQPS AS PKLRPHTVYAGDTVIATRG 
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370 380 390 400 410 420 

430 440 450 460 470 480 

GGLDDLQVKLADLLDKFD KLPLDKTVAELNGS LAELKS TLKS ANAALS SID KLVGKPQTQ 

I M i 1 1 1 1 1 1 1 i 1 1 1 1 1- I II 1 1 M II ill I M 1 1 II H II 1 1 1 1 M 1 1 1 1 1 1 hi 1 1 1 

GGLDDLQVKLADLLDKFNNLPLDKTVAELNGSLAELKSALKSANAALSSIDKLVGNPQTQ 
430 440 450 460 470 480 

490 500 510 520 530 540 

NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

I 1 1 II 1 1 1 1 1 1 1 1 1 IIIIIIMIIIIIMIIIIMIIIIIIIIIIIMMIIIIIIIII 

NIPNELNQTLKELRITLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 
490 500 510 520 530 540 

550 

NSSSKDPIPKGSRX 

I :|| Ml II II III 
NNSSKDPIPKGSRX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 635>: 

al54 . seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCCCAAG CACGCGTCCG 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 

201 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGCGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 

4 01 ACATCGCCTT TACACCCGGC AAAAGCGACG AGGCAAAAGA CGTGTTCCAA 

4 51 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCCTGTTT 

551 TGTATGAAAA CTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

651 ACTGATTCAT TCCGCCAGCC GTTTCTGGCT GGAAAGCGGC ATCAATATCG 

7 01 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 

751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAGTCG 

851 CCAACCTGCC TGATGACCGT TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GACTGACCGT CGGTTCGCCT GTCGAGTACA AAGGGCTGAA 

951 TGTCGGCGTG GTTTCCGATG TTCCTTATTT CGACCGCAAC GACAGCCTGC 

1001 ACCTGTTTGA AAACGGCTGG ATTCCCGTAC GCATCCGTAT TGAGCCTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 

1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGCAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCC 

1201 TCGCCCAAGC TGCGACCGCA TACCGTTTAT GCAGGCGATA CCGTTATCGC 

1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AGACGGTTGC CGAATTGAAC 

1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTGCCA ATGCCGCCCT 

14 01 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA 

1451 ACGAACTGAA CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA 

1501 TCGCCTCAAT CGCCTATCTA CGGCGACGTA CAAAATACGC TGCAAAGTTT 

1551 GGACAAAACC TTAAAAGACG TTCAACCCGT CATTAACACT TTGAAAGAAA 

1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA 

1651 GGAAGCCGAT AA 

This corresponds to the amino acid sequence <SEQ ID 636; ORF 154.a>: 

al54 -pep 

1 MTDNSPPPNG HAQARVRKNN T FLSAVWLVP LIALIAGG WL WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 



ml54 .pep 
gl54 

ml54 .pep 

gi54 

ml54 .pep 
gl54 
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151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 

401 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 

451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 

551 GSR* 

ml54/al54 100.0% identity in 553 aa overlap 

10 20 30 40 50 60 

ml54 . pep MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPWTLLMDSAE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
al54 MT DN S P P PNGHAQARVRKNNT FLS AVWLV PL I AL I AGGWLWVKE I RNRG P WTLLM DS AE 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml54 • pep GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al54 GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml54 .pep SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 
I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
al54 SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml54 .pep SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al54 SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml54 .pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

I I I I I I I I I I I I I I I II I! I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I II M I I I I 
al54 KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml54 . pep SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 
I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
al54 SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml54 ,pep KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 
I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I 
al54 KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml54 . pep GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALS SI DKLVGKPQTQ 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 

al54 GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALS SI DKLVGKPQTQ 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml54 . pep NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I M I I I M I I I I I I I I I I I I 
al54 NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

490 500 510 520 530 540 
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550 

ml54 . pep NSSSKDPIPKGSRX 
I I I I I I I I I I I I I I 
al54 NSSSKDPIPKGSRX 

550 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 637>: 

gl55 . seq 

1 atGAAaatcg GtatcCCACG CGAGTCAtta tcCGGCGAAA cccgcgtagc 

51 ctgcAcgccc gCCACCGTTG CCctgctggg caAactAGGC TTTGAAACCG 

101 TTGtcgaAAG CGGTGCAggt TTGGCGGCAA GTTTggaCGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGCCTGCCC 

201 TTTAATTTAT AAGGTCAACG CGCCGTCCGA AGGCGAGCTG CCGCTGCTCA 

251 AAGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 

301 TTGGTCGAGG CCTTGCGCGC CAAGAAAGTC AACGCGCTGG CGATGGACAT 

351 GGTTCCCCGC ATTTCCCGCG CTCAGGCCTT GGACGCTTTG TCTTCAATGG 

4 01 CAAACATCAG CGGCTACCGC GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 

4 51 CGTTTCTTCA CCGGTCAAAT CACTGCCGCC GGCAAAGTGC CGCCTGCGCA 

501 GGTTTTGGTG ATTGGCGCCG GTGTGGCGGG TTTGGCGGCA ATCGGTACGG 

551 CAAATTCGCT CGGCGCAGTG GTGCGCGCGT TCGATACCCG CTTGGAAGTG 

601 GCGGAACAAA TCGAATCGAT GGGCGGTAAG TTcctGAAAC TCGACTTCCT 

651 GCAAGAATCG GGCGGCAGCG GAGACGgctA CGCCAAAGTG ATGAGCGACG 

701 AATTTATCGC CGCCGAAATG AAGCTCTTTG CCGAACAGGC GAAAGAAGTG 

751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CTCCCAAGCT 

801 GATTACCAAA GAAATGGTGG AAAGCATGAA ATCCGGATCC GTCATCGTCG 

851 ATTTGGCGGC GACGGGCGGC AACTGCGAAC TCACCCGACC GGGCGAATTG 

901 TCCGTAACCG GCAACGGCGT GAAAATCATC GGCTACACCG ACATGGCAAA 

951 CCGCCTTGCC GGACAGTCTT CCCAGCTTTA CGCCACCAAC TTGGTGAACC 

1001 TGACCAAGCT GTTAAGCCCG AACAAAGAcg gcgaAATCAC GCTGGACTTC 

1051 GAAGacgtGA TTATCCGCAA TATGACCGTT ACCCGcgacg gcgaaATCAC 

1101 CTTCCCGCCT CCGccgaTTc aggtTTCcgc ccggccgCAG CAAAcgccgt 

1151 ctgaAAAagc cgcGCCTGCC GCCAagcccg AgccGaaacc tgttCCcctg 

1201 tggaAAAaac tcgCGCCCGC CGCcatcgCC GCCGTATTGG tgctgtgGgt 

1251 cggCgcggtc gcacccgcag CATTCTTGAA CCACTTTATC GTCTTCGTCC 

1301 TCGCCTGCGT CATCGGCTAC CATGTCGTTT GgaacgTCAG CCACTCGCTG 

1351 CACACACCGC TGAtgtcggt aaccaaCgcc atctccGGCA tcatggtcgt 

14 01 cggCGCGCTG CTGCAAATCG GTCAGGGcaa cggcttcgtT TCgctGCTGT 

14 51 CGTTTGTTGC CATCCTGATT GCCGGCATCA ATATCTTCGG CGGCTTTGCG 

1501 GTTACACGGC GTATGCTGAA TATGTTTAAG AAAGGGTAA 

This corresponds to the amino acid sequence <SEQ ID 638; ORF 155.ng>: 

gl55 . pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWACPLIY KVNAPSEGEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTA NSLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFLQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAATGG NCELTRPGEL 

301 SVTGNGVKII GYTDMANRLA GQSSQLYATN LVNLTKLLSP NKDGEITLDF 

351 EDVIIRNMTV TRDGEITFPP PPIQVSARPQ QTPSEKAAPA AKPEPKPVPL 

4 01 WKK LAPAAIA AVLVLWVGAV AP AAFLNHFI VFVLACVIG Y HWWNVSHSL 

4 51 HTP LMSVTNA ISGIMWGAL LQIGQGNG FV SLLSFVAILI AGINI FGGFA 

501 VTRRMLNMFK KG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 639>: 

ml55 . seq 

1 ATGAAAATCG GTATCCCACG CGAGTCATTA TCCGGCGAAA CCCGCGTCGC 

51 CTGTACGCCC GCCACCGTCG CCCTGCTGGG CAAACTGGGC TTTGAAACCG 

101 TTGTCGAAAG CGGTGCAGGT TTGGCGGCAA GTTTGGACGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGTCTGCCC 

201 TTTGATTTAT AAGGTCAACG CGCCGTCCGA ACAGGAACTG CCGCTTTTGA 

251 ACGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 
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301 TTGGTCGAAG CCTTGCGCGC 

351 GGTGCCCCGC ATTTCGCGCG 

4 01 CAAACATCAG CGGCTACCGC 

4 51 CGTTTCTTCA CCGGTCAAAT 

501 GGTTTTGGTG ATTGGTGCAG 

551 CAAACTCGCT CGGCGCAGTG 

601 GCGGAACAAA TCGAATCGAT 

651 ACAAGAATCG . GGCGGCAGCG 

701 AATTTATCGC AGCCGAGATG 

7 51 GACATCATCA TCACCACCGC 

801 GATTACCAAA GAAATGGTGG 

851 ATTTGGCGGC GGCGACGGGC 

901 TTGTCCGTAA CCGGCAACGG 

951 AAACCGCCTT GCCGGACAGT 

1001 ACCTGACCAA GCTGTTAAGC 

1051 TTCGAAGACG TGATTATCCG 

1101 CACCTTCCCG CCTCCGCCGA 

1151 CGTCTGAAAA AGCCGTGCCT 

1201 CTGTGGAAAA AACTCGCGCC 

1251 GGTCGGCGCG GTCGCACCCG 

1301 TTCTCGCCTG CGTCATCGGC 

1351 CTGCACACAC CGCTGATGTC 

14 01 CGTCGGCGCG CTGCTGCAAA 

14 51 TGTCGTTTGT TGCCATCCTG 

1501 GCGGTAACAC GGCGTATGCT 



CAAGAAAGTG AACGCGCTGG CGATGGATAT 
CGCAGGCTTT GGACGCTTTG TCTTCGATGG 
GCCGTAATTG AAGCCGCCAA CGCCTTCGGC 
TACCGCCGCC GGCAAAGTGC CGCCCGCGCA 
GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 
GTACGCGCGT TCGATACCCG CTTGGAAGTG 
GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 
GAGACGGCTA CGCCAAAGTG ATGAGCGACG 
AAGCTCTTTG CCGAGCAGGC GAAAGAAGTG 
CGCCATTCCG GGCAAACCCG CGCCCAAGCT 
AAAGCATGAA ATCCGGCTCC GTCATCGTCG 
GGCAACTGCG AACTCACCCG CCCGGGCGAA 
CGTGAAAATC ATCGGCTACA CCGACATGGC 
CTTCCCAGCT TTACGCCACC AACTTGGTCA 
CCGAACAAAG ACGGCGAAAT CACGTTGGAC 
CAACATGACC GTTACCCACG ACGGCGAAAT 
TTCAAGTTTC CGCCCAGCCG CAGCAAACGC 
GCCGCCAAGC CCGAGCCAAA ACCCGTTCCC 
CGCCGTCATC GCCGCCGTCT TGGTACTGTG 
CAGCATTCCT GAACCACTTT ATCGTGTTCG 
TACTACGTCG TCTGGAACGT CAGCCACTCG 
GGTAACCAAC GCCATCTCCG GCATCATCGT 
TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 
ATTGCCGGCA TCAACATCTT CGGCGGCTTT 
GAATATGTTT AAGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 640; ORF 155>: 

m!55.pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWVCP LIY KVNAPSEQEL PLLN EGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAAATG GNCELTRPGE 

301 LSVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTHDGEITFP PPPIQVSAQP QQTPSEKAVP AAKPEPKPVP 

4 01 LWKK LAPAVI AAVLVLWVGA V AP AAFLNHF IVFVLACVIG YYWWNVSHS 

451 LHTP LMSVTN AISGIIVVGA L LQIGQGNG F VSLLSFVAIL IAGINI FGGF 

501 AVTRRMLNMF KKG+ 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 155 shows 97.9% identity over a 513 aa overlap with a predicted ORF (ORF 155.ng) 
from N. gonorrhoeae: 

ml55 / gl55 97.9% identity in 513 aa overlap 

10 20 30 40 50 60 

ml 55 . pep MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 
I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I M I I I I I I I I I I I M I I I I I 
gl55 MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 

10 20 30 40 50 60 



70 80 90 100 110 120 

ral55 . pep AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 
I I I I : I I I ! I I I i I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I f I I 
gl55 AAVWACPLIYKVNAPSEGELPLLKEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml55 . pep ISRAQALDALSSMANISGYRAVIE7UVNAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl55 ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
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130 140 150 160 170 180 

190 200 210 220 230 240 

ml55 . pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 
I I M I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I | 
gl55 IGTANSLGAVVRAFDTRLEVAEQIESMGGKFLKLDFLQESGGSGDGYAKVMSDEFIAAEM 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml55 . pep KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I MINIMUM 
gl55 KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAA-TGGNCELTRPGE 

250 260 270 280 290 



310 320 330 340 350 360 

ml55.pep LSVTGNGVKI IGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVI IRNMT 

M I I I I I I II I I I M M I II I II I II I I I I II I I! I I I II M I II M I M I I II I M II I 
gl55 LSVTGNGVKI IGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVI IRNMT 

300 310 320 330 340 350 



370 380 390 400 410 420 

ml55 . pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA 
I M I I I I I II I I I II I I I : II I I I II I I: I II I I I II I I I I I I I I II I : I M I I I I II II 
gl55 VTRDGEITFPPPPIQVSARPQQTPSEKAAPAAKPEPKPVPLWKKLAPAAIAAVLVLWVGA 
360 370 380 390 400 410 



430 440 450 460 470. 480 

ml55.pep VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIVVGALLQIGQGNGF 
I I I I M I I M I I I I I I II I I I : II I I I I I II I M I I II I II I I I I : I I I I I I II I I I II ! 
gl55 VAPAAFLNHFIVFVLACVIGYHVVWNVSHSLHTPLMSVTNAISGIMWGALLQIGQGNGF 
420 430 440 450 460 470 



490 500 510 

ml55 . pep VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
I M II I I II I M M I M I I II II II I I II II I I I 
gl55 VSLLS FV A ILIAGINIFGG FAVTRRMLNM FKKGX 

480 490 500 510 



The following partial DNA sequence 



al55. seq 






1 


ATGAAAATCG 


GTATCCCACG 


51 


CTGTACGCCC 


GCCACCGTCG 


101 


TTGTCGAAAG 


CGGCGCAGGT 


151 


CAAGCAGCAG 


GCGCAACCGT 


201 


TTTAATTTAT 


AAGGTTAACG 


251 


AAGAAGGACA 


GACCATCGTC 


301 


TTGGTCGAAG 


CCTTGCGCGC 


351 


GGTGCCCCGC 


ATTTCGCGCG 


401 


CAAACATCAG 


CGGCTACCGC 


451 


CGTTTNTTCA 


CCGGCCAAAT 


501 


GGTTTTGGTG 


ATTGGTGCAG 


551 


CAAACTCGCT 


CGGCGCAGTG 


601 


GCGGAACAAT 


TAGAATCGAT 


651 


GCAAGAATCG 


GGCGGCAGCG 


701 


AATTTATCGC 


CGCCGAGATG 


751 


GACATCATCA 


TCACCACCGC 


801 


NNTNANCAAA 


GAAATGGTCG 


851 


ATTTGGCGGC 


GGCGACGGGC 


901 


TTGTTCGTAA 


CCGGCAACGG 


951 


AAACCGCCTT 


GCCGGACAGT 


1001 


ACCTGACCAA 


GCTGTTAAGC 


1051 


TTCGAAGACG 


TGATTATCCG 


1101 


CACCTTCCCG 


CCTCCGCCGA 


1151 


CGTCTGAAAA 


AGCCGCGCCT 



identified in N. meningitidis <SEQ ID 64 1>: 

TGAGTCATTA TCCGGCGAAA CCCGCGTCGC 
CCCTGCTGGG CAAACTGGGC TTTGAAACCG 
TTGGCGGCAA GTTTGGACGA TGCCGCTTAC 
TGCCGACAAA GCAGCGGTTT GGGCATACCC 
CGCCGTCCGA AGACGAGCTG CCGCTGCTCA 
AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 
CAAGAAAGTG AACGCGCTGG CAATGGACAT 
CGCAGGCTTT GGACGNTTTG TCTTNGATGG 
GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 
TACTGCCGCA GGCAAAGTGC CGCCCGCGCA 
GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 
GTACGCGTGT TCGATACCCG CCTG.AAGTG 
GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 
GCGACGGCTA CGCCAAAGTG ATGAGCGACG 
AAGCTTTTTG CCGAGCAGGC GAAAGAAGTG 
CGCCATTCCG GGCAAACCCG CGCCCAAGCN 
AAAGCATGAA ACCCGGCTCC GTCATCGTCG 
GGCAACTGCG AACTCACCAA ACAGGGCGAA 
CGTGAAAATC ATCGGCTACA CCGACATGGC 
CTTCGCAGCT TTACGCCACC AACTTGGTCA 
CCGAACAAAG ACGGCGAAAT CACGCTGGAC 
CAACATGACC GTTACCCGCG ACGGCGAAAT 
TTCAAGTTTC CGCCCAACCG CAGCAAACGC 
GCCGCCAAGC CCGAACCGAA ACCCGTTCCC 
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1201 CTGTGGAAAA AACTCGCGCC CGCCNTNATC GCCGCCGTGT TGGTACTGTG 

1251 GGTCGGCGCG GTCGCACCCG CAGCATTCCT GAACCACTTT ATCGTCTTCG 

1301 TCCTCGCCTG CGTCATCGGC TACTATGTCG TTTGGAACGT CAGCCACTCG 

1351 CTGCACACAC CGCTGATGTC GGTGACCAAC GCCATTTCCG GCATCATCGT 

14 01 CGTCGGCGCG CTGCTGCAAA TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 

14 51 TGTCGTTTGT TGCCATCCTG ATTGCCAGCA TCAACATCTT CGGCGGCTTC 

1501 TTTGTAACGC GGCGGATGCT GAATATGTTT AGGAAAGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 642; ORF 155.a>: 

al55 .pep 

1 MKIGIPRESL S GET R VAC TP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QAAGATVADK AAVWAYPLIY KVNAPSEDEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDXL SXMANISGYR AVIEAANAFG 

151 RXFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTA NSLGAV VRVFDTRLXV 

201 AEQLESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKXXXK EMVESMKPGS VIVDLAAATG GNCELTKQGE 

301 LFVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTRDGEITFP PPPIQVSAQP QQTPSEKAAP AAKPEPKPVP 

4 01 LWK KLAPAXI AAVLVLWVGA V AP AAFLNHF IVFVLACVIG YYVVWNVSHS 

4 51 LHTP LMSVTN AISGIIVVGA L LQIGQGNG F VSLLSFVAIL IASINI FGGF 

501 FVTRRMLNMF RKG* 

ml55/al55 95.3% identity in 513 aa overlap 

10 20 30 40 50 60 

ml55 . pep MK I G I PRE S LSGETRVACT PATVALLGKLG FET WE SGAGL AAS LD D AAYQTAG AT V ADK 

I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I : I I I I I I I I 
al55 MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQAAGATVADK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml55 . pep AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 
Mil: I I I I I I 1 I i 1 I : I I I M : i I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I 
al55 AAVWAYPLIYKVNAPSEDELPLLKEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml55.pep ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 

I I I I I I I I I i I I I I I I I I i I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
al55 I SRAQALDXLSXMAN I SGYRAVIEAANAFGRXFTGQITAAGKVPPAQVLV IGAGVAGLAA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml55 . pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 

II I II I I I I I I I : ! I I I I I I II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al55 IGTANSLGAWRVFDTRLXVAEQLESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml55.pep KLFAEQAKEVDI I ITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE 

I I I I I I I I I I I I I I I I I i I I I I I I I I MINIMI M I I I I I I II I I I I I M I : II 
al55 KLFAEQAKEVDI I ITTAAI PGKPAPKXXXKEMVESMKPGSVIVDLAAATGGNCELTKQGE 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml55.pep LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 

i I I I II I I I I I I II I II I I I I I I I II I I I I I I I I M I I I I I I I I I II I I II I I I II I II 
al55 LFVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 55. pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA 
I I M II II M I II II I I I I I II I I I I I I M II I I I I I II I I I I M I I I I I I I I I I I II I 
al55 VTRDGEITFPPPPIQVSAQPQQTPSEKAAPAAKPEPKPVPLWKKLAPAXIAAVLVLWVGA 

370 380 390 400 410 420 
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430 440 450 460 470 480 

VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIWGALLQIGQGNGF 

I I I I I I I I I I [ I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 1 I i I I I I I I I I I I I I I I I I 

VAPAAFLNHFIVFVLACVIGYYVWNVSHSLHTPLMSVTNAISGIIVVGALLQIGQGNGF 
430 440 450 460 470 480 

490 500 510 

ml55 . pep VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
I I I I I I I I I I I I : I I I I I I I I I I I I I I I I : I I I 
a 1 5 5 VS LLS FVAI LI AS INI FGGFFVTRRMLNMFRKGX 

490 500 510 



ml55.pep 
al55 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 643>: 

gl56. seq 

1 ATGACTTTCG CCTATTGGTG CATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCTCG CGGTTTTCTG GCACATACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCGC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 

251 CGCTTGCCGG ATTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 

301 ATCGCAGACA AAGCAGCATT GCGCTCGCTG ATGTGGGCGG GCGGATTTGC 

351 CTGCACCGTC GGACTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 644; ORF 156.ng>: 

gl56 .pep 

1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 IADKAALRS L MWAGGFACTV GLFVAA A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 645>: 

ml56. seq 

1 ATGACTTTCG CCTATTGGTG .TATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCGCG CGGTTTTCTA GCGCACACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCAC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC GGCGCAATCG ACCATCAACA 

251 CGCTTGCCTG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAT 

301 ATCGCCGACA AAGCCGCTAT GCGCTCACTG ATGTGGGCAG GCGGATTTGC 

351 CTGCACCGTC GGGCTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 646; ORF 156>: 

ml56.pep 

1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAAQS TINT LACLFI LFRLAFIWCY 

101 IADKAAMRS L MWAGGFACTV GLFVAA A* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml56 / gl56 96.1% identity in 127 aa overlap 

10 20 30 40 50 60 

ml56 . pep MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 

I I I I I I I I I I I I I I I I I I I I I 1 1 I It I I I I I I I ! I I I I I I I I I I i I I I I I I I I I i I I I I I 
gl56 MTFAYWCILI AC LLP LFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 

10 20 30 40 . 50 60 

70 80 90 100 110 120 

ml56 . pep FAPFAAAVLTAHATGNAAQS TINTLACLFI LFRLAFIWCY I ADKAAMRSLMWAGGFACTV 

j I I I I I I I I I M I i I I I : i : I : I I I I I I f I I I I I I I I I 1 I I I I I I : I t I I I t I I i I I I! 
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g 156 FAPFAAAVLTAHATGNAGQATVNTLAGLFILFRLAFIWCYIADKAALRSLMWAGGFACTV 

70 80 90 100 110 120 

ml 5 6. pep GLFVAAAX 
I I I I I I I I 
gl56 GLFVAAAX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 647>: 



al56 . seq 

1 ATGACTTTCG CCTATTGGTG TATTCTGATT GCCTACCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCGCG CGATTTTCTG GCGCGCACGC AAGGCACAGC CGCCCGTGCC 

151 CACGCCGCGC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCAGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 

251 CGCTTGCCGG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 

301 ATCGCAGACA AAGCAGCATT ACGCTCGCTG ATGTGGGTGG GCGGATTTGT 

351 CTGCACCGTC GGGCTGTTTG TCGTGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 648; ORF 156.a>: 



al56.pep 

1 MTFAYWCILI AYLLPLFCAA YAKKAGGFRF KDNHNPRDFL ARTQGTAARA 

51 HAAQ QNGFEA FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 IADKAALRS L MWVGGFVCTV GLFWA A* 

ml56/al56 90.6% identity in 127 aa overlap 



10 20 30 40 50 60 

ml56.pep MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 

I | || I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II : I I I I I I IN 

al56 MT FAYWC I LI AYLLPL FCAAY AKKAGG FRFKDNHN PRDFLARTQGT AARAHAAQQNG FEA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml56.pep FAPFAAAVLTAHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV 
I | | | | I I I II I I I I I I I : I : I : I I I I I I I I I I I I I I I i I i I I I I I : I I I I I : I I I : I I I 
al56 FAPFAAAVLT AHATGNAGQATVNTLAGLFI LFRLAFIWCY I ADKAALRSLMWVGGFVCTV 

70 80 90 100 110 120 



ml 5 6. pep GLFVAAAX 
1111:11! 
al56 GLFWAAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 649>: 

gl57 .seq 

1 atgaggaacg aggAAAAACg cgccctgcgc cgcgaattgC gCgGgcggcg 

51 ttcgcAAATg GGgcgagacg tGCGggCGGC GGCGgCgatA Aaaatcaacc 

101 gcctgctcaa aCGTtatatc AAGCGCggtc gGaAaatcgG CGTGTATTgg 

151 cCGATGGGCA AGGAATTGcg TTTGGGCGgc tTtgtcCGCG CGGCGCAAAA 

201 ACGCgGCGCA AAactctatc tgccttATAT CGAACCGCAC ACGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GAACGCGGAA TGGAACGGGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGGCGCA AAATCCGCGT 

351 GCACGGTTTG TCGGTATTGC TCGTCCCGCT TGTCGGCATA GACCGCGAAG 

401 GCTACCGTTT GGGGCAGGCA GGCGGCTATT ACGATGCGAC GCTTTCGGCG 

451 ATGAAATACC GTTTGCAGGC GAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTCCCAC GCGAGGCGCA CGACCTGCCG CTGGACGGTT 

551 TTGTATCGGA AGCGGGGATA TTGTGTTTTT AG 
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This corresponds to the amino acid sequence <SEQ ID 650; ORF 157.ng>: 

gl57 .pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAAAI KINRLLKRYI KRGRKIGVYW 

51 PMGKELRLGG FVRAAQKRGA KLYLPYIEPH TRRMWFTPYP ERGMERERKR 

101 GRAKLHVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPREAHDLP LDGFVSEAGI LCF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65 1>: 

ml57.seq 

1 ATGAGGAACG AGGAAAAACG CGCCCTGCGC CGCGAATTGC GCGGGCGGCG 

51 TTCGCAAATG GGGCGGGACG TGCGGGCGGC GGCAACGGTA AAAATCAACC 

101 ACCTGCTCAA ACGTTATATT AAAAAAGGGC GGAAAATCGG CGTGTATTGG 

151 CCGATGGGCA AGGAATTGCG TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 

201 ACGCGGTGCG GAACTCTACC TGCCTTATAT CGAACCGCGT TCGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GCCGATGGAG TAAAACAAGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGTCGGA AAAAGCGTGT 

351 GCATGATTTG AACCTCCTGC TTGTGCCAGT GGTCGGTATG GACAGGCTGG 

4 01 GCTACCGCTT GGGACAGGCA GGCGGCTATT ACGATGCGAC GCTTTCAGCG 

451 ATGAAATACC GTTTGCAGGC AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTGCCGG TCGAGGCGCA CGACCGGTCT TTGGACGGTT 

551 TTGTGTCGGA GGCGGGGATA TTGTGTTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 652; ORF 157>: 

ml57 .pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAATV KINHLLKRYI KKGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA ELYLPYIEPR SRRMWFTPYP ADGVKQERKR 

101 GRAKLHVPQF AGRKKRVHDL NLLLVPWGM DRLGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPVEAHDRS LDGFVSEAGI LCF* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml57 / gl57 88.1% identity in 193 aa overlap 

10 20 30 40 50 60 

ml 5 7 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKINHLLKRYIKKGRKIGVYWPMGKELRLDG 

■ | 1 | I I I I I I I I I I 1 I I I l-l I I I I I I I I I : : I I I : I I I I M I : I I I I I I I I I I I I I I I I I 
g 157 MRNEEKRALRRELRGRRSQMGRDVRAAAAIKINRLLKRYIKRGRKIGVYWPMGKELRLGG 
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ml 57 . pep FVRAAQKRGAELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
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gl57 FVRAAQKRGAKLYLPYIEPHTRRMWFTPYPERGMERERKRGRAKLHVPQFAGRKIRVHGL 

70 80 90 100 110 120 
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ml57 . pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 
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g 157 SVLLVPLVGI DREGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPREAHDLP 
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ml57 . pep LDGFVSEAGI LCFX 

I I I I I I II I I I I I I 
gl57 LDGFVSEAGI LCFX 

190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 653>: 

al57.seq 

1 ATGAGGAACG AGGAAAAACA CGCCTTGCGC CGAGAGTTGC GCCGCGCCCG 
51 CGCGCAGATG GGGCATCAAG GGCGGTTGGC GGCGGGGCAA ACGATTAACC 
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101 GCCTGCTCAA ACGTTATATC 

151 CCGATGGGCA AGGAATTGCG 

201 ACGCGGTGCA AAACTTTATC 

251 TGTGGTTTAC GCCGTATCCT 

301 GGCAGGGCGA AGTTGAACGT 

351 GCACGGTTTG TCGGTATTGC 

4 01 GCTACCGCTT AGGACAGGCA 

4 51 ATGAAATACC GTTTGCAGGC 

501 GTTTGTGGAC AGGCTGCCGC 

551 TTGTGTCGGA GGCGGGGATA 



AAGCGTGGTC GGAAAATCGG CGTGTATTGG 
TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 
TGCCTTATAT CGAACCGCGT TCGCGGCGGA 
GAAAGCGGAA TGGAACGGGA GCGCATACGG 
GCCGCAGTTT GCAGGGCGCA AAATCCGCGT 
TCGTCCCGCT TGTCGGCATA GACCGCGAGG 
GGCGGCTATT ACGATGCGAC GCTTGCGGCG 
AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 
GCGAACCGCA CGATCTGCTG CTGGACGGTT 
TTGTGCTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 654; ORF 157.a>: 

al57 .pep 

1 MRNEEKHALR RELRRARAQM GHQGRLAAGQ TINRLLKRYI KRGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA KLYLPYIEPR SRRMWFTPYP ESGMERERIR 

101 GRAKLNVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLAA 

151 MKYRLQAKTV GVG FACQFVD RLPREPHDLL LDGFVSEAGI LCF* 

ml57/al57 82.4% identity in 193 aa overlap 



10 20 30 40 50 60 

ml57 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKINHLLKRYIKKGRKIGVYWPMGKELRLDG 
111111:1111111 I : I I I : : I I I I I : I I I I I I I : I 1 I I I I I I I I I I I I I ! I i 
al57 MRNEEKHALRRELRRARAQMGHQGRLAAGQTINRLLKRYIKRGRKIGVYWPMGKELRLDG 
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ml57 . pep FVRAAQKRG7VELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : I : : : I I I I I I I I : I I I I I I I I Ml i 
a 157 FVRAAQKRGAKLYLPYIEPRSRRMWFTPYPESGMERERIRGRAKLNVPQFAGRKIRVHGL . 

70 80 90 100 110 120 
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ml 57 . pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 
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al57 SVLLVPLVGI DREGYRLGQAGGYYDATLAAMKYRLQAKTVGVGFACQFVDRLPREPHDLL 

130 140 150 160 170 180 



190 

ml57 .pep LDGFVSEAGI LCFX 

I I I I I I I I I I I I I I 
al57 LDGFVSEAGI LCFX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 65 5>: 

gl58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGgagcAGTT GGAGAtggCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGCGT GAAcCTGCtc 

151 aACCGCACCA CGCGGCAACT CAATCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAGTA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCcgatgg TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCcgaC TTTCGCTCGT TTCTTCCGAa ggctatatca 

4 01 atctGattGA Acgcaaagtc gAtatTGCCT TACGGGCCGG AGAATTGGAC 

4 51 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCACT TCCGCGtagt 

501 cgCCAGTCCT GAATATTTAG CAAAACACGG CACGCCACAA TCTGCAGAAG 

551 atcTTGCCAA CCATCAATGT TTAGGCTTCA CAGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGAtgC GCAGGGAAAT CCCTATAAAA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAAGtt 

701 gCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCACTGAA 

7 51 GGAAAGTTAA TTCCcctatt cgCCGAACAA ACCTCCAATA AAACACACCC 
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801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTTACGCG 
851 TATTTTTGGA TTTTTTAGTG AAGGAACTGG GAAAAAATAT GAATAGAACG 
901 - AATACCAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 656; ORF 158.ng>: 

g-158 .pep 

1 MKTNSEELTV FVQVVESGSF SRA AEQLEMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLNLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEV PQGVLRVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSHFRVVASP EYLAKHGTPQ SAEDLANHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEILR SLC LSSCGIACLS DFLV DNDITE 

251 GKLIPLFAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV KELGKNMNRT 

301 NTK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65 7>: 

ml58.seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG TGTTGAGCGT GGATTCCGCG 

301 ATGCCGATGG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

401 ATCTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

4 51 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

701 GCGGTATTGT TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCGATA AAACACACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAATCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 

This corresponds to the amino acid sequence <SEQ ID 658; ORF 158>: 

ml58 .pep 

1 MKTNSEELTV FVQVVESGSF SRAA EQLAMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLSLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEI PQGVLSVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSRFRVIASP EYLAKHGTPQ STEELAGHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEILR SLC LSGCGIVCLS DFLV DNDIAE 

251 GKLIPLLAEQ TSDKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

94.3% identity in 297 aa overlap 
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MKTNSEELTV FVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
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MKTNSEELTVFVQVVESGSFSRAAEQLEMANSAVSRIVKRLEEKLGVNLLNRTTRQLNLT 
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EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLSVDSAMPMVLHLLAPLAAKFNERYP 
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EEGAQYFRRAQRILQEMAAAETEMLAVHEVPQGVLRVDSAMPMVLHLLAPLAAKFNERYP 
70 80 90 100 110 120 



ml58 / gl58 

ml58 .pep 
gl58 

ml58 .pep 
gl58 
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ml 58. pep HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
I I I I 1 I I II I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I t I : I I I : I I I I I I I I I I I I I 
gl58 HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSHFRVVASPEYLAKHGTPQ 
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m!58 . pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
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gl58 SAEDLANHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSSCGIACLS 

190 200 210 220 230 240 
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ml58 . pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 
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gl58 DFLVDNDITEGKLIPLFAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVKELGKNMNRT 

250 260 270 280 290 300 



gl58 



NTKX 



identified in N. meningitidis <SEQ ID 659>: 



The following partial DNA sequence was 

al58.seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCCGATGG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

4 01 ATCTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

701 GCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCAATA AAACGCACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 

This corresponds to the amino acid sequence <SEQ ID 660; ORF 158.a>: 



al58 .pep 

1 
51 
101 
151 
201 
251 



MKTNSEELTV FVQWESGSF SRAAEQLAMA 



NRTTRQLSLT EEGAQYFRRA 

MPMVLHLLAP LAAKFNERYP 

DSGLRARHLF DSRFRVIASP 

TWAVLDAQGN PYKISPHFTA 

GKLIPLLAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 



QRILQEMAAA 
HIRLSLVSSE 
EYLAKHGTPQ 
SSGEILRSLC 



NSAVSRIVKR 
ETEMLAVHEI 
GYINLIERKV 
STEELAGHQC 
LSGCGIACLS 



LEEKLGVNLL 
PQGVLRVDSA 
DIALRAGELD 
LGFTEPGSLN 
DFLVDNDIAE 



ml58/al58 99.0% identity in 299 aa overlap 

10 20 30 40 50 60 

ml 58 . pep MKTNSEELTVFVQWESGSFSRAAEQU^MANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
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al58 MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
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ml58 . pep E EGAQ Y FRRAQR I LQEMAAAETEMLAVHE I PQGVLS VDS AM PMVLHLLAPLAAKFNE R Y P 
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al58 EEGAQY FRRAQR I LQEMAAAETEMLAVHE I PQGVLRVDS AM PMVLHLLAPLAAKFNERYP 
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130 140 150 160 170 180 

ml58 . pep HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al58 HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml58 . pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I : I I I 
a 158 STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIACLS 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml58 . pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 
I | I | I I I I I I I I I I I I I I II I I: I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
al58 DFLVDNDIAEGKLIPLLAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 66 1>: 

gl60. seq 

1 ATGGAcattc tgGACAAact ggtcgatCTC GCccaATTGA CGGGCAGTGC 

51 GGATGTGCAG TgcctTTTGG GCGGACAATG gcATGaaacc TTGCAACGCG 

101 AAGGGCTGGT ACACATTGTT ACGGCGGGCA GCGGTTATCT CTGCATCGAC 

151 GGCGAAACTT CCCCGCGTCC GGTCGGCACG GGCGATATTG TATTTTTCCC 

201 GCGCGGCTTG GGTCATGTGT TGAGCCACGA CGGAAAATAC GGAGAAAGTT 

251 TACAACCGGA CATACGACAA AACGGCACAT TTATGGTCAA ACAGTGCGGC 

301 AACGGGCTGG ATATGAGCCT GTTTTGCGCC CGTTTCCGCT ACGACACCCA 

351 CGCCGATTTG ATGAACGGGC TGCCGGAAAC CGTTTTTCTG AACATTGCCC 

4 01 ATCCAAGTTT GCAGTATGTG GTTTCAATGC TGCAACTGGA AAGCGAAAAA 

4 51 CCTTTGACGG GGACGGTTTC CGTGGTCAAC GCATTACCGT CCGTGCTGCT 

501 GGTGCTTATC CTGCGCGCCT ATCTCGAACA GGATAAGGAT GTCGAACTCT 

551 CGGGCGTATT GAAAGGTTGG CAGGACAAAC GTTTGGGACA TTTGATCCAA 

601 AAGGTGATAG ACAAACCGGA AGACGAATGG AATATTGACA AAATGGTTGC 

651 CGCCGCCAAT ATGTCGCGCG CGCAACTGAT GCGCCGCTTC AAAAGCCAAG 

701 TCGGACTCAG CCCGCACGCC TTTGTGAACC ATATCCGCCT GCAAAAAGGC 

751 GCATTGCTGC TGAAGAAAAC CCCGGATTCG GTTTTGGAGG TCGCGCTGTC 

801 GGTGGGCTTT CAGTCGGAAA CGCATTTCGG CAAGGCGTTC AAACGGCAAT 

851 ATCACGTTTC GCCGGGGCAA TACCGGAAAG AAGGCGGGCA AAAATAA 



This corresponds to the amino acid sequence <SEQ ID 662; ORF 160. ng> : 

gl60 .pep 

1 MDILDKLVDL AQLTGSADVQ CLLGGQWHET LQREGLVHIV TAGSGYLCID 

51 GETSPRPVGT GDIVFFPRGL GHVLSHDGKY GESLQPDIRQ NGTFMVKQCG 

101 NGLDMSLFCA RFRYDTHADL MNGLPETVFL NIAHPSLQYV VSMLQLESEK 

151 PLTG TVSWN ALPSVLLVLI L RAYLEQDKD VELSGVLKGW QDKRLGHLIQ 

201 KVIDKPEDEW NIDKMVAAAN MSRAQLMRRF KSQVGLSPHA FVNHIRLQKG 

251 ALLLKKTPDS VLEVALSVGF QSETHFGKAF KRQYHVSPGQ YRKEGGQK* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 663>: 

ml60. seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 

51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACCT 

101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TATGAGCCTG TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

401 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

4 51 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 
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501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AggCGGGCAA 

901 AAATAA ^ 



This corresponds to the amino acid sequence <SEQ ID 664; ORF 160>: 



ml 60 



pep 
1 
51 
101 
151 
201 
251 
301 



MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR 
CIDGETSPRP VSTGDIVFFP RGLGHVLSHD 
QCGNGQDMSL FCARFRYDTH ADLMNGLPET 
SKKPLTG TVS MVNALSSVLL VLILRAYLEQ 
LIQKVIDKPE DEWNVDKMVA AANMSRAQLM 
QKGALLLKKN PDSVLSVALS VGFQSETHFG 
. _ _ K + 

Computer analysis of this amino acid sequence gave the following 
Homology with a predicted ORF from N. gonorrhoeae 

ml60 / gl60 93.4% identity in 301 aa overlap 



HETLQREGLV 
GKCGESLQPD 
VFLNIAHPSL 
DKDVELSGVL 
RRFKSRVGLS 
KAFKRQYHVS 



HIVTSGSGYL 
MRQHGAFTVK 
QYWSMLQLE 
KGWQDKRLGH 
PHAFVNHIRL 
PGQYRKEGGQ 

results: 



ml 60 . pep 
g!60 

ml 60 .pep 
gl60 

ml 60 . pep 
gl60 

ml 60. pep 
gl60 

ml 60 .pep 
gl60 



10 20 30 40 50 60 

MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
lllllllll-lllllhllllllMII I Ml I M I II INI: II II I I I I I II M I I 
MDILDKLVDLAQLTGSADVQCLLGGQW—HETLQREGLVHIVTAGSGYLCIDGETSPRP 
10 20 30 40 50 

70 80 90 100 110 120 

VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
I = I I I I M I I I I II I ! I I I I I I I I I I I I i : f | : | : | I f II I I I II M I I I M I M I I 
VGTGDIVFFPRGLGHVLSHDGKYGESLQPDIRQNGTFMVKQCGNGLDMSL FCARFRYDTH 
60 70 80 90 100 110 

130 140 150 160 170 180 

ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 
"MM II MMMM MMMMIMM MMMiMMiMMI MMMMMMM 
ADLMNGLPETVFLNIAHPSLQYWSMLQLESEKPLTGTVSVVNALPSVLLVLILRAYLEQ 
130 140 150 160 170 

190 



120 



200 210 220 230 240 

DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 

INMIMMMMMMMIIMMIMIMIhMIIMMMMI IMIMI 

DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNIDKMVAAANMSRAQLMRRFKSQVGLS 
180 190 .200 210 220 230 

250 260 270 280 290 300 

PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 
MMM II IMIMMMMMMM M M M M M M II M M M M M M M M M M 
PHAFVNHIRLQKGALLLKKTPDSVLEVALSVGFQSETHFGKAFKRQYHVSPGQYRKEGGQ 
24 0 250 260 270 280 290 



ml 60. pep 
gl60 



KX 
I I 
KX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 665>: 

al60.seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 
51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACCT 
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101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TATGAGCCTG TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

401 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

4 51 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 

501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AGGCGGGCAA 

901 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 666; ORF 160.a>: 

al60.pep 

1 MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQREGLV HIVTSGSGYL 

51 CIDGETSPRP VSTGDIVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 

101 QCGNGQDMSL FCARFRYDTH ADLMNGLPET VFLNIAHPSL QYWSMLQLE 

151 SKKPLTG TVS MVNALSSVLL VLIL RAYLEQ DKDVELSGVL KGWQDKRLGH 

201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 

251 QKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 

301 K* 

ml60/al60 100.0% identity in 301 aa overlap 

10 20 . 30 40 50 60 

ml 60 . pep MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 

M I M I I 1 1 1 1 1 1 I I I I 1 1 I 1 1 1 1 1 1 I I I I I I I 1 1 I 1 1 i I I ! I I I I I I I 1 1 I 1 1 | | | 1 1 | 

al60 MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 60 . pep VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
I I 1 I I I I M I I I I I I I II II I I II I I I I I I I I I I I | | | | | i | | | | | | | | | | | | | | | | | | | 
al60 VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 60. pep ADLMNGLPETVFLNIAHPSLQYVVSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 
I M I I Ml M I il I I I II I M! I I! I I Ml I ! I! II I I M M I I I I I I I Ml I I I I M II 
al60 ADLMNGLPETVFLNIAHPSLQYVVSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 60 . pep DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 
I I I I M I I I I I I I I I M I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I | | | | M I I 
al60 DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 60 . pep PHAFVNHIRLQKGALLLKKN PDSVLSVALS VGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
al60 PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVSPGQYRKEGGQ 

250 260 270 280 290 300 



ml60 .pep 
al60 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 667>: 

gl61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTCACCGTTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTACGC TCGGTGCTGC CGCCGTATTG CGGCGCGACA CCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGAC AACCGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTttg GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

4 01 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

4 51 CCGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGCAACC GGCGTGGCGA TGTCGTCggt ttgggcgacg 

601 Ctgaccggct ggCACAcccT GTCCTTTcca tcggcagttt ATCtgtCGGG 

651 CATCGGCGTG tccgcgCtgA TTGCCCAaCT GtcgatgAcg cGCGcctaca 

7 01 aaGTCGGCGA CAAATTCACG GTTGCCTCGC tttcctaTAt gaccgtcGTC 

751 TTTTCCGCCC TGTCTGCCGC ATTTTTTCTg ggcgaagagc ttttctggCA 

801 GGAAATACTC GGTATGTGCA TCATTATcct CAGCGGCATT TTGAGCAGCA 

851 TCCGCCCCAT TGCCTTCAAA CAGCGGCTGC AAGCCCTCTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 668; ORF 161.ng>: 

gl61 .pep 

1 MDTAKKDILG SGWMLVAAA C FTVMNVLIKE ASAKFALGSG ELVFWRMLFS 

51 TVTLGAAAVL RRDTFRTPHW KNHLNRS MVG TGAMLLLFYA VTHL PLTTGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGWLL LNPSF RSGQE 

151 PAALAGLAGG AMSGWAYLKV RELSLAGEPG WRVVFYLSAT GVAMSSVWAT 

201 LTGWHTLS FP SAVYLSGIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTVV 

251 FSALSAAFFL GEELFWQ EIL GMCIIILSGI LSSI RPIAFK QRLQALFRQR 

301 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 669>: 

ml61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG CGTCGGGACA mCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGGC CACTGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTTTG GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA TGTCGTCGGT TTGGGCGACG 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA TCGGCAGTTT ATCTGTCGTG 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT GTCGATGACG CGCGCCTACA 

701 AAGTCGGCGA CAAATTCACG GTTGCCTCGC TTTCCTATAT GACCGTCGTT 

751 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG GGCGAAGAGC TTTTCTGGCA 

801 GGAAATACTC GGTATGTGCA TCATCATCCT CAGCGGTATT TTGAGCAGCA 

851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 670; ORF 16 1>: 

ml 61 -pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAV L RRDX FRTPHW KNHLNRSMVG TGA MLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQAVL LLGFAGWLL LNPSFRSGQE 
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151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR VVFYLSVT GVAMSSVWA T 

201 LTGWHTLS FP SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTVV 

251 FSALSAAFFL GEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 

301 * ~ " — 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml61 / gl61 97.0% identity in 300 aa overlap 



10 20 30 40 50 60 

ml 61 . pep MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 
I I I I I I I I I I I ! I I I I I I I I I I : I I I I I I I I I I I I I I I I i I I I I I I I I I | | | : | M I I I I 
gl61 MDTAKKDILGSGWMLVAAACFTVMNVLIKEASAKFALGSGELVFWRMLFSTVTLGAAAVL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 61 - pep RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I | I I | | | | | | 
gl61 RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLTTGVTLSYTSSIFLAVFSFLILKE 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 61 . pep RISVYTQAVLLLGFAGVVLLLNPSFRSGQETAALAGLAGGAMSGWAYLKVRELSLAGEPG 
I I I I I I I I I I I I I I 1 I I I I I I I I I I I 1 I I I I I I I I I I I I | | I | | I | | | | | | I | | | | I | | 
gl61 RISVYTQAVLLLGFAGVVLLLNPSFRSGQEPAALAGLAGGAMSGWAYLKVRELSLAGEPG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 61 . pep WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 

I I 1 1 1 I I h I I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 1 1 1 1 1 1 | 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 

gl61 WRWFYLSATGVAMSSVWATLTGWHTLSFPSAVYLSGIGVSALIAQLSMTRAYKVGDKFT 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 61 .pep VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 1111111:11111 
gl61 VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPIAFKQRLQALFRQR 

250 260 270 280 290 300 

ml 61. pep X 
I 

gl61 X 



The following partial DNA sequence was 

al61.seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC 

301 ACCCTGAGTT ACACCTCGTC GATTTTTTTG 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT 

451 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT 

7 01 AAGTCGGCGA CAAATTCACG GTTGCCTCGC 

7 51 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG 

801 GGAAATACTC GGTATGTGCA TCATCATCCT 



identified in N. meningitidis <SEQ ID 67 1>: 



TCGGGCTGGA 
GATTAAAGAG 
TTTGGCGCAT 
CGTCGGGACA 
TATGGTCGGG 
TGCCTTTGGC 
GCGGTATTTT 
GGCGGTGCTG 
CGTTCCGCAG 
GCGATGTCCG 
CGAACCCGGC 
TGTCATCGGT 
TCGGCAGTTT 
GTCGATGACG 
TTTCCTATAT 
GCCGAAGAGC 
CAGCGGTATT 



TGCTGGTGGC 
GCATCGGCAA 
GCTGTTTTCA 
CCTTCCGCAC 
ACGGGGGCGA 
CACCGGCGTT 
CCTTCCTGAT 
CTCCTTGGTT 
CGGTCAGGAA 
GCTGGGCGTA 
TGGCGCGTCG 
TTGGGCGACG 
ATCTGTCGTG 
CGCGCCTACA 
GACCGTCGTT 
TTTTCTGGCA 
TTGAGCAGCA 
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851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 
901 TAA 

This corresponds to the amino acid sequence <SEQ ID 672; ORF 16La>: 

al61.pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAV L RRDTFRTPHW KNHLNRSMVG TGA MLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGWLL LNPSF RSGQE 

151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR WFYLSVT GVAMSSVWA T 

201 LTGWHTLS FP SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VASLSYMTVV 

251 F5ALSAAFFL AEELFW QEIL GMCIIILSGI LSSIRPTAFK QRLQSLFRQR 

301 * 

ml61/al61 99.3% identity in 300 aa overlap 

10 20 30 40 50 60 

ml 61 . pep MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 

« 1 1 1 i 1 1 1 1 1 1 M i ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

a161 MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 

10 20 30 40 50 60 

7 0 80 90 100 110 120 

ml 61. pep . RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 

IIIHIIMII I Ml Ml M II ! Ml II I I I M I I j M || M (I 1 I M II I I I II 

al61 RRDT FRT PH WKNHLNRSMVGTGAMLLL FYAVTHL PLATGVT LS YTS S I FLAVFS FL I LKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 61 . pep RI SVYTQAVLLLGFAGVVLLLNPS FRSGQETAAIiAGLAGGAMSGWAYLKVRELS LAGE PG 

I I N N I I I I I I I I I I i I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I ! I | I | | i | | | | 
a161 RI SVYTQAVLLLGFAGVVLLLNPS FRSGQETAALAGLAGGAMSGWAYLICVRELS LAGE PG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 61 . pep WRVVFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 
> I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I 
WRVVFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 
190 200 210 220 230 240 



al61 



250 260 270 280 290 300 

ml 61. pep VASLSYMTVVFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 
I ' I I I I I I II I II I I I I I M : I | | | | | | I ! I I I I M i M I I I I I I I I II M I I II 1 I I I I 
ai61 VASLSYMTVVFSALSAAFFLAEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 

250 260 270 280 290 300 

ml61.pep X 
I 

al61 X 



gl63 . seq 








1 


ATGGTTATTT 


TGACGACTTT 


GTTTTTTGTG 


51 


TTTAACCGTG 


CCGGATCAGG 


TGCAGATGTG 


101 


TCATTTTTAC 


CGAGTTCAGC 


TGGTTTTATG 


151 


ctgGGTTTtc 


tgctGATACT 


CTCGGTCAGC 


201 


AGGACGGGAT 


GAAGATGTGC 


CGGAATTCGG 


251 


TGCTGTTTGC 


GGCCGGGATG 


GGCGTGGGCC 


301 


GAGCCGTTGA 


TGCATTATTT 


TTCGGACATT 


351 


CAGGCAGCAG 


CAGGCATTGC 


TGCACACGGT 


401 


CCTGGTCGGT 


GTACGGTACG ATTGCATTGG 



TGGCTGGCGA 
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4 51 CGCTACAAAC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCGG 

651 CGTGCAGGTC TTGATTATCG CCGCCGTAAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGCCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG ACCCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTGAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGgc 

951 gcCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGg cgcaccatCc 

1001 gcgagtttgt CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAATG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAC TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CCCTGTTTTT TGTAACTTCT GCCGACTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGatgtcTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

1401 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGCT 

14 51 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTCAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCGGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACATACC GCATCGCCCG 

1601 CTATGCACGA GTTGCAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AGATGTTTCA TCAGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CACCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 674; ORF 163.ng>: 

gl63 .pep 

1 M VILTTLFFV CVLWLVL TV PDQVQMWLDR AKEVIFTEFS WFY VLTFSIF 

51 LGFLLILSVS GLGNIRLGRD EDVPEFGFLS WL AMLFAAGM GVG LMFFGV A 

101 EPLMHYFSDI TVGAPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEMGW IAENSFGVQ V LI I AA VMS LA VVSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV LAAD PTVYLL SAFGDNIGNY LGNLVRLSLK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPELTSIVSL 

4 01 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM 

4 51 RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVRIMSQTQE QDILKFLKHT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 675>: 

ml63.seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACATCAGGCT 

201 CGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGCCGGCA CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

401 CTTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 

451 CGCTACAAGC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCAG 
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651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 



CGTGCAGGTT 

CAATATCCGG 

GGCCTTGCGT 

TTACCTGTTG 

TGGTGCGCCT 

TGGTTTGAAT 

GCCGTTTGTG 

GCGAGTTTGT 

TGGTTTACCG 

GGGGGGAATG 

AATTCTTTAA 

CTGGTCATTT 

CCTGAACAAT 

AGGCGGTTAT 

CGCTCGGGCG 

GCCGTTTGCC 

TGAGTGCGGA 

TTTTGGACGG 

GACGCAGGAG 

CTATGCACGA 

CGGGTCGATA 

TCGGAAAGAG 

AGGATGTATC 

CATCAGACAA 

GTACGATGTG 

AAAACTACGA 

ATGGCGCACG 



TTGATTATCG 

CGTGGGGAAG 

TTTTGCTGCT 

TCGGCATTCG 

CAGTTTTAAA 

CTTGGACGGT 

GGTTTGTTTA 

CTTCGGGGTT 

TCTTCGGCAA 

CTCGAAAAGA 

TTACCTCCCC 

CTCTGTTTTT 

ATTACCTCTC 

GTGGGGCGTG 

GACTCGGCAA 

CTGCTGATGC 

TAAGAAATAT 

GCGGCAAGTG 

CAGGATATTT 

GTTGCAACGG 

AAATGTTTCA 

ACGATGCGCG 

CGACCAGTTG 

CTTACAAACC 

CAGTATATGA 

ACGTTATTTG 

AGCAGGTGGA 



CCGCCGTCAT 

GGCGTGAAGG 

GTTTTTTGTT 

GCGACAACAT 

ACTTATGCGT 

GCTTTATTGG 

TCGCGCGCAT 

TTGCTCATCC 

TACGGCGATT 

TGACCTCCTC 

CTGCCCGAAT 

TGTAACTTCT 

GGGACAAAGG 

CTGATGTCTG 

CCTGCAGTCT 

TGATAATGTG 

TTTGAGACCC 

GAAAGAACGG 

TAAAATTCCT 

GAGCTTTCGG 

TCGGGACGAG 

ATTTTATGTA 

ATTAACGACG 

CTACGCTTAT 

ACAAGGACGA 

ATGTTGTTGG 

ATTGGCAGAG 



GTCCCTCGCC 

TGTTGAGCGA 

TTGGCGGCGG 

AGGGAACTAC 

ACGAACGGGA 

GCGTGGTGGT 

TTCAAAGGGG 

CCGGCCTGTT 

TGGCTGAATG 

TCCGGAAACG 

TGACGAGCAT 

GCCGATTCCG 

CTTGAGCGCG 

CCGTTGCCGT 

ATGACCCTGA 

TTTCAGCCTG 

GGGTTAACCC 

CTGGTGCAGA 

CAAACAGACT 

AAGAATACGG 

CCCGCAATCG 

CGGGATTAAG 

GCAAGCTGCC 

TTTTTCGACG 

GCTGATTGCC 

ATGATGTCGG 

TAA 



GTCGTTTCGG 

GTTGAACCTG 

GACCCACTGT 

CTCGGAAATC 

ACACAAGCCG 

GTTCTTGGGC 

CGCACCATCC 

CGGCGTTTTG 

ACGGGGTTGC 

CTGCTTTTTA 

CGTCAGCCTG 

GGATTTATGT 

CCACGGTGGC 

TTTGCTGATG 

TTGTTTCCCT 

TGGAAAGGCT 

TACCAGTGTA 

TAATGAGCCA 

GCATCGCCCG 

CTTGAGCGTC 

AGTTCGTCAT 

TCTGTCGGGC 

GCATATCCGG 

GGCGCGTCGG 

GACATTTTGA 

TCAGGAACTG 



This corresponds to the amino acid sequence <SEQ ID 676; ORF 163>: 

ml 63 . pep 

M VILTTLFFV CVLWLVL TV PDQVQMWLDR AKE VIFTEFS WFYVLTFSIF 
LGFLLILSVS SLGNIRLGRD EDVPE FGFLS WL AMLFAAGM GVGLMFFGV A 
EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 
RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLG F GASO 
LGAGLQEMGW IAENSFSVQ V LIIAAVMSLA VVSAIS GVGK GVKVLSELNL 
GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 
WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 
WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 
LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM . 
RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 
FWTGGKWKER LVQIMSQTQE QDILKFLKQT AS PAMHELQR ELSEEYGLSV 
RVDKMFHRDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 
HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 
MAHEQVELAE * 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml63 / gl63 98.6% identity in 660 aa overlap 

10 20 30 40 50 60 

ml 63. pep MVILTTLFFVCVLVVLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
I I I Ml I I I M II II I I I I II I M I [| I M II ! I Ml I I I I II I I I I M I II M i I I I I! 
9 1 6 3 MVI LTTLFFVCVLWLVLTVPDQVQMWLDRAKEVI FTE FS WFYVLTFS I FLG FLLILSVS 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 63 . pep SLGN I RLGRDEDVPE FGFLS WLAMLFAAGMGVGLMFFGVAEPLMHYFSD I TAGTPEHRQQ 

MINIMUM I MM I! Mill I II I I | M I M I I I M I II I I I: I: | M I I ! 

9163 GLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITVGAPEHRQQ 

70 80 90 100 110 120 

130 140 150 160 170 180 



WO 99/57280 



PCT/US99/09346 



462 



ml 63 . pep QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | || | | | | 
gl63 QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml63 . pep MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 
I I M I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I : I I I I I I I I I I | | | || | | | | | | | | 
gl63 MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFGVQVLIIAAVMSLAWSAISGVGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I : I I I I I I I I I I I 
gl63 GVKVLSELNLGLAFLLLFFVLAADPTVYLLSAFGDNIGNYLGNLVRLSLKTYAYEREHKP 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 63 . pep WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 
I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I | I | | | 
gl63 WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml63 . pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 
I M M I II I II I I I I I I I I I I I I I I | I | || | | | | | | || | | | | | | | | | n I I I I I I I I I I I 
gl63 WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 63 . pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 
M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl63 ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 63 . pep WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTASPAMHELQR 
I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I : I I I I I I I I I I I I I I I : I I I I | | I | | | | 
gl63 WKGLSADKKYFETRVNPTSVFWTGGKWKERLVRIMSQTQEQDILKFLKHTASPAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 63. pep ELSEEYGLSVRVDKMFHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 
I I I I I f I I I I I I I I I II: I I I I I I I | | I I I | i | | | | | I I I I I I I I I I I I M I I I I I I IN 
g!63 ELSEEYGLSVRVDKMFHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 
I M I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | J | | j | | | | | | | | | | 
gl63 HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml 63. pep X 
I 

gl63 X 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 677>: 

al63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACATCAGGCT 



463 



201 CGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGCCGGCA CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

401 CTTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 

451 CGCTACAAGC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATAGGCTGG ATTGCCGAAA ACAGCTTCAG 

651 CGTGCAGGTT TTGATTATCG CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGTGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGTCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG GTCCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTTAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 

951 GCCGTTTGTG GGTTTGTTTA ' TCGCGCGCAT TTCAAAGGGG CGCACCATCC 

1001 GCGAGTTTGT CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAGTG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAT TG AC GAG CAT CGTCAGCCTG 

1201 CTGGTCATTT CTCTGTTTTT TGTAACTTCT GCCGATTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGATGTCTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

1401 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGAT 

14 51 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTTAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCAGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACATACC GCATCGCCCG 

1601 CTATGCACGA GTTACAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AGATGTTTCA TCAGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CATCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 



This corresponds to the amino acid sequence <SEQ ID 678; ORF 163.a>: 

al63 . pep 

1 M VILTTLFFV CVLWLVL TV PDQVQMWLDR AKEVIFTEFS WFYVLTFSIF 

51 LGFLLILSVS SLGNIRLGRD EDVPE FGFLS WL AMLFAAGM GVGLMFFGV A 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEIGW IAENSFSVQ V LIIAAVMSLA WSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGV LEKMTSSPET LLFKFFNYLP LPELTSIVSL 

401 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM 

4 51 RSGGLGNLQS MTLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKHT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 



ml63/al63 99.4% identity in 660 aa overlap 



10 20 30 40 50 L 60 

ml 63. pep MVILTTLFFVCVLVVLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 

I I I I I IMIIIIIIM illlllll I ! I II III I MM II I I M II 

al63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 

!0 20 30 40 50 60 

70 80 90 100 110 120 

ml 63 . pep SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 



t*C17USVWU5IJ40 



464 



I I 1 1 1 1 1 1 1 1 i I M 1 1 1 M 1 1 i II i 1 1 1 1 1 1 1 I I i 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 | 1 1 1 | 

al63 SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 63 . pep QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 
I I i I I I M I I II I I I I I I I I I I I I | I I I | I M I I I I I I I i I I I I I I I I | I I I 1 I I | | M | 
al63 QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 63 .pep MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 
I I I I I M I I I I I I I I I I I I I I I I I | I | : | | | | | | | | | | | | | || | | | | | | | | | | | | | | | | | 
al63 MALLATFFGI ITTLGFGASQLGAGLQEIGWIAENS FSVQVLI IAAVMSLAWSAI SGVGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | 
al63 GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 63 . pep WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 
I I I I M I I I I I I I I I i I I I I I I I II I II I I I I I I I I I I M I I I I I i I I I I M I I I I I M I 
al63 WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 63 . pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 
I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II 
al63 WLNDGVAGGVLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 63 . pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 
I I I I I I M II I I I I I M I I I I I I I I I I I I I I I I II I I I I II I I | I | | | | | | M I I I I I I I 
al63 . ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml63 . pep WKGLSADKKYFETRVN PTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTASPAMHELQR 

I 1 I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I 11 I : I I I 1 I I I I I M 
al63 WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKHTAS PAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 63 . pep ELSEEYGLSVRVDKMFHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 
I I I I I I I I I I I I I I I I I: I I I I I I I I I I I I I I I I I I I I I I I i M I I I M I 1 I I I I 1 M I I 
al63 ELSEEYGLSVRVDKMFHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I M I I I I I I I I I I I I I I I M I I I I I I I I I 
al63 HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml 63 . pep 
al63 



X 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 679>: 

gl64.seq (partial) 

1 . .ATGAACACAT TTTTGAAAAA CAGCGAATAC GCGTATATCC TGAACGACTG 

51 CAAGGCGCGC TTCCTGTTCG CCTCGGCCGG CCTGTCAAAA GAATTGGCGG 

101 GCCTGAAGGC GCAAACGCCC GTCGAAAAAA TCATTTGGAC GGACAAAAGC 

151 CGGCCGGCCG GCGAAACGGC GGAAGGCGAT GCCTTTTTTG AAAACGTGCG 

201 CCGCTTCCCC GAAAAAGCCG ACTTGGGCCG CCAACCCCGG ATAAATGATT 

251 TGGCACACAT CATCTACACC TCCGGCACGA CGGGGCATCC CAAAGGCGCG 

301 CTAATCAGTT ACGCCAACCT GTTCGCCAAC CTGAACGGCA TCGAACGCAT 

351 CTTtaaAATT TCCAAACGCG ACCGCTTTAT CGTTTTCctg ccgatgTTCC 

401 ACAGCTTCAC GCTGACGGCT ATGGTGCTGC TGCCGATTTA TATGGCGTGT 

4 51 TCGATTATTT TGGTCAAAtc cgttttCCCc ttttccaacG TTTTGAAACA 

501 GGCCCTGCTC AAACGCGCAA CCGTGTTTTT GGGCGTACCC GCGATTTACA 

551 CCGCGATGAG CAAGGCAAAA ATCCCTTGGT ATTTCAGATG GTTCAACCGC 

601 ATCCGCCTGT TTATCAGCGG CGGCGCGCCT TTGGCGGAAC AAACCATCCT 

651 CGATTTTAAA GCCAAGTTCC CCCGCGCCAA ATTGCTGGAA GGCTACGGAC 

7 01 TGAGCGAAGC CTCGCCCGTC GTCGCCGTCA ATACGCCCGA ACGGCAAAAA 
751 GCCCGCAGCG TCGGCATCCC CCTGCCCGGT TTGGAAGCCA AAGCCGTCGA 

8 01 TGAAGAATTG GTCGAAGTGC CGCGCGGCGA AGTGGGCGAA CTGATCGTCA 
851 GGGGCGGTTC GGTGATGCGG GGCTACCTCA ATATGCCTGC CGCCACCGAT 
901 GAAACCATCG TCAACGGCTG GTTGAAAACG GGCGATTTCG TTACCATAGA 
951 CGAGGACGGC TTTATCTTTA TCGTCGACCG CAAAAAAGAT TTGATTATTT 

1001 CCAAAGGTCA AAACGTCTAT CCGCGCGAGA TCGAAGAAGA AATCCACAAA 

1051 CTCGATGCCG TCGAAGCCGC CGCCGTCATC GGCGTGAAAG ACCGTTATGC 

1101 CGACGAGGAA ATCGTCGCCT TCGTCCAATT GAAGGAAGGT ATGGATTTGG 

1151 GCGAGGACGA aatccgccgc caccTGCGTA CCGTGCTGGC AAATTTCAAA 

1201 ATCCCCAAAC AGATCCACTT TAAAGACGGG CTGCCGCGCA ACGCTACGGG 

1251 CAAAGTATTG AAACGGGTGC TGAAGGAGCA GTTTGAAGGA AACAAATGA 

This corresponds to the amino acid sequence <SEQ ID 680; ORF 164.ng>: 

gl64.pep (partial) 

1 . . MNTFLKNSEY AYILNDCKAR FLFASAGLSK ELAGLKAQTP VEKIIWTDKS 

51 R PAGET AEG D AFFENVRRFP EKPDLGRQPR INDLAHIIYT SGTTGHPKGA 

101 LISYANLFAN LNGIERIFKI SKRDRFIVFL PMFHSFTL TA MVLLPIYMAC 

151 SIILV KSVFP FSNVLKQALL KRATVFLGVP AIYTAMSKAK IPWYFRWFNR 

201 IRLFISGGAP LAEQTILDFK AKFPRAKLLE GYGLSEASPV VAVNTPERQK 

251 ARSVGIPLPG LEAKAVDEEL VEVPRGEVGE LIVRGGSVMR GYLNMPAATD 

301 ETIVNGWLKT GDFVTIDEDG FIFIVDRKKD LIISKGQNVY PREIEEEIHK 

.351 LDAVEAAAVI GVKDRYADEE IVAFVQLKEG MDLGEDEIRR HLRTVLANFK 

401 IPKQIHFKDG LPRNATGKVL KRVLKEQFEG NK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 68 1>: 

ml64 . seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTCGCCG CCGCCTGCCG 

51 CAAAAACGGA AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAG GCCGAAGCCG TCGCGGCGTA TCTGCAAAAT 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 AGAATTTATT ACCGCCTATT TCGCCATCTC CGCCATCGGC GCGGTCGCCG 

251 TACCGATGAA CACATTTTTG AAAAACAGCG AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGCTTG AAGGCGCAAA CGCCCGTCGA AAAAATCATT TGGACGGACA 

401 AAAGCCGTCC GACCGGCGAA ACGGCGGAAG GCGATGCCTT TTTTGAAGAC 

451 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

501 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CATCCCAAAG 

551 GCGCGCTAAT CAGTTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

601 CGCATCTTTA AAATTTCCAA GCGCGACCGC TTTATCGTTT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CGGCTATGGT GCTGCTGCCG ATTTATATGG 

701 CGTGTTCGAT TATTTTGGTC AAATCCGTTT TTCCGTTTTC CAACGTTTTG 

751 AAACAGACAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TACCCGCGAT 

801 TTACACCGCG ATGAGCAAGG CGAAAATCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATTCG CCTGTTTATC AGCGGCGGCG CGCCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGAAGGCTA 
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951 CGGACTGAGC GAAGCCTCTC CCGTCGTCGC CGTCAATACG CCCGAGAGGC 

1001 AAAAAGCCCG CAGCGTCGGC ATCCCCCTGC CCGGTTTGGA AGCCAAAGCC 

1051 GTCGATGAAG AATTGGTCGA AGTGCCGCGC GGCGAAGTGG GCGAACTGAT 

1101 CGTCAGGGGC GGTTCGGTGA TGCGGGGCTA CCTCAATATG CCTGCCGCCA 

1151 CCGATGAAAC CATCGTCAAC GGCTGGTTGA AAACGGGCGA TTTCGTTACC 

1201 ATAGACGAAG ACGGCTTTAT CTTTATCGTC GACCGCAAAA AAGATTTGAT 

1251 TATTTCCAAA GGTCAAAATG TCTATCCGCG CGAGATTGAA GAAGAAATCT 

1301 ACAAACTCGA TGCCGTCGAA GCCGCCGCCG TCATCGGCGT GAAAGACCGT 

1351 TATGCCGACG AGGAAATCGT CGCCTTCGTC CAATTGAAGG AAGGTATGGA 

14 01 TTTGGGCGAG AACGAAATCC GCCGCCACCT GCGTACCGTG CTGGCAAATT 

14 51 TCAAAATCCC CAAACAAATC CACTTTAAAG ACGGGCTGCC GCGCAACGCT 

1501 ACGGGCAAGG TATTGAAACG GGTGTTGAAG GAGCAGTTTG ACGGAAACAA 

1551 ATGA 

This corresponds to the amino acid sequence <SEQ ID 682; ORF 164>: 



ml64 .pep 




1 


MNRTYANFYE 


51 


IGVKFGDTVA 


101 


DCKARFLFAS 


151 


VRRFPEKPDL 


201 


RIFKISKRDR 


251 


KQTLLKRATV 


301 


ILDFKAKFPR 


351 


VDEELVEVPR 


401 


IDEDGFIFIV 


451 


YADEEIVAFV 


501 


TGKVLKRVLK 



FIVFLPMFHS FTL TAMVLLP IYMACSIILV KSVFPFSNVL 
FLGVPAIYTA MSKAKIPWYF RWFNRIRLFI SGGAPLAEQT 
AKLLEGYGLS E AS PWAVNT PERQKARSVG IPLPGLEAKA 
GEVGELIVRG GSVMRGYLNM PAATDETIVN GWLKTGDFVT 



Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae ml64 / gl64 98.6% identity in 432 aa 

overlap 



60 70 80 90 100 110 

ml64.pep GDTVALAVSNSTEFITAYFAISAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSK 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

gi64 mntflknseyayilndckarflfasagl.sk 

10 4 20 30 

120 130 140 150 160 170 

ml64 .pep ELAGLKAQTPVEKIIWTDKSRPTGETAEGDAFFEDVRRFPEKPDLGRQPRINDLAHIIYT 
I I I I I I M I I I I I t I I I I I I I I : | | | | | | | | | | | : | | | | | | | | j | | | | | | j | | | | | j | | | 
g!64 ELAGLKAQTPVEKIIWTDKSRPAGETAEGDAFFENVRRFPEKPDLGRQPRINDLAHIIYT 

40 50 60 70 80 90 

180 190 200 210 220 230 

ml64 .pep SGTTGHPKGALISY7VNLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMAC 

I I I I I I I I i I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I i I I M I | | | | | | j 

gl64 SGTTGHPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMAC 

100 110. 120 130 140 150 

240 250 260 270 280 290 

ml64 .pep SIILVKSVFPFSNVLKQTLLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAP 
I I I I I I I I I I I I ! I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | I j | | | | | | | | | | 
gl64 SIILVKSVFPFSNVLKQALLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAP 

160 170 180 190 200 210 

300 310 320 330 340 350 

ml 64 .pep LAEQTILDFKAKFPRAKLLEGYGLSEAS PWAVNT PERQKARSVG I PLPGLEAKAVDEEL 

I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I | | | | I | | ! | | | | | | | 
gl64 LAEQT I LDFKAKFPRAKLLEGYGLSEASPWAVNTPERQKARSVG I PLPGLEAKAVDEEL 

220 230 240 250 260 270 



360 370 380 390 400 410 
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ml 64 . pep VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 
M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | j | | 

g!64 VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 

280 290 300 310 320 330 

420 430 440 450 460 470 

ml 64 . pep LIISKGQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRR 
I I I I I I I I I M I I I I I I I : I I I I I I i I I I I I I I I I I I I I I I I I I I I I | | | | | | | | : | | | | 
g!64 LIISKGQNVYPREIEEEIHKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGEDEIRR 

340 350 360 370 380 390 

480 490 500 510 

ml64 . pep HLRTVLANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 

I I I I I I I I I i I I I I I I ! I I I I I I I I I I I | | I | I : 1 | | | 

gl64 HLRTVLANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFEGNKX 

400 410 420 430 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 683>- 



al64 . seq 
1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 



ATGAACCGGA 

CAAAAACGGC 

ACCGCGCGCT 

ATCGGCGTGA 

GGAATTTATT 

TACCGATGAA 

GACTGCAAGG 

GGCGGGCTTG 

AAAGCCGTCC 

GTGCGCCGCT 

TGATTTGGCA 

GTGCGCTAAT 

CGCATCTTTA 

GTTCCACAGC 

CGTGTTCGAT 

AAACAGGCAC 

TTACACCGCG 

ACCGCATCCG 

ATCCTCGATT 

CGGACTGAGC 

AAAAAGCCCG 

GTCGATGAAG 

CGTCAGGGGC 

CCGATGAAAC 

ATAGACGAAG 

TATTTCCAAA 

ACAAACTCGA 

TATGCCGACG 

TTTGGGCGAG 

TCAAAATCCC 

ACGGGCAAGG 

ATGA 



CTTATGCCAA 

AACGGCACGG 

CAAGCAGGAA 

AGTTCGGCGA 

ACCGCCTATT 

CACATTTTTG 

CGCGCTTCCT 

AAGGCGCAAA 

GGACGGCGAA 

TCCCCGAAAA 

CACATCATCT 

CAGCTACGCC 

AAATCTCCAA 

TTCACGCTGA 

TATTTTGGTC 

TGCTCAAACG 

ATGAGCAAGA 

CCTGTTTATC 

TCAAAGCCAA 

GAAGCCTCGC 

CAGCGTCGGC 

AATTGGTCGA 

GGTTCGGTGA 

CATCGTCAAC 

ACGGCTTTAT 

GGTCAAAATG 

TGCCGTCGAA 

AGGAAATCGT 

AACGAAATCC 

CAAACAAATC 

TATTGAAACG 



TTTCTACGAA 

CAGTGTTCGA 

GCCGAAGCCG 

CACGGTCGCG 

TCGCCGTATC 

AAAAACAGCG 

GTTCGCCTCG 

CGCCCGTCGA 

ATGGCGGAAG 

ACCCGACTTG 

ACACCTCCGG 

AACCTGTTCG 

GCGCGACCGC 

CGGCTATGGT 

AAATCCGTTT 

CGCGACCGTG 

CGAAAATCCC 

AGCGGCGGAG 

GTTCCCCCGC 

CCGTCGTCGC 

ATCCCCCTGC 

AGTGCCGCGC 

TGCGGGGCTA 

GGCTGGTTGA 

CTTTATCGTC 

TCTATCCGCG 

GCCGCCGCCG 

CGCCTTCGTC 

GCCGCCACCT 

CACTTTAAAG 

GGTGTTGAAG 



ATGCTGACCG 

CGGCAAGGAA 

TTGCGGCGTA 

CTGGCGGTTT 

CGCCATCGGC 

AATACGCGTA 

GCCGGCCTGT 

AAAAATCATT 

GCGATGCCTT 

GGCCGCCAAC 

CACGACGGGG 

CCAACCTGAA 

TTTATCGTTT 

GCTGCTGCCG 

TCCCCTTTTC 

TTTTTGGGCG 

TTGGTATTTC 

CACCTTTGGC 

GCCAAATTGC 

CGTCAATACG 

CCGGTTTGGA 

GGCGAAGTGG 

CCTCAATATG 

AAACGGGCGA 

GACCGCAAAA 

CGAAATCGAA 

TCATCGGCGT 

CAATTGAAGG 

GCGTACCGTG 

ACGGGCTGCC 

GAGCAGTTTG 



CCGCCTGCCG 

AAAACCGCCT 

TCTGCAAAAT 

CCAATTCCAC 

GCGGTTGCCG 

TATCCTGAAC 

CAAAAGAATT 

TGGACGGGCC 

TTTTGAAGAC 

CCCGGATAAA 

CATCCCAAAG 

CGGCATCGAA 

TCCTGCCGAT 

ATTTATATGG 

CAACGTTTTG 

TGCCCGCGAT 

AGATGGTTCA 

GGAACAAACC 

TGGAAGGCTA 

CCCGAGAGGC 

AGTCAAAGCC 

GCGAACTGAT 

CCTGCCGCCA 

TTTCGTTACC 

AAGATTTGAT 

GAAGAAATCT 

GAAAGACCGT 

AAGGTATGGA 

CTGGCAAATT 

GCGCAACGCT 

ACGGAAACAA 



This corresponds to the amino acid sequence <SEQ ID 684; ORF 164.a> 

al64 .pep 

MNRTYANFYE MLTAACRKNG NGTAVFDGKE KTAYRALKQE 
IGVKFGDTVA LAVSNSTE FI TAYFAVSAIG AVAVP MNTFL 
DCKARFLFAS AGLSKELAGL KAQTPVEKI I WTGQSRPDGE 
VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPKGALISYA 
RIFKISKRDR FIVFLPMFHS FTLTAMVLLP IYMACSIILV 



51 
101 
151 
201 
251 
301 
351 
401 
451 



KQALLKRATV FLGVPAIYTA MSKTKIPWYF RWFNRIRLFI 
ILDFKAKFPR AKLLEGYGLS EASPWAVNT PERQKARSVG 
VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAATDETIVN 
IDEDGFIFIV DRKKDLIISK GQNVYPREIE EEIYKLDAVE 
YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LANFKIPKQI 



AEAVAAYLQN 
KNSEYAYILN 
MAEGDAFFED 
NLFANLNGIE 
KSVFPFSNVL 
SGGAPLAEQT 
IPLPGLEVKA 
GWLKTGDFVT 
AAAVIGVKDR 
HFKDGLPRNA 
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501 TGKVLKRVLK EQFDGNK* 

ml64/al64 98.3% identity in 517 aa overlap 

10 20 30 40 50 60 

ml 64 . pep MNRTYANFYEMLAAACRKNGNGTAVFDGKEKTAYRALKQEAEAVAAYLQNIGVKFGDTVA 
I I I I I I I M I I I : I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I [ i | I | M I I I I I I I 
al64 MNRTYANFYEMLTAACRKNGNGTAVFDGKEKTAYRALKQEAEAVAAYLQNIGVKFGDTVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 64 .pep LAVSNSTEFITAYFAISAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSKELAGL 
M M I Ml IMMI1:| llllllllllll || MMMiMIM IMM | Ml IMM I M 
al64 LAVSNSTEFITAYFAVSAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSKELAGL 

70 80. 90 100 110 120 

130 140 150 160 170 180 

ml 64 .pep KAQTPVEKIIWTDKSRPTGETAEGDAFFE DVRRFPEKPDLGRQPRINDLAHIIYTSGTTG 

llllllllllll : I 1 I II I I I I I ! I I I I I | M I II I II I I I I I I I I I I II II M | | | 
al64 KAQTPVEKIIWTGQSRPDGEMAEGDAFFEDVRRFPEKPDLGRQPRINDLAHIIYTSGTTG 

130 140 150 160 170 180 

190 200 210 220 230 240 
ml64 . pep HPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMACSIILV 
I I I I I M I I II I I M I I I I I I I I I | M | | M | | | | | | | | | | | | | | | | | || M | | | 

al64 HPKGALISYANLFANLNGIERI FKI SKRDRFIVFLPMFHS FTLTAMVLLPI YMACS I ILV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 64 . pep KSVFPFSNVLKQTLLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAPLAEQT 
. I I I I I I I M I I I : I I I I I I I I I I I I I I I I II I I : I I I I I I I I I I I | II I I I I I I I I I I I I 
al64 KSVFPFSNVLKQALLKRATVFLGVPAIYTAMSKTKI PWYFRWFNRIRLFI SGGAPLAEQT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 64 . pep ILDFKAKFPRAKLLEGYGLSEASPWAVNTPERQKARSVGIPLPGLEAKAVDEELVEVPR 

I M I I I I M I I I I M I I I I M I I I I I ! I I I I I I I II I I | I | I | | | | | : | | | | | 

al64 ILDFKAKFPRAKLLEGYGLSEAS PWAVNT PERQKARSVGI PLPGLEVKAVDEELVEVPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml64 .pep GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKDLIISK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al64 GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKDLIISK 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 64 . pep GQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRRHLRTV 
I I I I I I I I I I I M I I I I I I I I M II I I I I I I I I I I I I I I I I I I I 1 I M I I | | | | | | | | | | 
al64 GQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRRHLRTV 

430 440 450 460 470 480 

490 500 510 

ml 64 . pep LAN FK I PKQ I HFKDGL PRN ATGKVLKRVLKEQFDGNKX 

I I I I I I I I I I I I I I I M I I I I I I I I I I I ! I I I I I I I I I 
al64 LANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 

490 500 510 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 685>: 

gl65 . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 
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51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctgGTCGC GGAAGGCAAG 

301 TTGGAagaCA ATTCCTTCAT CAATGCcgtg ccgcatatGT Ctttggtgat 

351 gAacgaagac cactgCCgtt acCTGCAAAA ACGCTATGAT GTGTTTAAAA 

4 01 CGCAGAAACT TTTTGAAAAT AT GG AATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGgacgaaA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG Aaggcacgga tgtcgATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CcTGCTGGgC gAaTTGCgtt aa 

This corresponds to the amino acid sequence <SEQ ID 686; ORF 165.ng>: 

gl65.pep 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALE SSN AWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI . 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 687>: 

ml65.seq (partial) 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

4 51 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGgTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCGCTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAA. . . 

This corresponds to the amino acid sequence <SEQ ID 688; ORF 165>: 

ml65.pep (partial) 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALE SSN AWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
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201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLCAGWA 

351 NMPLTK. . . 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m!65 / g!65 97.2% identity in 356 aa overlap 



10 20 30 40 50 60 

ml 65 . pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLE DVALESSNAWNNAGTGHSALC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl65 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLE DVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml65 . pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

I I I I I I I I ! : I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I M I 
gl65 ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml65 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 

II I I I I ! I I : I I I I I I I II I I I I I I I I I I I I I I I I : I I I I I I I 1 I II I I I 1 I I I I I I I I 
gl65 HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRGRDENQPVAANYSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 65. pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
gl65 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDWQLTLRTRFLFLGA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 65. pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I II M I I I ! M I I I I I I I I I I : I I I I I I I I II I I I I I I I I I I II I I II I ! I I I I I I 1 I I I 
gl65 GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 



310 320 330 340 350 

ml 65. pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTK 
I I I I I I I II I I I I I I I I I I I I I I I I M I : I I I I I I I I I I I I I I I I I I I I II I I I I 
gl65 DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 



gl65 ELRX 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 689>: 

al65 . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

4 01 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

4 51 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 
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601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

7 51 TCCGGCATCC CCGAAGGCAA AGGCTACGGT GGCTTTCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCACTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TTCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGTT GCAGTTTGGT ACGGAGATTG TCGCACACGC 

1251 CGACGGCTCG CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 - CCGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAACGCACC 

1351 CCGTCTTGGG AAGGCCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

14 01 GTTGAACGAA AACCCCGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

14 51 AAGTGTTGGA TATTTAA 

This corresponds to the amino acid sequence <SEQ ID 690; ORF 165.a>: 

al65 .pep 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SN AWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

4 01 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 

4 51 PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 

ml65/al65 99.7% identity in 356 aa overlap 

10 20 30 40 50 60 

ml 65. pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I II I I I I I I I I I I I I ! I I I I I I I I I I I 
al65 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml65 . pep . ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

al65 ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 65. pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAAN YSAEGTDVDF 

I I I I I I I I M I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
al65 HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 65 . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I I I I I I I I I I I I I I t I I I M I i I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I i I I f I 
al65 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLT LRTRFLFLGA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml65.pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I I I I M 
al65 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 
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310 320 330 340 350 

ml 65 . pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTK 

I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I 

al65 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

al65 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

370 380 390 400 410 420 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 69 1>: 

gl65-l . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctggTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAACGAAGAC CACTGCCGTT ACCTGCAAAA ACGCTATGAT GTGTTTAAAA 

401 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

4 51 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTtt gCCTCCCTGC TGgaatacta cccGaggcag acccGACGAc 

1151 tggtactcat cacgcaggnc acGCGTcata tcattanata tgactCgaaa 

1201 ctgcgcgtgc tgcagttgta cgagattgtg ccaCGCGacg ctcgctcgcg 

1251 cattctggag cgtcgcggcg catcacgctn tgcgctgata tccgctgatg 

1301 acactgctcc gaGCGcgccc gtcttggaaa gtgtctga 

This corresponds to the amino acid sequence <SEQ ID 692; ORF 165-l.ng>: 

gl65-l .pep 

1 MAEATD VVLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
201 ' NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPRQ TRRLVLITQX TRHIIXYDSK 

401 LRVLQLYEIV PRDARSRILE RRGASRXALI SADDTAPSAP VLESV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 693>: 

ml 65-1 . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 
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851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCGCTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TCCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGCT CCAGTTTGGT ACGGAGATTG TCGCCCACGC 

1251 CGACGGCTCA CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CTGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAGCGCGCC 

1351 CCGTCTTGGG AAGACCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

1401 GTTGAACGAA AACCCTGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

1451 AAGTATTGGA TATTTAA 



This corresponds to the amino acid sequence <SEQ ID 694; ORP 165-1>: 

xnl65-l .pep 

1 MAEATD VVLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLCAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERA 

451 PSWEDRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 



ml65-l/gl65-l 89.7% identity in 428 aa overlap 



10 20 30 40 50 60 

ml 65-1 .pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

I I I I I I f I I I! I I II MINIUM MMMMMMMMMI 

gl65-l MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml65-l.pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

MM :l:l:llllll I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I M I I I 

gl65-l ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 65~1 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAAN YSAEGTDVDF 
M M M I I I : I I M I I II I I M I M I I II II M II : I II I I I M M M M II I II I M I 
gl65-l HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRGRDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml65-l . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 

M I M I M 1 ( I I 1 ) I f I M M I I I II I I M I I I I I I 1 I 1 I MMMIMIMI 

gl65-l GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDWQLTLRTRFLFLGA 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 65-1. pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I I I M I M M I I I II II I M I : M M I I I II I II I I I I I M II I II I II I I M I M M II 
gl65-l GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml65-l .pep DTRNVDGKRH LMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 
I I M M II I M I M II I II I M I II I I I : M M II I II I II II I I M M I II II I I II I 
gl65-l DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml65-l .pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

I M M I M M M I M II I : : Ml I Ml I I I III: : I : 

gl65-l ELRKTKEERFASLLEYYPR-QTRRLVLITQXTR-HIIXYDS-KLRVLQLYEIVPRDARSR 

370 380 390 400 410 



430 440 450 460 470 480 
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ml 65-1. pep laallgaspgastavplmirlmhq'cfperapswedrlkelvpgygiklnenperadeiia 

: III 

g 1 6 5 - 1 I LERRGASRXAL I S ADDTAP SAP VLES VX 

420 430 440 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 695>: 

al65-l.seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGT GGCTTTCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCACTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TTCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGTT GCAGTTTGGT ACGGAGATTG TCGCACACGC 

1251 CGACGGCTCG CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CCGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAACGCACC 

1351 CCGTCTTGGG AAGGCCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

1401 GTTGAACGAA AACCCCGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

1451 AAGTGTTGGA TATTTAA 

This corresponds to the amino acid sequence <SEQ ID 696; ORF 165-l.a>: 

al65-l .pep 

1 MAEAT DWLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 

451 PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 

al65-l/ml65-l 99.4% identity in 488 aa overlap 

10 20 30 40 50 60 

al 65-1. pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I || I I I I I I I I 
ml 65-1 MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

al65-l.pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I II II I I I 
ml65-l ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

al 65-1. pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I | I I I I | | | | M 
ml 6 5 - 1 HCS YLQKRYDAFKTQKLFENMEFSTDRNKI S DWAPLMMRG RDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 
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a!65-l .pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I I 1 I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
ml65-l GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
190 200 210 220 230 240 

250 260 270 280 290 300 

al65-l . pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I I I I I I t I I I I I I I I I I II I I I I I 11 I I I I I I I I I |.| I I I I I I I I I I I I I I I I I I I | I I I 
ml 65-1 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 



310 320 330 340 350 360 

al65-l .pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I II I I I I I I I I 
ml65-l DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 

310 320 330 340 350 360 

370 380 390 400 410 420 

al 65~1 - pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
ml 65-1 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

370 380 390 400 410 420 



430 440 450 460 470 480 

a 1 65 - 1 . pep LAALLGAS PGASTAVPLMI RLMHQCFPERTPSWEGRLKELVPGYG I KLNENPERADEI I A 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 65-1 LAALLGAS PGASTAVPLMIRLMHQCFPERAPSWEDRLKELVPGYGIKLNENPERADEI I A 

430 440 450 460 470 480 



489 

a 1 6 5 - 1 . pep YTAKVLDIX 
I I II I I I I I 
ml65-l YTAKVLDIX 



al65-l/p33940 

sp|P33940|YOJH_ECOLI HYPOTHETICAL 60.2 KD PROTEIN IN ECO-ALKB INTERGENIC REGION 
>gi| 1736851 1 gnl I PID | dl016718 (D90850) ORF_ID: o372#5 ; similar to (SwissProt Accession Number 
P33940] [Escherichia coli] >gi 11788539 (AE000310) f548; This 548 aa ORF is 100 pet identical 
to 490 residues of YOJH_ECOLI SW: P33940 (492 aa) but contains 56 additional N-ter aa; 100 pet 
identical to GB: ECOHU49J33 

ACCESSION: U00008 (490 aa) but contains 58 aditional N-term resi... Length = 548 
Score = 458 bits (1167), Expect = e-128 

Identities = 233/490 (47%), Positives = 303/490 (61%), Gaps = 5/490 (1%) 



Query: 


3 


EATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALCEL 


62 






+ TDV+L+GGGIMSATLG L+ELEP W +T++ERLE VA ESSN WNNAGTGHSAL EL 




Sbjct: 


30 


. QETDVLLIGGGIMSATLGTYLRELEPEWSMTMVERLEGVAQESSNGWNNAGTGHSALMEL 


89 


Query: 


63 


NYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLED-NSFINAVPHMSLVMNEDH 


121 






NY P A+G I +A+ I E F +SRQFWA V G L SFIN VPHMS V ED+ 




Sbjct: 


90 


NYTPQNADGSISIEKAVAINEAFQISRQFWAHQVERGVLRTPRSFINTVPHMSFVWGEDN 


149 


Query: 


122 


CSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDFG 


181 






++L+ RY A + LF M +S D +1 +WAPL+M GRD Q VAA + GTDV++G 




Sbjct: 


150 


VNFLRARYAALQQSSLFRGMRYSEDHAQIKEWAPLVMEGRDPQQKVAATRTEIGTDVNYG 


209 


Query: 


182 


RLTRQMVKYLQGKG-VKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTXXXXXXXXXX 


240 






+TRQ++ LQ K + + V +KR D W + AD +N Q 




Sbjct: 


210 


EITRQLIASLQKKSNFSLQLSSEVRALKRNDDNTWTVTVADLKNGTAQ-NIRAKFVFIGA 


268 


Query: 


241 


XXXXXXXXQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 


300 






Q+SGIPE K Y GFPV G F + NP+ H AKVYG+ASVGAPPMSVPH+ 




Sbjct: 


269 


GGAALKLLQESGIPEAKDYAGFPVGGQFLVSENPDVVNHHLAKVYGKASVGAPPMSVPHI 


328 


Query: 


301 


DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 


360 






DTR +DGKR ++FGP+A F + FLK GSL DL S N+ PM+ GNL KYL+ 




Sbjct: 


329 


DTRVLDGKRVVLFGPFATFSTKFLKNGSLWDLMSSTTTSNVMPMMHVGLDNFDLVKYLVS 


388 


Query: 


361 


ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVXXXXXX 


420 






++ ++E+RF +L EYYP+A +DW L AGQRVQIIK+D+EKGGVL+ GTE+V 




Sbjct: 


389 


QVMLSEEDRFEALKEYYPQAKKEDWRLWQAGQRVQIIKRDAEKGGVLRLGTEVVSDQQGT 


448 


Query: 


421 


XXXXXXXXXXXXXXVPLMIRLMHQCFPER — TPSWEGRLKELVPGYGI KLNENPERADEI 


478 
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P+M+ L+ + F +R + P W+ LK +VP YG KLN + + 
Sbjct: 449 IAALLGASPGASTAAPIMLNLLEKVFGDRVSSPQWQATLKAIVPSYGRKLNGDVAATERE 508 

Query: 479 IAYTAKVLDI 488 

+ YT++VL + 
Sbjct: 509 LQYTSEVLGL 518 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 697>: 

g204 . seq 

1 atggcggcgg cggaaataaa acgccccctc gctgtcgatt tccagcacat 
51 agcgtccgtt ctgcacggcg gcatagccgc ttttgcctgc ctgatagggt 
101 tgcagggcgg aatgcgaaat caggtaatca gtcagtttgc cgccgtcttc 
151 ggcgatattg cccaccagtt tggcaaacaa ggtatggcac acgccgtttt 
201 ccgcccagcc cgaaggcgcg tcctttccgt cggtttccat acatttgccg 
251 acgacggctt ccaagtcgtt gggatgcttt ccggtcagcc ggacggcgtt 
301 ttgttccggc aagcctttaa tcggataact gatttgtttt ttgccgtcgt 
351 tggttttgcc ttcgctactt tgtcccaaag ccaaaccggc aatcgccgta 
401 ttgtcgatgt atttgacttt gaaaaccggt ttcggcgcgc tttgtgccgc 
451 attttgcggc tgttccgccg tattttcgga tttgccgcag gcggcaagca 

501 gcaggcagcc gcccaacacg gcaaaaggta ttttcagcat tccgcactcc 
551 tgatggtttc aaaatgccgt ctgaaatgcc gtctgaaacg tggcaggcgg 

601 aggttcggac ggcattgggt ttatttcaac gggcggatgc cgaccgcatc 

651 gcgtacttta tccaacaatt cgcgcgcttc tttgcgcgct ttttgcgcgc 

701 ctgcctgcaa aatctcttcg atttgcgaag gattagaggt caatgcgttg 

751 tag 

This corresponds to the amino acid sequence <SEQ ID 698; ORF 204.ng>: 

g204 .pep 

1 MAAAEIKRPL AVDFQ HIASV LHGGIAAFAC LIG LQGGMRN QVISQFAAVF 

51 GDIAHQFGKQ GMAHAVFRPA RRRVLSVGFH TFADDGFQW GMLSGQPDGV 

101 LFRQAFNR IT DLFFAWGFA FATL SQSQTG NRRIVDVFDF ENRFRRALCR 

151 ILRLFRRIFG FAAGGKQQAA AQHGKRYFQH SALLMVSKCR LKCRLKRGRR 

201 RFGRHWVYFN GRMPTASRTL SNNSRASLRA FCAPACKISS I CEGLEVNAL 

251 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 699>: 

m204 . seq 

1 ATGGCGGCGG CGGAAATAAA ACGCCCCTTC GCTGTCGATT TCCAGCACAT 

51 AGCGTCCGTT CTGCACGGCG GCATAGCCGC TTTTGCCTGC CTGATAGGGT 

101 TGCAGGGCGG CATGCGAAAC TAGGTAATCC GTCAGTTTGC CGCCGTCTTC 

151 GGCGATATTG CCCACCAGTT TGGCAAACAA GGTATGGCAC ACGCCGTTTT 

2 01 CTGCCCAACC TGCCGGACTG TCCTTATCAT CGGTTTCCAT ACATTTGCCG 
251 CTGACGGCTT CCAAGTCGCC GGGATGCTTG CCGATCAGTC GGATAACATT 
301 TTGTTCCGGC AAGCCTTTAA TCGGATAACT GATTTGTTTT TTGCCGTCGT 

3 51 TGGTTTTGCC TTCGCTGCTT TGTCCCAAAT CCAAACCGGC AATCGCCGTA 

4 01 TTGTCGATAT ATATGACTTT GAAAACCGGT TTCGGCGCGC TTTGTACCGC 
451 GTTTTGCGGC TGTACCGCCG TATTTwCGGA TTTGCCGCaC GGCaArGCAG 

5 01 CAGGCAGCCG CCCAATACGG CAAAArAwGT wTTCAGCATT CCACAyTCCT 

5 51 GATGGTTTCA AAATGCCGTC TGAAACGCGG CAGGCGGAGG TTCGGACGGC 
601 ATCGGGTTCA TTTCAACGGG CGGATGcCGA CCGCATCgGT ACTTTGTCCA 

6 51 ATAATTCGCG TGCTTCTTTA CGCGCTTTCG CCGCGCCTGC CTGCAAAATC 
701 TCTTCGATTT GCGAAGGGTC GGCGGTCAGC TCGTTGTAG 

This corresponds to the amino acid sequence <SEQ ID 700; ORF 204>: 

m204 .pep 

1 MAAAEIKRPF AVDFQ HIASV LHGGIAAFAC LIG LQGGMRN *VIRQFAAVF 

51 GDIAHQFGKQ GMAHAVFCPT CRTVLIIGFH TFAADGFQVA GMLADQSDNI 

101 LFRQAFNR IT DLFFAWGFA FAAL SQIQTG NRRIVDIYDF ENRFRRALYR 

151 VLRLYRRIXG FAATAXQQAA AQYGKXXXQH STXLMVSKCR LKRGRRRFGR 

201 HRVHFNGRMP TASGTLSNNS RASLRAFAAP ACKISSICEG SAVSSL* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 204 shows 82.0% identity over a 250 aa overlap with a predicted ORF (ORF 204.ng) 
from N. gonorrhoeae: 

m204/g204 

10 20 30 40 50 60 

m2 04 . pep MAAAEIKRPFAVDFQHIASVLHGGIAAFACLIGLQGGMRNXVIRQFAAVFGDI AHQFGKQ 

I M i 1 1 1 1 hi M I Mil III 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II IIIIIIIIMIIIIII 

g204 MAAAE I KRPLAVDFQHI AS VLHGGI AAFACL I GLQGGMRNQVI SQFAAVFGD I AHQ FGKQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

m204 .pep GMAHAVFCPTCRTVLIIGFHTFAADGFQVAGMLADQSDNILFRQAFNRITDLFFAWGFA 

Ml I III 1= I II MM IN I Ml hill: I hMIIIIII I lllllllll II I 

g204 GMAHAVFRPARRRVLSVGFHTFADDGFQWGMLSGQPDGVLFRQAFNRITDLFFAWGFA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 04 . pep FAALSQIQTGNRRIVDIYDFENRFRRALYRVLRLYRRIXGFAATAXQQAAAQYGKXXXQH 

I Ml II II III I I I MM II II II II I hi I hi II MM = lllllhll II 
g2 04 FATLSQSQTGNRRI VDVFDFENRFRRALCRI LRLFRRI FGFAAGGKQQAAAQHGKRYFQH 

130 140 150 160 170 180 

190 200 210 220 230 

m204 . pep STXLMVS KCRLK RGRRRFGRHRVHFNGRMPTASGTLSNNSRASLRAFAAPACKISS 

h IIMIIIII 1 1 1 1 1 1 1 M hlllllllM MIMMMMM IIMIIII 

g 2 0 4 SALLMVS KCRLKCRLKRGRRRFGRHWVYFNGRMPT AS RTLSNNS RAS LRAFCAP ACKI S S 

190 200 210 220 230 240 . 



m204 . pep 
g204 



240 
ICEGSAVSSLX 

MM hh 

ICEGLEVNAL 
250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 701 >: 

a204 . seq 

1 ATGGCGGCGG CGGAAATAAA ACGCCCCCTC GCTGTCGATT TCCAGCACAT 

51 AGCGTCCGTT CTGCACGGCG GCATAGCCGC TTTTGCCTGC CTGATAGGGT 

101 TGCAGGGCGG AATGCGAAAT CAGGTAATCC GTCAGTTTGC CGCCGTCTTC 

151 GGCGATATTG CCCACCAGTT TGGCAAACAA GGTATGGCAC ACGCCGTTTG 

201 CCGCCCAGCC CGAAGGCGCG CCCTTTCCGT CGGTTTCCAT ACATTTGCCG 

251 ACGACGGCTT CCAAGTCGTT GGGATGCTTG CCGGTCAGCC GGACGACGTT 

301 TTGTTCCGGC AAGCCTTT 

351 

401 

451 ; 

501 

551 AAGAG GTTCGGACGG 

601 CATTGGGTTT ATTTCAACGG GCGGATACCG ACCGCATCAC GTACTTTGCC 

651 CAATAATTCG CGTGCTTCTT TACGCGCTTT TTGCGCGCCT GCCTGCAAAA 

701 TCTCTTCGAT TTGCGAAGGG TCGGCGGTCA GCTCGTTGTA G 

This corresponds to the amino acid sequence <SEQ ID 702; ORF 204.a>: 

a204 .pep 

1 MAAAE I KR PL AVPFQH IASV LHGGIAAFAC LIG LQGGMRN QVIRQFAAVF 

51 GDIAHQFGKQ GMAHAVCRPA RRRALSVGFH TFADDGFQVV GMLAGQPDDV 

101 LFRQAF 

151 KRFGR 

201 HWVYFNGRIP TASRTLPNNS RASLRAFCAP ACKISSICEG SAVSSL* 



PCT/US99/U9346 



478 



m204/a204 54.5% identity in 246 aa overlap 

10 20 30 



40 



50 



60 



m204.pep MAAAEIKRPFAVDFQHIASVLHGGIAAFACLIGLQGGMRNXVIRQFAAVFGDIAHQFGKQ 
I I I I I I I I I: I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a204 MAAAEIKRPLAVDFQHIASVLHGGIAAFACLIGLQGGMRNQVIRQFAAVFGDIAHQFGKQ 
10 20 30 40 50 60 

70 80 90 100 110 120 

m204 . pep GMAH AVFC PTCRTVL 1 1 GFHTFAADGFQVAGMLADQS DN I LFRQAFNRI TDLFFA VVGFA 

I I I I I I I : I : I : I I I I t I I I I I t : I I I 1 I I : : I I I I I I 
a2 0 4 GMAHAVCRPARRRALS VGFHTFADDGFQVVGMLAGQPDDVLFRQAF 

70 80 90 100 



m204 .pep 
a204 



130 140 150 160 170 180 

FAALSQIQTGNRRIVDIYDFENRFRRALYRVLRLYRRIXGFAATAXQQAAAQYGKXXXQH 



190 200 210 220 230 240 

m204 .pep STXLMVSKCRLKRGRRRFGRHRVHFNGRMPTASGTLSNNSRASLRAFAAPACKISSICEG 

: I I I I I I : I I I I : I I 1 I II I I I I 1 I I I I I I I I I 1 I I I I I I I 

a204 KRFGRHWVYFNGRIPTASRTLPNNSRASLRAFCAPACKISSICEG 

110 120 130 140 150 



m204 .pep 



a204 



SAVSSLX 
I I I I I I I 
SAVSSLX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 703>: 

g205.seq 

1 atgctgaaaa taccttttgc cgtgttgggc ggctgcctgc tgcttgccgc 

51 ctgcggcaaa tccgaaaata cggcggaaca gccgcaaaat gcggcacaaa 

101 gcgcgccgaa accggttttc aaagtcaaat acatcgacaa tacggcgatt 

151 gccggtttgg ctttgggaca aagtagcgaa ggcaaaacca acgacggcaa 

201 aaaacaaatc agttatccga ttaaaggctt gccggaacaa aacgccgtcc 

251 ggctgaccgg aaagcatccc aacgacttgg aagccgtcgt cggcaaatgt 

3 01 atggaaaccg acggaaagga cgcgccttcg ggctgggcgg aaaacggcgt 

3 51 gtgccatacc ttgtttgcca aactggtggg caatatcgcc gaagacggcg 

401 gcaaactgac tgattacctg atttcgcatt ccgccctgca accctatcag 

451 gcaggcaaaa gcggctatgc cgccgtgcag aacggacgct atgtgctgga 

501 aatcgacagc gagggggcgt tttatttccg ccgccgccat tattga 

This corresponds to the amino acid sequence <SEQ ID 704; ORF 205. ng>: 

g205 .pep 

1 MLKIPFAVLG GCLLLA ACGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 

51 AGLALGQSSE GKTNDGKKQI SYPIKGLPEQ NAVRLTGKHP NDLEAWGKC 

101 METDGKDAPS GWAENGVCHT LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ 

151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 705>: 

m205 . seq 

1 ATGCTGAAwA CwTyTTTTGC CGTATTGGGC GGCTGCCTGC TGCyTtGCCG 

51 tGCGGCAAAT CCGwAAATAC GGCGGTACAG CCGCAAAACG CGGTACAAAG 

101 CGCGCCGAAA CCGGTTTTCA AAGTCATATA TATCGACAAT ACGGCGATTG 

151 CCGGTTTGGA TTTGGGACAA AGCAGCGAAG GCAAAACCAA CGACGGCAAA 

2 01 AAACAAATCA GTTATCCGAT TAAAGGCTTG CCGGAACAAA ATGTTATCCG 
251 ACTGATCGGC AAGCATCCCG GCGACTTGGA AGCCGTCAGC GGCAAATGTA 

3 01 TGGAAACCGA TGATAAGGAC AGTCCGGCAG GTTGGGCAGA AAACGGCGTG 
351 TGCCATACCT TGTTTGCCAA ACTGGTGGGC AATATCGCCG AAGACGGCGG 

4 01 CAAACTGACG GATTACCTAG TTTCGCATGC CGCCCTGCAA CCCTATCAGG 
451 CAGGCAAAAG CGGCTATGCC GCCGTGCAGA ACGGACGCTA TGTGCTGGAA 

5 01 ATCGACAGCG AAGGGGCGTT TTATTTCCGC CGCCGCCATT ATTGA 
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This corresponds to the amino acid sequence <SEQ ID 706; ORF 205>: 

m2 05 .pep 

1 MLXTXFAVLG GCLLX CRCGK SXNTAVQPQN AVQSAPKPVF KVIYIDNTAI 

51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKHP GDLEAVSGKC 

101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 

151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 205 shows 88.4% identity over a 181 aa overlap with a predicted ORF (ORF 205.ng) 
from N. gonorrhoeae: 

m205/g205 

10 20 30 40 50 60 

m2 05 . pep MLXTXFAVLGGCLLXCRCGKSXNTAVQPQNAVQSAPKPVFKVI YIDNTAI AGLDLGQSSE 

II II I MM 1 1 MM III II 1 1 h 1 1 1 1 1 M 1 1 1 M I M M M I MUM 

g2 05 MLKIPFAVLGGCLLLAACGKSENTAEQPQNAAQSAPKPVFKVKYIDNTAIAGLALGQSSE 

10 20 30 40 50 60 



70 80 90 100 110 120 

m2 05 . pep GKTNDGKKQI S YP I KGLPEQNVI RLI GKHPGDLE AVSGKCMETDDKDS PAGWAENGVCHT 

I I ! I II I I I I M ! I I I Ml I l-l I lllhlllll lllllll Ihhilllllllil 
g205 GKTNDGKKQI SYPIKGLPEQNAVRLTGKHPNDLEAWGKCMETDGKDAPSGWAENGVCHT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m2 05 . pep LFAKLVGN I AEDGGKLTD YLVS HAALQP YQAG KS G YAAVQNGR YVLE IDS EGAF Y FRRRH 

h 1 1 M 1 1 1 1 1 1 1 1 1 1 ! 1 1 hlhl II M 1 1 II 1 1 1 1 1 1 II ! 1 1 h III II 1 1 1 1 1 1 1 1 1 

g205 LFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNGRYVLEIDSEGAFYFRRRH 

130 140 150 160 170 180 



m205.pep YX 
I 

g205 Y 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 707>: 

a205.seq (partial) 

1 TCCGAACCTC TTAAAGGCTT GCCGGAACAA AACGTCGTCC GGCTGACCGG 

51 CAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT ATGGAAACCG 

101 ACGGAAAGGG CGCGCCTTCG GGCTGGGCGG CAAACGGCGT GTGCCATACC 

151 TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG GCAAACTGAC 

201 GGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG GCAGGCAAAA 

251 GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA AATCGACAGC 

301 GAGGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 708; ORF 205. a>: 

a205.pep (partial) 

1 SEPLKGLPEQ NVVRLTGKHP NDLEAVVGKC METDGKGAPS GWAANGVCHT 

51 LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ AGKSGYAAVQ NGRYVLEIDS 

101 EGAFYFRRRH Y* 

m205/a205 88.3% identity in 1 1 1 aa overlap 

50 60 70 80 90 100 

m205.pep KVIYIDNTAIAGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKC 

i I : I I I I I I I I : I I I I I I : I I i I I I II 
a205 SEPLKGLPEQNWRLTGKHPNDLEAWGKC 

10 20 30 



m205 . pep 



110 120 130 140 150 160 

MET DDKDSPAGWAENGVCHTLFAKLVGN I AEDGGKLTDYLVS HAALQP YQAGKSGYAAVQ 



WO 99/57280 



PCT/US99/09346 



480 



I I I I I : I : ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I 
a205 METDGKGAPSGWAANGVCHTLFAKLVGNIAE DGGKLTDYLISHSALQPYQAGKSGYAAVQ 

40 50 60 70 80 90 



170 180 
m2 0 5 . pep NGRYVLE I DSEGAFYFRRRHYX 

I I I I I I I I I I I I I I I I I I I I I I 
a 2 0 5 NGRYVLE I DSEGAFY FRRRHYX 

100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 709>: 

g205-l.seq (partial) 

1 ATGCTGAAAA TAcCTTTTGC CGTGTTGGGC GGCTGCCTGC TGCTTGCCGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGGAACA GCCGCAAAAT GCGGCACAAA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT ACATCGACAA TACGGCGATT 

151 GCCGGTTTGG CTTTGGGACA AAGTAGCGAA GGCAAAACCA ACGACGGCAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT GCCGGAACAA AACGCCGTCC 

251 GGCTGACCGG AAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT 

301 ATGGAAACCG ACGGAAAGGA CGCGCCTTCG GGCTGGGCGG AAAACGGCGT 

351 GTGCCATACC TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG 

4 01 GCAAACTGAC TGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG 

451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA 

501 AATCGACAGC GAGGGGGCGT TTTA 

This corresponds to the amino acid sequence <SEQ ID 710; ORF 205-l.ng>: 

g205-l.pep (partial) . 

1 MLKI PFAVLG GCLLLAA CGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 
51 AGLALGQSSE GKTNDGKKQI SYPIKGLPEQ NAVRLTGKHP NDLEAWGKC 
101 METDGKDAPS GWAENGVCHT LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ 
151 AGKSGYAAVQ NGRYVLEIDS EGAF 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71 1>: 

m205-l.seq. . 

1 ATGCTGAAAA CATCTTTTGC CGTATTGGGC GGCTGCCTGC TGCTTGCCGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGGAACA GCCGCAAAAC GCGGTACAAA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT ATATCGACAA TACGGCGATT 

151 GCCGGTTTGG ATTTGG GAGA AAGCAGCGAA GGCAAAACCA ACGACGGCAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT GCCGGAACAA AATGTTATCC 

251 GACTGATCGG CAAGCATCCC GGCGACTTGG AAGCCGTCAG CGGCAAATGT 

301 ATGGAAACCG ATGATAAGGA CAGTCCGGCA GGTTGGGCAG AAAACGGCGT 

3 51 GTGCCATACC TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG 

4 01 GCAAACTGAC GGATTACCTA GTTTCGCATG CCGCCCTGCA ACCCTATCAG 
451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA 
501 AATCGACAGC GAAGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 



This corresponds to the amino acid sequence <SEQ ID 712; ORF 205-l>: 

m2 05-1. pep 

1 MLKTSFAVLG GCLLLAA CGK SENTAEQPQN AVQSAPKPVF KVKYIDNTAI 
51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKHP GDLEAVSGKC 
101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 
151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y* 

m205-l/g205-l 92.0% identity in 174 aa overlap 



10 20 30 40 50 60 

g2 0 5 - 1 . pep MLKI PFAVLGGCLLLAACGKSENTAEQPQNAAQSAPKPVFKVKY I DNT A I AGLALGQSSE 

iii ii 1 1 iii 1 1 ii ii 1 1 Mill in 1 1 hin i ii inn inn inn hum 

m205-l MLKTSFAVLGGCLLLAACGKSENTAEQPQNAVQSAPKPVFKVKY I DNTAI AGLDLGQSSE 

10 20 30 40 50 60 



70 80 90 100 110 120 

g2 05 - 1 . pep GKTNDGKKQI SYPIKGLPEQNAVRLTGKHPNDLEAWGKCMETDGKDAPSGWAENGVCHT 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ^ i I MINIMI MMMI IhhMIMMIII 

m205-l GKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKCMETDDKDSPAGWAENGVCHT 
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70 



80 



90 



100 



110 



120 



g205-l .pep 



m205-l 



130 140 150 160 170 

LFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNGRYVLEIDSEGAF 

N I III M 1 1 1 1 IN Ml I Ml hill MUM Mill I MM I Ml 1 1 1 1 1 1 1 

LFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNGRYVLEIDSEGAFYFRRRH 
130 140 150 160 170 180 



m205-l 



YX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71 3>: 

a205-l.seq (partial) 

1 CCTCTTAAAG GCTTGCCGGA ACAAAACGTC GTCCGGCTGA CCGGCAAGCA 

51 TCCCAACGAC TTGGAAGCCG TCGTCGGCAA ATGTATGGAA ACCGACGGAA 

101 AGGGCGCGCC TTCGGGCTGG GCGGCAAACG GCGTGTGCCA TACCTTGTTT 

151 GCCAAACTGG TGGGCAATAT CGCCGAAGAC GGCGGCAAAC TGACGGATTA 

2 01 CCTGATTTCG CATTCCGCCC TGCAACCCTA TCAGGCAGGC AAAAGCGGCT 

2 51 ATGCCGCCGT GCAGAACGGA CGCTATGTGC TGGAAATCGA CAGCGAGGGG 

301 GCGTTTTATT TCCGCCGCCG CCATTATTGA 

This corresponds to the amino acid sequence <SEQ ID 714; ORF 205-1. a>: 

a205-l.pep (partial) 

1 PLKGLPEQNV VRLTGKHPND LEAWGKCME TDGKGAPSGW AANGVCHTLF 
51 AKLVGNIAED GGKLTDYLIS HSALQPYQAG KSGYAAVQNG RYVLEIDSEG 
101 AFYFRRRHY* 

m205-l/a205-l 89.0% identity in 109 aa overlap 

50 60 70 80 90 100 

m205-l pep KYIDNTAIAGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKCME 

^ I M MM M Ml I HUM llll II Ml 

a2 o 5 - 1 PLKGLP EQNWRLTGKH PNDLE AWG KCME 

10 20 30 

110 120 130 140 150 160 

m2 0 5 - 1 . pep TDDKDSPAGWAENGVCHTLFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNG 

II I MMM II I II II I MM I II III III llllhl MM I MM MM Ml MM 

a205-l TDGKGAPSGWAANGVCHTLFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNG 
40 50 60 70 80 90 



m205-l .pep 



a205-l 



170 180 
RYVLEIDSEGAFYFRRRHYX 

MIMMMMIMIIMM 

RYVLEIDSEGAFYFRRRHYX 
100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 715>: 



g206 



seq 
1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 



atgttttccc ccgacaaaac ccttttcctc tgtctcggcg 

cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg 

agacagtccg gcaaatccaa gccgtccgca tcagccacat 

caaggctcgc aggaactcat gctccacagc ctcggactca 

ctacaaatgg ggcggcagca gcaccgcaac cggcttcgac 

tgattcaatt ggtttacaaa aacgccctca acgtcaagct 

gcccgcgaca tggcggcggc aagccgcaaa atccccgaca 

ggccggcgac atcgtattct tcaacaccgg cggcgcacac 

acgtcggact ctacatcggc aacggcgaat tcatccatgc 

ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt 

ctaccttgga gcgcatacgt tttttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 716; ORF 

g206 .pep 

1 MFSPDKTLFL CLGALLLASC GTTSGKHRQP KPKQTVRQIQ 



cactgctcct 
aaacccaaac 
cggccgcaca 
tcggcacgcc 
tgcagcggca 
gccgcgcacc 
gccgcctcaa 
cgctactcac 
ccccggcagc 
acgccaaaaa 

206.ng>: 

AVRISHIGRT 
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51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 
151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 717>: 



m206 . seq 








1 


ATGTTTCCCC 


CCGACAAAAC 


CCTTTTCCTC 


51 


CGCCTCATGC 


GGCACGACCT 


CCGGCAAACA 


101 


AGACAGTCCG 


GCAAATCCAA 


GCCGTCCGCA 


151 


CAAGGCTCGC 


AGGAACTCAT 


GCTCCACAGC 


201 


CTACAAATGG 


GGCGGCAGCA 


GCACCGCAAC 


251 


TGATTCAATT 


CGTTTACAAr 


AACGCCCTCA 


301 


GCCCGCGACA 


TGGCGGCGGC 


AAGCCGsAAA 


351 


GGCCGGCGAC 


CTCGTATTCT 


TCAACACCGG 


401 


ACGTCGGACT 


CTACATCGGC 


AACGGCGAAT 


451 


GGCAAAACCA 


TCAAAACCGA 


AAAACTCTCC 


501 


CTACCTCGGC 


GCACATACTT 


TTTTTACAGA 



This corresponds to the amino acid sequence <SEQ ID 718; ORF 206>: 

m206 .pep. . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 
51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 
151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from TV. gonorrhoeae: 

m206/g206 



10 20 30 40 50 60 

m206 . pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

I! I II M ! 1 1 hi M I M 1 1 1 1 II IN 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 i I 1 1 1 II 1 1 1 II I 

g206 MFSPDKTLFLCLGALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIGRTQGSQELMLHS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m206 . pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II h 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g206 LGLIGTPYKWGGSSTATGFDCSGMIQLVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 

70 80 90 100 110 120 



130 140 150 160 170 

m206 .pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g206 IVFFNTGGAHRYSHVGLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE 

130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 719>: 

a206 . seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 
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This corresponds to the amino acid sequence <SEQ ID 720; ORF 206.a>: 

a206.pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 99.4% identity in 177 aa overlap 

10 20 30 40 50 60 

m206.pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQ^VRISHIDRTQGSQELMLHS 
I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m206.pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I MM 
a206 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

m206 . pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
M I M M M M M I I M I M I M M M M M M I M M M M I M M I II I M M M I 
a206 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 72 1>: 

g209.seq 

1 atgctgcggc atttaggaaa cgacttcgcc ttgggcgcgt tgtttttcga 

51 tgctgcggtt gatgtgccac tgctgggcga tggtcaggag gttgttgacc 

101 acccagtaga gaaccaaacc ggcagggaag aagaagaaca tgacggagaa 

151 aaccaacggc atgattttca tcattttcgc ctgcatcggg tcggtcggcg 

201 gcgggttcag ataggtttgg gcgaacatcg ttgccgccat aatgatgggc 

251 aggatgtagt aggggtcggc gcggctgagg tcggtaatcc agcccagcca 

301 aggtgcctgg cgcaattcta cggaggcgaa caatgcccag tacaagccga 

351 tgaagacggg gatttgcaac agcataggca gacagccgcc cagcgggttg 

401 atttcctcgt cttcgaaaag ctgcatcatc gcttgctgtt gcgccatacg 

451 gtcgtcgccg tatttttctt tgatggtctg cagttcgggt gcggcggcac 

501 gcattttcgc catcgaacgg taggaggcgt tggtcaatgg atacagtacg 

551 gctttgacga tgatggtcaa aacgacgatt gcccagcccc agttgccgat 

601 aatgttgtgc agttggttca ggagccagaa gagcggcgat gcgaaccagt 

651 gtactttacc gtagtctttt gccagttgca ggttgtcggc gatgtttgcg 

701 ataacggatg tggtttgcgg accggcatac aggttgaccg ccattttcgg 

751 ttttggcccc cgggttggga tagcggttaa 

This corresponds to the amino acid sequence <SEQ ID 722; ORF 209.ng>: 

g209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE WDHPVENQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDVVGVG AAEVGNPAQP 

101 RCLAQFYGGE QCPVQADEDG DLQQHRQTAA QRVDFLVFEK LHHRLLLRHT 

151 VVAVFFFDGL QFGCGGTHFR HRTVGGVGQW IQYGFDDDGQ NDDCPAPVAD 

201 NWQLVQEPE ERRCEPVYFT WFCQLQVVG DVCDNGCGLR TGIQVDRHFR 

251 FWPPGWDSG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 723>: 

m209.seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGgGCGTT GTTTTTCGAT 

51 GCTGCGGTTG ATGTGCCATT GCTGGGCGAT GGTCAGGAGG TTGTTGACTA 

101 CCCAGTACAA TACCAGACCG GCAGGGAAGA AGAAGAACAT GACGGAGAAA 

151 ACCAACGGCA TGATTTTCAT CATTTTCGCC TGCATCGGGT CGGTCGGCGG 

201 CGGGTTCAGA TAAGTTTGGG CGAACATCGT TGCCGCCATA ATGATGGGCA 

251 GGATGTAGTA GGGGTCGGCG CGGCTGAGGT CGGTAATCCA ACCCAGCCAA 
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301 GGTGCCTGGC GCAATTCTAC GGAGGCGAAC AATGCCCAAT ACAATCCGAT 

351 GAAGACGGGG ATTTGCAACA GCATAGGCAG GCAGCCGCCC AGCGGGTTGA 

4 01 TTTTCTCGTC TGTGTAAAGC TGCATCATCG CCTGTTGTTG CGCCATACGG 

4 51 TCGTCGCCGT ATTTCTCTTT GATGGCTTGC AGTTTGGGTG CGGCGGCACG 

501 CATTTTCGCC ATAGAGCGGT AAGAGGCGTT GGTCAATGGA TACAGTACGG 

551 CTTTGACGAT GATGGTTAAA ACGATAATCG CCCAGCCCCA GTTGCCGATG 

601 ATGTTGTGCA GTTGGTTCAG GAGCCAGAAG AGCGGGGAGG CGAACCAGTG 

651 TACTTTGCCG TAGTCTTTGG CCAGTTGCAG GTTGTCGGCG ATGTTTGCGA 

701 TGACGGATGT GGTCTGCGGG CCGGCGTAGA GGTTGATGGA GGCTTCGgTT 

751 TCGCGCCGTT TTGGATGGCG GCTAAAGGCA CGCTGACGCT GGTGCTGTAC 

801 AGCTTGTCGT TGCGGCGTTT GATGTCGATG TTGCACTCGC CTGCGGCGCA 

851 AACGCTTTGT CTGCCTTTAG GTTGGAGAAT CCAGGTGGAC ATGAAGTGGT 

901 GTTCAATCAT GCCGAGCCAG CCGGTCGGGG TTTTGCGGAT GTATTCGGCC 

951 TCGGATTTGC CGGATTTGGC ATCGTCGTCC AAGTCGGAAA AGCTGACTTT 

1001 TTGGAAGTTG CCTTCAGGGG TATAA 

This corresponds to the amino acid sequence <SEQ ID 724; ORF 209>: 

m209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE VVDYPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ ISLGEHRCRH NDGQDVVGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHRLLLRHT 

151 WAVFLFDGL QFGCGGTHFR HRAVRGVGQW IQYGFDDDG* NDNRPAPVAD 

201 DWQLVQEPE ERGGEPVYFA VVFGQLQVVG DVCDDGCGLR AGVEVDGGFG 

251 FAPFWMAAKG TLTLVLYSLS LRRLMSMLHS PAAQTLCLPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 209 shows 88.5% identity over a 253 aa overlap with a predicted ORF (ORF 209.ng) 
from N. gonorrhoeae: 

m209/g209 

10 20 30 40 50 60 

m209 . pep MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDYPVQYQTGREEEEHDGENQRHDFHHFR 
I I I I I I I I I I I I I i I I I I I I I I I I I I I I I i I I I : I I : I I I I I I I I I I I I I I I I I I I I I I 
g209 MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDHPVENQTGREEEEHDGENQRHDFHHFR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m209.pep LHRVGRRRVQISLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 
I I I I I I I I I I I : II I I I I I I I I II I I I I II I I I ! I I I : I I I I I I I I I I I I I I I : I : I I I I 
g209 LHRVGRRRVQ I GLGEHRCRHNDGQDWGVGAAEVGNPAQPRCLAQFYGGEQCPVQADEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 0 9 . pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHTVVAVFLFDGLQFGCGGTHFRHRAVRGVGQW 
I I I I I I I : ! I I I I I I I I I I I I I I I I I I I I I I i I : I I I I I I I I I I I I I I I I : I I I I I I 
g209 DLQQHRQTAAQRVDFLVFEKLHHRLLLRHTVVAVFFFDGLQFGCGGTHFRHRTVGGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209.pep IQYGFDDDGXNDNRPAPVADDWQLVQEPEERGGEPVYFAWFGQLQWGDVCDDGCGLR 
I I I I I I I I I II: I I I I I I : I I I I I I I I II I llllhlll I I II I I I I 1 I : M I I I 
g209 IQYGFDDDGQNDDCPAPVADNWQLVQEPEERRCEPVYFTVVFCQLQVVGDVCDNGCGLR 

190 200 210 220 230 240 

250 260 270 280 290 299 

m209.pep AGVEVDGGFGF-APFWMAAKGTLTLVLYSLSLRRLMSMLHSPAAQTLCLPLGWRIQVDMK 

: I : : I I II II 
g209 TGIQVDRHFRFWPPGWDSG 

250 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 725>: 

a209.seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGGCGCGT TGTTTTTCGA 
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51 TGCTGCGGTT GATGTGCCAT TGCTGGGCGA 

101 ACCCAGTACA ATACCAGACC GGCAGGGAAG 

151 AACCAAAGGC ATGATTTTCA TCATTTTCGC 

201 GCGGGTTCAG ATAGGTTTGG GCGAACATCG 

251 AGGATGTAGT AGGGGTCGGC GCGGCTGAGG 

301 AGGTGCCTGG CGCAATTCTA CGGAGGCGAA 

351 TGAAGACGGG GATTTGCAAC AGCATAGGCA 

4 01 ATTTTCTCGT CTGTGTAAAG CTGCATCATG 

451 GTCGTCGCCG TATTTCTCTT TGATGGCTTG 

501 GCATTTTCGC CATCGAACGG TAAGAGGCGT 

551 GCTTTGACGA TGATGGTTAA AACGATAATC 

601 GATGTTGTGC AGTTGGTTCA AAAGCCAAAA 

651 GTACTTTGCC GTAGTCTTTG GCCAGTTGCA 

701 ATAACGGATG TGGTCTGTGG GCCGGCGTAG 

751 TTCGCACCGT TTTGGATAGC GGCTAAAGGC 

801 CAGCTTGTCG TTGCGGCGTT TGATGTCGAT 

851 AAACGCTTTG TCCGCCTTTG GGTTGGAGGA 

901 TGTTCAATCA TGCCGAGCCA GCCGGTCGGG 

951 CTCGGATTTG CCGGATTTGG CATCGTCGTC 

1001 TTTGGAAGTT GCCTTCAGGG GTATAA 

This corresponds to the amino acid sequence <SEQ ID 726; ORF 209.a>: 

a209 .pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE VVDHPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHGLLLRHT 

151 VVAVFLFDGL QFGRGGTHFR HRTVRGVGQW IQYGFDDDG* NDNRPAPVAD 

201 DVVQLVQKPK EGGGEPVYFA WFGQLQWG DVCDNGCGLW AGVEVDGGFG 

251 FAPFWIAAKG TLTLVLYSLS LRRLMSIRQS PAAQTLCPPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

m209/a209 95.6% identity in 341 aa overlap 

10 20 30 40 50 60 

m209 . pep MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDYPVQYQTGREEEEHDGENQRHDFHHFR 
I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I : I I I I I I M I I I I I I I I I ! I I I I I I I I 
a209 MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDHPVQYQTGREEEEHDGENQRHDFHHFR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m209 . pep LHRVGRRRVQISLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 
I I I I I I I I I I I : I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a209 LHRVGRRRVQIGLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 0 9 . pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHT VVAVFLFDGLQFGCGGTH FRHRAVRG VGQW 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I M I I I I I I 11 I I I I : i I I I I I 1 
a209 DLQQHRQ7VAAQRVDFLVCVKLHHGLLLRHTVVAVFLFDGLQFGRGGTHFRHRTVRGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209.pep IQYGFDDDGXNDNRPAPVADDWQLVQEPEERGGEPVYFAVVFGQLQWGDVCDDGCGLR 

I I I I I I I I I I I I I I I I I I M I I I I I I I : I : I I I I I I I I i I I I I I 1 I I I I I ! I I : I I I I 
a209 IQYGFDDDGXNDNRPAPVADDWQLVQKPKEGGGEPVYFAWFGQLQVVGDVCDNGCGLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

m209 . pep AGVEVDGGFGFAPFWMAAKGTLTLVLYSLSLRRLMSMLHSPAAQTLCLPLGWRIQVDMKW 
I I I I I I I I I I I I ! I I : I I I I I i I I I I I I I I I I I I 1 I : : I I I I I I I I I I I I I I I I I I I I 
a209 AGVEVDGGFGFAPFWIAAKGTLTLVLYSLSLRRLMSIRQSPAAQTLCPPLGWRIQVDMKW 

250 260 270 280 290 300 



TGGTCAGGAG 
AAGAAGAACA 
CTGCATCGGG 
TTGCCGCCAT 
TCGGTAATCC 
CAATGCCCAA 
GGCAGCCGCC 
GCTTGTTGCT 
CAGTTTGGGC 
TGGTCAATGG 
GCCCAGCCCC 
GAGGGGGGAG 
GGTTGTCGGC 
AGGTTGATGG 
ACGCTGACGC 
ACGGCAGTCG 
TCCAGGTGGA 
GTTTTGCGGA 
CAAGTCGGAG 



GTTGTTGATC 
TGACGGAGAA 
TCGGTCGGCG 
AATGATGGGC 
AACCCAGCCA 
TACAATCCGA 
CAGCGGGTTG 
GCGCCATACG 
GCGGCGGCAC 
ATACAGTACG 
AGTTGCCGAT 
GCGAACCAGT 
GATGTTTGCG 
AGGCTTCGGT 
TGGTGCTGTA 
CCAGCGGCGC 
CATGAAGTGG 
TGTATTCGGC 
AAGCTGACTT 



m209.pep 



310 320 330 340 

CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
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a209 



I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I 

CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 727>: 

g211 . seq 



1 atgttgcgga ttgctgctgc caatcagttg ggcggtcgaa atggtgcggc 

51 ggtgggaaac ggggtcgata agtttgggcg tggtgctgat aatcaggttg 

101 agtttttgga aggaaacctg attgtagtcg gcgcgtccgg gcgtgccgct 

151 gtaacggtag ccgtggcgca attcgagcgt gcgtttgttg tccttcagcg 

201 agaagttacc ttctttggcg aagatgatgt tgtcgccgcc gtttttgtcc 

251 tgttcgcgca ggaacaggtt tttcatgatg ccggattcgg tgtcaaaggt 

301 ttcgacgaaa taaaccctgc cgttgcgctt gcccaagtta ttgaactcgc 

351 cggcttccac caaagacaat tcctgcttct gcttcaaaat ttcggcatat 

401 tcgcggctgc gcagctctgc ccacggtatc acccaaagct gcatgacggc 

4 51 aatcaggatg gcaaacggca cggcaaactg catgacgggg cgtatccact 

501 gtttcaacgc caatccgcag gatag 



This corresponds to the amino acid sequence <SEQ ID 728; ORF 21 l.ng>: 

g211 .pep 

1 MLRIAAANQL GGRNGAAVGN GVDKFGRGAD NQVEFLEGN L IWGASGRAA 

51 VTVAVA QFER AFWLQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGVKG 

101 FDEINPAVAL AQ VIELAGFH QRQFLLLLQN FGIFAAAQLC PRYHPKLHDG 

151 NQDGKRHGKL HDGAYPLFQR QSAG* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 729>: 



1 ATGTTGCGGG TTGCTGCTGC CAATCAGTTG GGCGGTCGGA ATGGTACGGC 

51 GGTGGGAAAC GGGGTCGATG AGTTTGGGCG TGGTGCTGAT AATCAGGTTG 

101 AGTTTTTGGA AGGAAACCTG ATTGTAGTCG GCGCGTCCGG GCGTGCCGCT 

151 GTAACGGTAG CCGTGGCGCA ATTCGAGCGT GCGTTTGTTG TCGTTCAGCG 

201 AGAAGTTACC TTCTTTGGCG AAGATGATGT TGTCGCCGCC GTTTTTGTCC 

251 TGTTCGCGCA GGAACAGGTT TTTCATGATG CCGGATTCGG TATCGAAGGT 

3 01 TTCGACAAAA TAAACCCTGC CGTTGCGCTT GCCCAAACTG TTGAACTCGC 
351 CTGCCTCCAC CAAAGACAAT TCCTGCTTCT GCTTCAGGAT TTCAGCGTAT 

4 01 TCGCGGCTGC GTAGCTCTGC CCACGGTATC ACCCAAAGCT GCATGACGGC 
451 AACCAAAACG GCAAACGGCA CGGCAAACTG CATCACCGGG CGTATCCATT 
501 GTTTCAATGC CAATCCGCAg GATAG 



This corresponds to the amino acid sequence <SEQ ID 730; ORF 21 1>: 

m211 .pep 

1 MLRVAAANQL GGRNGTAVGN GVDEFGRGAD NQVEFLEGN L IWGASGRAA 
51 VTVAVA QFER AFVWQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGIEG 



101 FDKINPAVAL AQ TVELACLH QRQFLLLLQD FSVFAAAXLC PRYHPKLHDG 
151 NQNGKRHGKL HHRAYPLFQC QSAG* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 211 shows 89.1% identity over a 174 aa overlap with a predicted ORF (ORF 21 l.ng) 
from N. gonorrhoeae: 



m211 . seq 



m211/g211 



m211 .pep 



10 20 30 40 50 60 

MLRVAAANQLGGRNGTAVGNGVDE FGRGADNQVE FLEGNL I WGASGRAAVTVAVAQFER 



g211 




10 20 30 40 50 60 



m211 -pep 



70 80 90 100 110 120 

AFWVQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVELACLH 
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| MhM III IN llllll IIIMM III III I :MI-Mhll II I I M :| 
g2 ii AFWLQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGVKGFDEINPAVALAQVIELAGFH 

70 80 90 100 110 120 

130 140 150 160 170 

m2 1 1 pep QRQFLLLLQDFS VFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

MMMMMMMMI MIMMIMIMMMMMM MMM MM 

g2 ll QRQFLLLLQNFGIFAAAQLCPRYHPKLHDGNQDGKRHGKLHDGAYPLFQRQSAG 

130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 73 1>: 



a211 . seq 








1 


ATGTTGCGGG 


TTGCTGCTGC 


CAATCAGTTG 


51 


GGTGGGAAAC 


GGGGTCGATG 


AGTTTGGGCG 


101 


AGTTTTTGGA 


AGGAAACCTG 


ATTGTAGTCG 


151 


GTAACGGTAG 


CCGTGGCGCA 


ATTCGAGCGT 


201 


AGAAGTTACT 


TTCTTTGGCG 


AAGATGATGT 


251 


TGTTCGCGCA 


GGAACAGGTT 


TTTCATGATG 


301 


TTCGACAAAA 


TAAACCCTGC 


CGTTGCGCTT 


351 


CTGCCTCCAC 


CAAAGACAAT 


TCCTGCTTCT 


401 


TCGCGGCTGC 


GTAGCTCTGC 


CCACGGTATC 


451 


AACCAAAACG 


GCAAACGGCA 


CGGCAAACTG 


501 


GTTTCAATGC 


CAATCCGCAG 


GATAG 



This corresponds to the amino acid sequence <SEQ ID 732; ORF 21 l.a>: 

a211.pep 

1 MLRVAAANQL GGRNGTAVGN GVDEFGRGAD NQVEFLEGN L IWGASGRAA 

51 VTVAVAQFER AFWVQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGIEG 

101 FDKINPAVAL AQTVE PACLH QRQ FLLLLQD FSVFAAA*LC PRYHPKLHDG 

151 NQNGKRHGKL HHRAYPLFQC QSAG* 

m2 1 1 /a2 1 1 99.4% identity in 1 74 aa overlap 

10 20 30 40 50 60 

m2 1 1 pep MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLI WGASGRAAVTVAVAQFER 

I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I t I I I I I I I i I 1 ! I I I I ! 

a 2H MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLI WGASGRAAVTVAVAQFER 

10 20 30 40 50 60 

70 80 90 100 110 120 

m211 . pep AFVVVQREVTFFGEDDVVAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVELACLH 

| | | M I I I II I I I I IN I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I 1 I I 
a211 AFVWQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVE PACLH 

70 80 90 100 110 120 

130 140 150 160 170 

m211 . pep QRQFLLLLQDFS VFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

I I | | | | I I I I I I II I I I I I I M I I I I I I I I I i I I I 1 I 1 I I I I I I I I M I I I I I I I 
a211 QRQFLLLLQD FSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 733>: 

g212 . seq (partial) 

1 atggacaatc tcgtatggga cggcattccc gacatccgca cactcgacca 

51 aaccatccgc aaacacgcac acccgctcaa cctgattgtc tgcctccccg 

101 ataatcagat tcccgatttt caaaccgcac aagatgcttc ggactcggaa 

151 tgccgtctga agcaccgttt ggatcaggca acccagtgcc tccagttcga 

201 cagcatcaac ctcatcgaac acatcctgcc cgatgtccgc ttctggctgg 

251 ttcccccttc acgcacccgc cgcctgcacg aacacttcca ccacatttcc 

301 tggcagaccg aagccatccc gcaaaccgaa agcaagtccg acaaaccctg 

351 gtttgcactt ccacaaacat ccgaacggaa aaaaccggaa cacgtcctcg 
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ccgcccacgc cttagcatca 
cgaaaagccg ctcaagccgc 
caaaatctcg ccgcacgaca 
acggctacac caaacgcctg 
tggggcggca acggcatcat 
acgcaatcac gaattgggtt 
gcatcacgtc tgcagaagcc 
ccctacgccg aaccattatg 
caatccgccc gcattcgtcc 
tatatgaaaa cacaacgtta 
attgcaagca cgccaaacgg 
caccggcgcg cacagcccct 
ccctcaggca aatacgcgga 
tccgaacaac tgcgttgcgc 
gtggcacgga ctgcactgct 
ataccggatg gaacgaagcc 
caccttaacc ccgcccttgc 

This corresponds to the amino acid sequence <SEQ ID 734; ORF 212.ng>: 

g212.pep (partial) 

1 MDNLVWDGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPDF QTAQDASDSE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAIPQTE SKSDKPWFAL PQTSERKKPE H VLVIGAGIA GASTAHAL AS 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTGQTELL LAGYGYTKRL 

201 LGHILPDSDT WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITSAEA 

2 51 EKIAGIPLNT PYAEPLCGLY WQHGVWLNPP AFVRTLLSHP LIELYENTTL 

3 01 TGISHDGEKW IASTPNGTFT ATHIIYCTGA HSPCLPETNL AALPLRQIRG 
351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSNTGWNEA 

4 01 EEASNRQALA HLNPALAESL F. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 735>: 

m212 . seq 

1 ATGGACAATC TCGTATGGGA CGGCATTCCC GACATCCGCA CACTCGACCA 

51 AGCCATCCGC AAACACGCAC CCCCGCTCAA CCTGATTATC TGCCTCCCCG 

101 ATAATCAGAT TCCCGATTTT CAAACCGCAC AAGATGCTTC GGACGCGGAA 

151 TGCCGTCTGA AGCACCGTTT GGATCAGGCA ATGCAGTGCC TCCAGTTCGA 

201 CAGCATCAAC CTCATCGAAC ACATCCTGCC CGATGTCCGC TTCTGGCTGG 

251 TTCCCCCTTC ACGCACCCAC CACCTGCACG AACATTTCCA CCACATTTCC 

301 TGGCAGACCG AAGCCATCCC GCAAACCGAA AGCAAGCCCG ACAAACCCTG 

3 51 GTTTGCACTT CCACAAACAT CCGAACGGCA AAAACCGGAA CACATCCTCG 

4 01 TTATCGGCGC GGGCATATCC GGCGCGGCAA CCGCCCACGC CTTAGCATCA 
4 51 CACGGCATTT CCGTTACCGT ATTGGAAGCC CGAAAAGCCG CCCAAGCCGC 
501 CAGCGGCAAC CGCCAAGGGC TGCTCTACGC CAAAATCTCG CCGCACGACA 
551 CCGAACAGAC CGAACTTTTG CTTGCCGGCT ACGGCTACAC CAAACGCCTG 
601 CTCGGACACA TCCTGCCCGA ATCCGAAACC TGGGGCGGCA ACGGCATCAT 
651 CCACCTCAAT TACAGCCGCA CCGAACAACA ACGCAATCAC GAATTGGGTT 
701 TGCAAAAACA CCATAACCAC CTCTACCGCA GCATCACATC TGCAGAAGCC 
7 51 GAAAAAATCG CCGGTATCCC ACTGTCCGTC CCATACGACC ACCCTTCATG 
801 CGGACTCTAC TGGCAACACG GCGTATGGCT CAATCCACCC GCATTCGTCC 
851 GCACCCTCCT CAACCATCCG CTCATTGGAC TACACGAAGA CACACCCTTG 
901 ACCGACATTT CCCACGACGG GGaAAAGTGG ATTGCAAGCA CGCCAAACGG 

951 CACATTTACC GCCACACACA TCATCTACTG CACCGGTGCG AACAGCCCCT / 

1001 ACCTACCCGA AACCAACCTC GCCGCCCTGC CTCTCAGGCA AATACGCGGA 

1051 CAAACCGGCC TCACACCGTC CACCCCGTTT TCCGAACAAC TGCGTTGCGC 

1101 CGTTTCAGGC GAAAGCTACA TCAGCCCGTC GTGGCACGGA CTGCACTGCT 

1151 ACGGCGCGAG TTTTATTCCC AACAGCAGCC ATACCGGATG GAACGAAGCC 

1201 GAAGAAGCCT CAAACCGCCA AGCATTGGCA CACCTTAACC CCGCCCTTTC 

1251 CGAATCATTG TTTGCCGCCA ACCCAAACCC CCAAAAACAC CAAGGGCACG 

13 01 CCGCCATACG CTGCGACAGC CCCGACCACC TTCCCCTAGT CGGCGCACTC 

13 51 GGCGACATTG CCGCCATGCG GCAGACCTAC ACCAAACTCG CGCTGGACAA 

14 01 AAACTACCGC ATCGACACCC CATGCCCATA CCTGCCTAAT GCCTACGTCA 
14 51 ACACCGCGCA CGGCACCCGC GGACTCGCCA CCGCCCCCAT CTGCGCCGCC 



401 


tcatcggtgc 


451 


cacggcattt 


501 


cagcggcaac 


551 


ccggacagac 


601 


ctcggacaca 


651 


ccacctcaat 


701 


tgcaaaaaca 


751 


gaaaaaatcg 


801 


cggactctac 


851 


gcaccctcct 


901 


accggcattt 


951 


cacatttacc 


1001 


gcctgcccga 


1051 


caaaccggcc 


1101 


cgtttcaggc 


1151 


acggcgcgag 


1201 


gaagaagcct 


1251 


cgaatcattg 



aggcattgcc ggcgcatcga 
ccgttaccgt attggaagcc 
cggcaagggc tgctttacgc 
cgaactgctg cttgccggct 
tcctgcccga ctccgacact 
tacagccgca ccgaacaaca 
ccataaccac ctctaccgca 
ccggcatccc gctgaacacg 
tggcaacacg gcgtatggct 
cagccatccg ctgatcgaac 
cccacgacgg agaaaagtgg 
gccacacaca tcatctactg 
aaccaacctc gccgccctac 
tcacaccgtc caccccgttt 
gaaagctaca tcagcccgtc 
ttttattccc aacagcagca 
caaaccgcca agcattggca 
ttt . . . 
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1501 GmCAwTGCAG CCCAAATCsT AGGCyTGCCC CATCCCTTTT yAcAAcGCCT 
15 51 gCGCCACGCC cTAcACCCCA ACCGCACCAT CATCCGCGCC ATCGTCAGAA 
1601 GGAAGGATCT AACCCCTTAA 

This corresponds to the amino acid sequence <SEQ ED 736; ORF 212>: 

m212 .pep 

1 MDNLVWDGI P DIRTLDQAIR KHAPPLNLII CLPDNQIPDF QTAQDASDAE 

51 CRLKHRLDQA MQCLQFDSIN LIEHILPDVR FWLVPPSRTH HLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE H ILVIGAGIS GAATAHALA S 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGI IHLN YSRTEQQRNH ELGLQKHHNH LYRSITSAEA 

251 EKIAGIPLSV PYDHPSCGLY WQHGVWLNPP AFVRTLLNHP LIGLHEDTPL 

301 TDISHDGEKW IASTPNGTFT ATHIIYCTGA NSPYLPETNL AALPLRQIRG 

3 51 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

4 01 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 
451 GDIAAMRQTY TKLALDKNYR IDTPCPYLPN AYVNTAHGTR GLATAPICAA 
501 XXAAQIXGLP HPFXQRLRHA LHPNRTIIRA IVRRKDLTP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 212 shows 92.9% identity over a 421 aa overlap with a predicted ORF (ORF 212.ng) 
from N. gonorrhoeae: 

m212/g212 

10 20 30 40 50 60 

MDNLVWDGI PDI RTLDQAI RKHAPPLNLI I CLPDNQI PDFQTAQDAS DAE CRLKHRLDQA 

1 1 !l II II i ; ill MINIMI ! 1 1 1 1 ^ I! M 1 1 1 1 1 1 1 i M ! I i I : I ! II : 1 1 1 M I 

MDNLVWDGI PDI RTLDQTIRKHAHPLNLIVCLPDNQ I PDFQTAQDASDSECRLKHRLDQA 
10 20 30 40 50 60 

70 80 90 100 110 120 

MQCLQFDS INL I EHI LPDVRFWLVPPSRTHHLHEHFHH I SWQTEAI PQTESKPDKPWFAL 

I M II MIIMIMIMIIMI IIIIIMMMII II Mill ! II 1 1 1 Ml IMIIM 

TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLHEHFHHISWQTEAIPQTESKSDKPWFAL 
70 80 90 100 110 120 

130 140 150 160 170 180 

PQTS ERQKPEHI LVI GAGI S GAATAHALAS HG I S VTVLEARKAAQAASGNRQGLLYAKI S 

1 1 M M M M i M M II MM I M 1 1 II 1 1 1 M 1 1 1 M 1 1 1 1 II 1 1 1 M I II M 1 1 M 1 1 

PQTSERKKPEHVLVIGAGIAGASTAHALASHGISVTVLEARKAAQAASGNRQGLLYAKIS 
130 140 150 160 170 180 

190 200 210 220 230 240 

PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

MM E I 1 I I I I I I I I 1 I M I I I I I I = I : I I I I I I I ! I I I II I I I I I I I I I I I II I I I I I 
PHDTGQTELLLAGYGYTKRLLGHILPDSDTWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

190 200 210 220 230 240 

250 260 270 280 290 300 

LYRS I TSAEAEKI AGI PLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNTHPLIGLHEDTPL 

MMMMMIMMMMMI I I II || || II M I II M M I M M M I MM M I 

LYRS I TSAEAEKI AGI PLNTPYAEPLCGLYWQHGVWLNPPAFVRTLLSHPLI ELYENTTL 
250 260 270 280 290 300 

310 320 330 340 350 360 

TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLAALPLRQIRGQTGLTPSTPF 

I 1 1 1 II I M I II M 1 1 M II 1 1 1 1 II MM II M II MM I M MIMI I IMM 

TGISHDGEKWIASTPNGTFTATHIIYCTGAHSPCLPETNLAALPLRQIRGQTGLTPSTPF 
310 320 330 340 350 360 



m212 .pep 
g212 

m212 .pep 
g212 

m212 -pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 



m2i2 .pep 



370 380 390 400 410 420 

SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 
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Ml II I II II III MUM I INI! I! Mill 1 = 11 II I Mill III! MM I M hi II 

g2 12 SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSNTGWNEAEEASNRQALAHLNPALAESL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m2 12 . pep FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMRQTYTKLALDKNYRIDTPCPYLPN 

I 

g212 F 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 737>: 

a212.seq 

1 ATGGACAATC TCGCATGGAA ' CGGCATTCCC GACATCCGCA CACTCGACCA 

51 AACCATCCGC AAACACGCAC ACCCGCTCAA CCTGATTGTC TGCCTCCCCG 

101 ATAATCAGAT TCCCAATTTT CAAACCGCAC AAGATGCTTC GGACGCGGAA 

151 TGCCGTCTGA AGCACCGTTT GGATCAGGCA ACCCAGTGCC TCCAGTTCGA 

201 CAGCATCAAC CTGATTGAAC ACATCCTGCC CGATGTCCGC TTCTGGCTGG 

251 TTCCCCCTTC ACGCACCCGC CGCCTGCACG AACACTTCCA CCACATTTCC 

301 TGGCAGACCG AAGCCATCCC GCAAACCGAA AGTAAGCCCG ACAAACCCTG 

351 GTTTGCACTT CCACAAACAT CCGAACGGCA AAAACCGGAA CACATCCTCG 

4 01 TTATCGGAGC GGGCATATCC GGCGCGGCAA CCGCCCACGC CT TAG CATC A 

4 51 TACGGCATTT CCGTTACCGT ATTGGAAGCC CGAAAAGCCG CCCAAGCCGC 

501 CAGCGGCAAC CGCCAAGGGC TGCTCTACGC CAAAATCTCG CCGCACGACA 

551 CCGAACAAAC CGAACTGCTG CTTGCCGGCT ACGGCTACAC CAAACGCCTG 

601 CTCGGACATA TCCTGCCCGA ATCCGAAACC TGGGGCGGCA ACGGCATCAT 

651 CCACCTCAAT TACAGCCGCA CCGAACAACA ACGCAATCAC GAATTGGGTT 

701 TGCAAAAACA CCATAACCAC CTCTACCGCA GCATCACGCA GGCAGAAGCC 

751 GAAAAAATCG CCGGCATCCC TCTGAACACG CCCTACGCCG AACCATTATG 

801 CGGACTGTTT TGGCAGTACG GCGTATGGCT CAATCCTCCC ACATTCGTCC 

851 GCGCCCTCCT CAGCCATCCG CTCATTGGAC TACACGAAGA CACACCGTTA 

901 ACCGACATTT CCCACGACGG GGAAAAGTGG ATTGCAAGCA CGCCAAACGG 

951 CACATTTACC GCCACACACA TCATCTACTG CACCGGTGCG AACAGCCCCT 

1001 ACCTACCCGA AACCAACCTC GCCACCCTGC CCCTCAGGCA AATACGCGGA 

1051 CAAACCGGCC TCACACCGTC CACCCCGTTT TCCGAACAAC TGCGTTGCGC 

1101 CGTTTCAGGC GAAAGCTACA TCAGCCCGTC GTGGCACGGA CTGCACTGCT 

1151 ACGGCGCGAG TTTTATTCCC AACAGCAGCC ATACCGGATG GAACGAAGCC 

1201 GAAGAAGCCT CAAACCGCCA AGCATTGGCA CACCTTAACC CCGCCCTTTC 

1251 CGAATCATTG TTTGCCGCCA ACCCAAACCC CCAAAAACAC CAAGGGCACG 

1301 CCGCCATACG CTGCGACAGC CCCGACCACC TTCCCCTAGT CGGCGCACTC 

1351 GGCGACATTG CCGCTATGCA ACAAACTTAC GCCAAACTCG CGCTGGACAA 

14 01 AAACTATCGC ATCGATGCCC CCTGCCCGTA CCTGCCCAAT GCCTACGCCA 

14 51 ACACCGCCCA CGGCACACGC GGGCTTGCCA CCGCCCCCAT CTGCGCCGCC 

1501 GCCGTTGCAG CCGAAATCCT AGGCTTGCCC CATCCCCTCT CAAAACGCCT 

1551 GCGCCACGCC CTACACCCCA ACCGCGCCAT CATCCGCGCC ATCGTCAGAA 

1601 GGAAGGATCT AACCCCTTAA 

This corresponds to the amino acid sequence <SEQ ID 738; ORF 212.a>: 

a212.pep 

1 MDNLAWNGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPNF QTAQDASDAE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE H ILVIGAGIS GAATAHAL AS 

151 YGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITQAEA 

251 EKIAGIPLNT PYAEPLCGLF WQYGVWLNPP TFVRALLSHP LIGLHEDTPL 

301 TDISHDGEKW IASTPNGTFT ATHIIYCTGA NSPYLPETNL ATLPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

4 01 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 

4 51 GDIAAMQQTY AKLALDKNYR IDAPCPYLPN AYANTAHGTR GLATAPICAA 

501 AVAAEILG LP HPLSKRLRHA LHPNRAIIRA IVRRKDLTP* 

m212/a212 93.7% identity in 539 aa overlap 

10 20 30 40 50 60 

ra212 pep MDNLVWDGIPDIRTLDQAIRKHAPPLNLIICLPDNQIPDFQTAQDASDAECRLKHRLDQA 
I i I I : 1 : I I I I I I I I I I : I I I I I I I I I I : II I I I I I I : i I I I I I I I M I i I I I ! I I I I I 

a212 MDNLAWNGIPDIRTLDQTIRKHAHPLNLIVCLPDNQIPNFQTAQDASDAECRLKHRLDQA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m212.pep MQCLQFDS INLI EHI LPDVRFWLVPPSRTHHLHEHFHH I SWQTE AI PQTES KPDKPWFAL 

I I I I I I I I I I I I I I I I I I I I I I I I II I I : : I I I I I I ! I I I I I I I I I I I I I I II I I I I I I 
a212 TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLHEHFHHISWQTEAIPQTESKPDKPWFAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m212 . pep PQTSERQKPEHILVIGAGISGAATAHALASHGISVTVLEARKAAQAASGNRQGLLYAKIS 
I I I I I I I I I I I I I I I I I II I I I I I I I I 1 I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a212 PQTSERQKPEHILVIGAGISGAATAHAI^ASYGISVTVLEARKAAQAASGNRQGLLYAKIS 

130 140 150 160 170 180 

190 200 210 220 230 240 

m212.pep PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 
I I I I I I I II I I I I I I I I I I I i I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 
a212 PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

190 200 210 220 230 240 

250 260 270 280 290 300 

m212.pep LYRSITSAEAEKIAGIPLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNHPLIGLHEDTPL 
I I I I I I :| I I I I I I I I I I : : I I - I I I I : I I : I I I I I I I : I I I : I I : I I I I I I II I I I I 
a212 LYRSITQAEAEKIAGIPLNTPYAEPLCGLFWQYGVWLNPPTFVRALLSHPLIGLHEDTPL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m212.pep TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLAALPLRQIRGQTGLTPSTPF 
I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I : I I ! I I I I I I I I I I I I I I I 
a212 TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLATLPLRQIRGQTGLTPSTPF 
310 320 330 340 350 360 

370 380 390 400 410 420 

m212 .pep SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 
I I II I I I I I I I I I I I I I I I I I M I I I I I I I ! I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
a212 SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m212.pep FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMRQTYTKLALDKNYRIDTPCPYLPN 
I I I I I I I I I I I I 1 .1 I I I I I I I I I I I I I I I I I I I I I I : I I I : I I I I I I I I I I I : I I I I I I I 
a212 FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMQQTYAKLALDKNYRIDAPCPYLPN 

430 440 450 460 470 480 

490 500 510 520 530 540 

m212 . pep AYVNTAHGTRGLATAPICAAXXAAQIXGLPHPFXQRLRHALHPNRTIIRAIVRRKDLTPX 
I I : I I I I I I I I I I I I I I II I I I : I Mill: : I I I I I I I I I I : I I I I I II I I I I I I I 
a212 AYANTAHGTRGLATAPICAAAVAAEILGLPHPLSKRLRHALHPNRAIIRAIVRRKDLTPX 

490 500 510 520 530 540 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 739>: 

g214 . seq 

1 atgatacaaa agatatgtaa gctatttgtt ttaattgtaa tttttgcaac 

51 ttctcccgct tttgcccttc aaagcgacag cagacggccc atccaaatcg 

101 aagccgacca aggttcgctc gatcaagcca accaaaggac cacatttagc 

151 ggcaatgtca tcatcagaca gggtacgctc aacatttccg cctcgtgtgt 

201 caacgtcaca cgcggcaggc aaaggcggcg aatccgtgag ggcggaaggt 

251 tcgcccgtcc gcttcagcca aacgttggac gggggcaaag ggacggtgcg 

301 cggtcaggca aacaacgtta cctattcctc cgcaggaagc actgtcgttc 

351 tgaccggcaa tgccaaagtg cagcgcggcg gcgacgttgc cgaaggtgcg 

4 01 gtcattacct acaacaccaa aaccgaagtc tataccatca acggcagcac 

451 gaaatcgggt gcgaaatccg cttccaaaac cggcagggtc agcgtcgtca 

501 tccagccttc aagcacacaa aaaaccgaat aaccccgatg ccgtctgaaa 

551 cggaaacgca gttcagacgg catttgccga ccgaaatgcc gagaagagat 

601 tattga 

This corresponds to the amino acid sequence <SEQ ID 740; ORF 214.ng>: 

g214 .pep 
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1 MIQKICKLFV LIVIFATSPA FA LQSDSRRP IQIEADQGSL DQANQRTTFS 
51 GNVI IRQGTL NISASCVNVT RGRQRRRIRE GGRFARPLQP NVGRGQRDGA 
101 RSGKQRYLFL RRKHCRSDRQ CQSAARRRRC RRCGHYLQHQ NRSLYHQRQH 
151 EIGCEIRFQN RQGQRRHPAF KHTKNRITPM PSETETQFRR HLPTEMPRRD 
201 Y 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 741>: 

m214.seq (partial) 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

3 01 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

3 51 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

401 TGATTACATA CAACACCAAA ACCGAAGTCT ATACCATCAG CGGCAGCACA 

451 AAATT . . . 

This corresponds to the amino acid sequence <SEQ ID 742; ORF 214>: 

m214.pep (partial) 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
51 GNWIRQGTL NISAARVNVT RGRQRRRIRE GGRFASPLQP DIGRRQRHGA 
101 RTGKQRCLFI CRQHRSLNR* CQS TARRRCR RRCGDYIQHQ NRSLYHQRQH 
151 KI . . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 214 shows 80.3% identity over a 152 aa overlap with a predicted ORF (ORF 214.ng) 
from K gonorrhoeae: 

m214/g214 



10 20 30 40 50 60 

m214 . pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 

M II II II III l-MMIMMMMMMMMIMMMI II 1 1 1 1 II h 1 1 1 1 1 1 

9214 MIQKICKLFVLIVIFATSPAFALQSDSRRPIQIEADQGSLDQANQRTTFSGNVI IRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 1 4 . p ep NI S AARVNVTRGRQRRR I REGGRFAS PLQPD I GRRQRHGARTGKQRCL FICRQHRSLNRX 

MM: || M| II III IMIMIII |||!::|| || |||:|||| ||: |:| :| 
9214 NISASCVNVTRGRQRRRIREGGRFARPLQPNVGRGQRDGARSGKQRYLFLRRKHCRSDRQ 

70 80 90 100 110 120 

130 140 150 

m2 14 . pep CQS TARRRCRRRCGD Y I QHQNRS L YHQRQHK I 

1 1 1 - 1 1 1 1 MM h I I I I I Ml II 1 1 1 M 

9214 CQSAARRRRCRRCGHYLQHQNRSLYHQRQHEIGCEIRFQNRQGQRRHPAFKHTKNRITPM 

130 140 150 160 170 180 

g2 14 PSETETQFRRHLPTEMPRRDY 

190 200 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 743>: 

a214.seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGC . GGC AAAGGCGGCG AATCCGTGAG GGCGGAAGGT 

251 TCGCCAGTCC GCTTCAGCCA GACATTGGAC GGCGGCAAAG GCACGGTGCG 

301 CGGACAGGCA AACAACGTTG CTTATTCATC TGCAGGCAGC ACCGTAGTCT 
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351 TAACCGGTAA TGCCAAAGTA CAGCGCGGCG GCGATGTCGC CGAAGGTGCG 

4 01 GTGATTACAT ACAACACCAA AACCGAAGTC TAT AC CATC A GCGGCAGCAC 

4 51 AAAATCCGGC GCAAAATCCG CTTCCAAATC CGGCAGGGTC AGCGTCGTTA 

501 TCCAGCCTTC GAGTACGCAA AAATCCGAAT AATCCCAATG CCGTCTGAAA 

551 CATAAACCTG GTTCGGACGG CATTTGCCGA CCGAAATATT GAAGAGATAT 

601 TTATGA 



This corresponds to the amino acid sequence <SEQ ID 744; ORF 214.a>: 

a214 .pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
51 GNWIRQGTL NISAARVNVT RGXQRRRIRE GGRFASPLQP DIGRRQRHGA 
101 RTGKQRCLFI CRQHRSLNR* CQSTARRRCR RRCGDYIQHQ NRSLYHQRQH 
151 KIRRKIRFQI RQGQRRYPAF EYAKIRIIPM PSET*TWFGR HLPTEILKRY 
201 L* 

m214/a214 99.3% identity in 152 aa overlap 

10 20 30 40 50 60 

m214 . pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 
i I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a214 MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m214 . pep NI SAARVNVTRGRQRRRIREGGRFAS PLQPD IGRRQRHGARTGKQRCLFI CRQHRSLNRX 

I I I I I I I I I I I I I I I I I I I I I i I I I I I II I I ! I I I I I I I I I I I I I I I I I I I I I f| I I | I 
a214 NISAARVNVTRGXQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFI CRQHRSLNRX 

70 80 90 100 110 120 

130 140 150 

m214 . pep CQSTARRRCRRRCGDY IQHQNRS LYHQRQHKI 

I I I I I I I I I I I I !! I I I I I I I 1 I I I I I I I I I I 
a2 1 4 CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKIRRKIRFQIRQGQRRYPAFEYAKIRI I PM 

130 140 150 160 170 180 



a214 PSETXTWFGRHLPTEILKRYLX 

190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 745>: 

g214-l.seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATTGTAA TTTTTGCAAC 

51 TTCTCCCGCT TTTGCCCTTC AAAGCGACAG CAGACGGCCC ATCCAAATCG 

101 AAGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGTAC CACATTTAGC 

151 GGCAATGTCA TCATCAGACA GGGTACGCTC AACATTTCCG CCTCGCGCGT 

201 CAACGTCACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCCGTCCG CTTCAGCCAA ACGTTGGACG GGGGCAAAGG GACGGTGCGC 

301 GGTCAGGCAA ACAACGTTAC CTATTCCTCC GCAGGAAGCA CCGTCGTTCT 

351 GACCGGCAAT GCCAAAGTGC AGCGCGGCGG CGACGTTGCC GAAGGTGCGG 

4 01 TCATTACCTA CAACACCAAA ACCGAAGTCT ATACCATCAA CGGCAGCACG 

451 AAATCGGGTG CGAAATCCGC TTCCAAAACC GGCAGGGTCA GCGTCGTCAT 

501 CCAGCCTTCA AGCACACAAA AAACCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 746; ORF 214-1. ng>: 

g214-l .pep 

1 MIQKICKLFV LI VI FATS PA FA LQSDSRRP IQIEADQGSL DQANQSTTFS 
51 GNVIIRQGTL NISASRVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVTYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTINGST 
151 KSGAKSASKT GRVSVVIQPS STQKTE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 747>: 

u214-l.seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 
51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 
101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 
151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 
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201 CAATGTTACA 

251 CGCCAGTCCG 

301 GGACAGGCAA 

351 AACCGGTAAT 

401 TGATTACATA 

451 AAATCCGGCG 

501 CCAGCCTTCG 



CGCGGCGGCA 
CTTCAGCCAG 
ACAACGTTGC 
GCCAAAGTAC 
CAACACCAAA 
CAAAATCCGC 
AGTACGCAAA 



AAGGCGGCGA 
ACATTGGACG 
TTATTCATCT 
AGCGCGGCGG 
ACCGAAGTCT 
TTCCAAATCC 
AATCCGAATA 



ATCCGTGAGG 
GCGGCAAAGG 
GCAGGCAGCA 
CGATGTCGCC 
ATACCATCAG 
GGCAGGGTCA 
A 



GCGGAAGGTT 
CACGGTGCGC 
CCGTAGTCTT 
GAAGGTGCGG 
CGGCAGCACA 
GCGTCGTTAT 



This corresponds to the amino acid sequence <SEQ ID 748; ORF 214-1>: 

m214-l .pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
51 GNVVIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVAYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 
151 KSGAKSASKS GRVSVVIQPS STQKSE* 



m214-l/g214-l 93.8% identity in 176 aa overlap 

10 20 30 40 50 60 

m214-l.pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 
I M I I I I I I I I I :: I :: i I I I I I I I I ( I : I I I I I 1 I I I ! I I I I t I I I i i I I I I : I I i I I I 
g 214-l MIQKICKLFVLIVIFATSPAFALQSDSRRPIQIEADQGSLDQANQSTTFSGNVIIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m214-l.pep NISAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGSTVVLTGN 
I I I I : I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I 
g214-l NISASRVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVTYSSAGSTVVLTGN 

70 80 90 100 110 120 



130 140 150 160 170 

m214-l .pep AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSVVIQPSSTQKSEX 
I I II I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I : I I I I I I I I II I M I : I i 
g214-l AKVQRGGDVAEGAVITYNTKTEVYTINGSTKSGAKSASKTGRVSWIQPSSTQKTEX 
130 140 150 160 170 



g214-l/p38685 

sp | P3 8 68 5 | YHBN__ECOLI 17.3 KD PROTEIN IN MURA-RPON INTERGENIC REGION PRECURSOR (ORF185) 
>gi 1 551336 (U12684) orfl85 [Escherichia coli] >gi 1 606139 (U18997) ORF_ol85 [Escherichia coli] 
>gi 1 1789592 (AE000399) orf, hypothetical protein [Escherichia coli] Length = 185 
Score - 97.1 bits (238), Expect = 6e-20 

Identities = 57/126 (45%), Positives = 74/126 (58%), Gaps - 3/126 (2%) 

Query: 19 PAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTLNISAARVNVTR--GGKGG 76 

PAFA+ D+ QPI IE+DQ SLD TF+GNV++ QGT+ I+A +V VTR G +G 

Sbjct: 24 PAFAVTGDTDQPIHIESDQQSLDMQGNVVTFTGNVIVTQGTIKINADKVVVTRPGGEQGK 83 

Query: 77 ESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGSTVVLTGNAKVQRGGDVAEGAVIT 136 

E + GPFQDGKVGA++YA VVLTGNA +Q+ +G IT 

Sbjct: 84 EVIDGYGKPATFYQMQDNGK-PVEGHASQMHYELAKDFVVLTGNAYLQQVDSNIKGDKIT 142 

Query: 137 YNTKTE 142 

Y K + 
Sbjct: 143 YLVKEQ 148 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 749>: 

a214-l .seq 

1 AT GAT AC AAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

301 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

351 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

4 01 TGATTACATA CAACACCAAA ACCGAAGTCT ATACCATCAG CGGCAGCACA 

4 51 AAATCCGGCG CAAAATCCGC TTCCAAATCC GGCAGGGTCA GCGTCGTTAT 

501 CCAGCCTTCG AGTACGCAAA AATCCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 750; ORF 214-1. a>: 

a214-l .pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
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51 GNVVIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR ' 
101 GQANNVAYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 
151 KSGAKSASKS GRVSVVIQPS STQKSE* 

a214-l/m214-l 100.0% identity in 176 aa overlap 

10 20 30 40 50 60 

a214-l.pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 
I I I I I I I I I I I II I i I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m214-l MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

a2 1 4 - 1 . pep NI SAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSS AGSTVVLTGN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I i I I I I I 
m2 1 4 - 1 NI SAARVN VTRGGKGGES VRAEGS P VRFSQTLDGGKGTVRGQANNVAYS S AGSTVVLTGN 

70 80 90 100 110 120 

130 140 150 160 170 

a214-l .pep AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSVVIQPSSTQKSEX 
I I II i I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 1 I I I I I 
m214-l AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSVVIQPSSTQKSEX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 75 1>: 

g215 . seq 

1 atgaaagtaa gatggcggta cggaattgcg ttcccattga tattggcggt 

51 tgccttgggc agcctgtcgg catggttggg ccgtatcagc gaagtcgaaa ' 

101 tcgaggaagt caggctcaat cccgacgaac ctcaatacac aatggacggc 

151 ttggacggaa ggcggtttga cgaacaggga tacttgaaag aacatttgag 

201 cgcgaaaggt gcgaaacagt ttcccgaaaa cagcgacatc cattttgatt 

251 cgccgcatct cgtgttcttc caagaaggca ggctgttgta cgaagtcggc 

301 agcgatgaag ccgtttacca taccgaaaac aaacaggttc tttttaaaaa 

3 51 caacgttgtg ctgaccaaaa ccgccgacgg caggcggcag gcgggtaaag 
401 tcgaaaccga aaaactgcac gtcgataccg aatctcaata tgcccaaacc 

4 51 gatacgcctg tcagtttcca atatggcgcg tcgcacggtc aggcgggcgg 
501 tatgacctac aaccacaaaa caggcatgtt gaacttctca tctaaagtga 
551 aagccgcgat ttatgataca aaagatatgt aa 

This corresponds to the amino acid sequence <SEQ ID 752; ORF 215.ng>: 

g215 .pep 

1 MKVRWRYGIA FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSAKG AKQFPENSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNW LTKTADGRRQ AGKVETEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY NHKTGMLNFS SKVKAAIYDT KDM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 753>: 

m215.seq (partial) 

1 . .AGCCTGTCGG CATGGTTGGG TCGTATCAGC GAAGTCGAGA TTGAAGAAGT 

51 CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACAGC TTGGACGGCA 

101 GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG CGCGAAGGGC 

151 GCGAAACAGT TTCCGGAAAG CAGCGACATC CATTTTGATT CGCCGCATCT 

2 01 CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC AGCGACGAAG 
251 CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA CAACGTTGTG 

3 01 CTGACCAAAA CCGCCGACGG CAAACGGCAG GCGGGTAAAG TTGAAGCCGA 

3 51 AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC GATACGCCTG 

4 01 CAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG CATGACTTAC 
4 51 GACCACAwwA CAGGCATGTT GAACTTCTCA TCTAAAGTGA AAGCCACGAT 
501 TTATGATACA AAAGATATGT AA 

This corresponds to the amino acid sequence <SEQ ID 754; ORF 21 5>: 

m215.pep (partial) 

1 . . SLSAWLGRIS EVEIEEVRLN PDEPQYTMDS LDGRRFDEQG YLKEHLSAKG 

51 AKQFPESSDI HFDSPHLVFF QEGRLLYEVG SDEAVYHTEN KQVLFKNNW 

101 LTKTADGKRQ AGKVEAEKLH VDTESQYAQT DTPVSFQYGA SHGQAGGMTY 
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151 DHXTGMLNFS SKVKATIYDT KDM* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 215 shows 96.0% identity over a 173 aa overlap with a predicted ORF (ORF 215.ng) 
from N. gonorrhoeae: 

m215/g215 



m215 .pep 
g215 



10 20 30 40 

SLS AWLGRI S EVE I EE VRLNPDE PQ YTMDS LDGRRFDEQG 

MM M 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 i I M I hi 1 1 1 M M 1 1 

MKVRWRYGIAFPLILAVALGSLSAWLGRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 
10 20 30 40 50 60 



50 60 70 80 90 100 

m215 pep YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 

1 1 1 1 1 1 1 1 M 1 1 1 1 i I : M 1 1 ! 1 1 1 1 1 1 1 1 M M ! 1 1 M 1 1 M 1 1 11 1 1 1 1 M ! 1 1 1 1 1 1 

g2 i5 YLKEHLSAKGAKQFPENSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 

70 80 90 100 110 120 

110 120 130 140 150 160 

m2 15 pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 

IMIIIMIMII Ml Ill 1 1 1 1 M i i t ! 1 1 J 1 1 i 1 1 1 1 ! 1 1 1 1 1 1 1 

g2 i5 LTKTADGRRQAGKVETEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYNHKTGMLNFS 

130 140 150 160 170 180 

170 

m215 .pep S KVKATI YDTKDMX 

1 1 1 M • 1 1 1 1 1 1 1 

g 215 S KVKAAI YDTKDM 

190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 755>: 

a215.seq 

1 ATGAAAGTAA GATGGCGGTA CGGAATTGCG TTCCCATTGA TATTGGCGGT 

51 TGCCTTGGGC AGCCTGTCGG CATGGTTGGG ACGCATCAGC GAAGTCGAGA 

101 TTGAAGAAGT CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACGGA 

151 TTGGATGGCA GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG 

201 TTCGAAGGGC GCGAAACAGT TTCCCGAAAG CAGCGACATT CATTTCGACT 

251 CACCGCATCT CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC 

301 AGCGATGAAG CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA 

351 CAACGTTGTG CTGACCAAAA CCGCCGACGG CAAACGGCAG GCGGGTAAAG 

401 TTGAAGCCGA AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC 

4 51 GATACGCCTG TCAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG 

501 CATGACTTAC GACCACAAAA CAGGCATGTT GAACTTCTCA TCTAAAGTGA 

551 AAGCCACGAT TTATGATACA AAAGATATGT AA 

This corresponds to the amino acid sequence <SEQ ID 756; ORF 215.a>: 

a215 .pep 

1 MKVRWRYGIA FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSSKG AKQFPESSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNVV LTKTADGKRQ AGKVEAEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY DHKTGMLNFS SKVKATIYDT KDM* 

m2 1 5/a2 1 5 98.3% identity in 1 73 aa overlap 

10 20 30 40 

m2 i5 pep SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

I | | I I I I I I I I I I 1 I I I I I I I I I I I I I I I : I M I I I I I M 
a215 MKVRWRYGIAFPLILAVALGSLSAWLGRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 

10 20 30 40 50 60 

50 60 70 80 90 100 

m215.pep YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNVV 
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I I I I I I I : I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
a215 YLKEHLSSKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNVV 

70 80 90 100 110 120 

110 120 130 140 150 160 

m215 . pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I II I I I I I I I I I I I I I I I I I I I 
a215 LTKTADGKRQAGKVEAEKLHVDTESQYAQTDT PVSFQYGASHGQAGGMTYDHKTGMLNFS 

130 140 150 160 170 180 

170 

m2 15 . pep SKVKATIYDTKDMX 
I I I I I I I I.I I I I I I 
a215 SKVKATIYDTKDMX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 757>: 

g216 . seq (partial ) 

1 . . atgatatcga tttcgagctc ggtacccagc gacgaaatca ccgccatcat 

51 ccccgcactc aaacgcaaag acattaccct cgtctgcatc accgcccgcc 

101 ccgattcaac catggcgcgc catgccgata tccacatcac cgcatcggtt 

151 tcgcaagaag cctgcccgtt ggggcttgcc ccgaccacca gcaccaccgc 

201 cgttatggct ttgggcgacg cgttggcggt cgtcctgctg cgcgcccgcg 

251 cgttcacgcc cgacgacttc gccttgatcc accctgccgg cagcctcggc 

3 01 aaacgcctgc ttttgcgcgt tgccgacatt atgcacaaag gcggcggcct 
351 gcccgccgtc cgactcggca cgcccttgaa aggagccatc gtcagcatga 

4 01 gcgagaaagg tttgggcatg tgggcgggaa cggacgggca aaggctgtct 
451 gaaaggcctt tttactga 

This corresponds to the amino acid sequence <SEQ ID 758; ORF 216.ng>: 

g216.pep (partial) 

1 . .MISISSSVPS DEITAIIPAL KRKDITLVCI TARPDSTMAR HADIHITASV 

51 SQEACPLGLA PT TSTTAVMA LGDALAWLL RARAFTPDDF ALIHPAGSLG 

101 KRLLLRVADI MHKGGGLPAV RLGTPLKGAI VSMSEKGLGM WAGTDGQRLS 

151 ERPFY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 759>: 

m216 . seq 

1 ATGGCAATGG CAGAAAACGG AAAATATCTC GACTGGGCAC GCGAAGTGTT 

51 GCACGCCGAA GCGGAAGGCT TGCGCGAAAT TGCAGCGGAA TTGsACAAAA 

101 ACTTCGTCCT TGCGGCAGAC GCGTTGTTGC ACTGCAAGGG CAGGGTCGTT 

151 ATCACGGGCA TGGTCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

2 01 TATGGCCTCG ACCGGCACGC CTGCGTTTTT CGTCCACCCT GCGGAAGCGG 
251 CACACGgCGA TTTGGGTATG ATTGTGGACA rCGACGTGGT CGTCGCGATT 

3 01 TCCAATTCCG GCGAAAGCGA CGAAATCGCC GCCATCATCC CCGCACTCAA 

3 51 ACGCAAAGAC ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

4 01 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 
4 51 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TCATGGCTTT 
501 GGGCGATGCG TTGGCGGTCG TCCtGCTGCG CgcACGCGCG TTCACGCCCG 
551 ACGATTTCGC CTTGAGCCAT CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

6 01 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 
651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

7 01 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 
751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 
801 TCTTTCGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 
851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 
901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 
951 GCACGACCTG CTGGCGGCAC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 760; ORF 216>: 

m216 .pep 

1 MAMAENGKYL DWAREVLHAE AEGLREIAAE LXKNFVLAAD ALLHCKGRW 
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51 ITGMVKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDXDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVS KEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVAD I MHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

251 TDGDLRRLFQ ECDNFTGLS I DEVMHTHPKT ISAERLATEA LKVMQANHVN 

301 GLLVTDADGV LIGALNMHDL LAARIV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 216 shows 91.8% identity over a 147 aa overlap with a predicted ORF (ORF 216.ng) 
from K gonorrhoeae: 

m216/g216 

70 80 90 100 110 120 

TMASTGTPAFFVHPAEAAHGDLGMIVDXDVWAISNSGESDEIAAI IPAIjKRKDITLVCI 

:::|hl I I I h I I I I I I I M I I I I I M 
MISISSSVPSDEITAIIPALKRKDITLVCI 
10 20 30 

130 140 150 160 170 180 

TARPDSTMARHADIHITASVSK£ACPLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 

M I Ml II III Ml I illllhiMI MMI Mill I Ml III! M MM II I! II 

TARPDSTMARHADIHITASVSQEACPLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 
40 50 60 70 80 90 

190 200 210 220 230 240 

ALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVTDGQGRL 

1 1 M I IMIMM III IMMM Mill MUM II 1 1 1 M 1 1 1 i 1 1 | MM 

ALIHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKGAIVSMSEKGLGMWAGTDGQRLS 
100 110 120 130 140 150 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 761 >: 

a216.seq 

1 ATGGCGATGG CAGGAAACGA AAAATATCTT GATTGGGCAC GCGAAGTGTT 

51 GCACACCGAA GCGGAAGGCT TGCGCGAAAT TGCGGCGGAT TTGGACGAAA 

101 ACTTCGCCCT TGCGGCGGAC GCGTTGTTGC ACTGCAAAGG CAGGGTCGTT 

151 ATCACGGGCA TGGGCAAGTC GGG AC AT AT C GGGCGCAAAA TGGCGGCAAC 

201 CATGGCCTCG ACCGGCACGC CCGCGTTTTT CGTCCACCCT GCGGAAGCGG 

251 CACACGGCGA TTTGGGCATG ATTGTGGACA ACGACGTGGT CGTCGCGATT 

301 TCCAATTCCG GTGAAAGCGA CGAAATCGCC GCCATCATCC CCGCGCTCAA 

351 ACGCAAAGAT ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

4 01 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 

4 51 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TTATGGCTTT 

501 GGGCGATGCG TTGGCGGTTG TCCTGCTGCG CGCCCGCGCG TTCACGCCCG 

551 ACGACTTCGC CTTGAGCCAC CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

701 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 

751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 

801 TCTTTCGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 

851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 

901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 

951 GCACGACCTT TTGGCGGCGC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 762; ORF 216.a>: 

a216.pep 

1 MAMAGNEKYL DWAREVLHTE AEGLREIAAD LDENFALAAD ALLHCKGRW 

51 ITGMGKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDNDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

251 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 



m216 .pep 
g216 

m216 .pep 
9216 

m2 1 6 . pep 
g216 
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151 DHXTGMLNFS SKVKATIYDT KDM* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 215 shows 96.0% identity over a 173 aa overlap with a predicted ORF (ORF 215.ng) 
from N. gonorrhoeae: 

m215/g215 



m215 .pep 
g215 



10 20 30 40 

SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

I II Ml 1 1 i I ! 1 1 M 1 1 1 1 M M 1 1 M MH 1 1 1 1 1 1 1 M 

MKVRWRYGI AF PL I LAVALGS LS AWLGRI S EVE I E E VRLNPDE PQ YTMDGLDGRRFDEQG 
10 20 30 40 50 60 



50 60 70 80 90 100 

m215 pep YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 

,| M Mill IIIMIhlllllllllllllllMMI II II M I III I IMM Ml I III 

g2 i5 YLKEHLSAKGAKQFPENSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 

70 80 90 100 110 120 

110 120 130 140 150 160 

m2 15 Dep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 

MM MMIMMMMillMI! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h I M 1 1 1 1 1 

g2 1 5 LTKTADGRRQAGKVETEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYNHKTGMLNFS 

130 140 150 160 170 180 

170 

m215 .pep SKVKATIYDTKDMX 

II IMIMIMI 

g 215 S KVKAAI YDTKDM 

190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 755>: 



a215 . seq 








1 


ATGAAAGTAA 


GATGGCGGTA 


CGGAATTGCG 


51 


TGCCTTGGGC 


AGCCTGTCGG 


CATGGTTGGG 


101 


TTGAAGAAGT 


CAGGCTCAAT 


CCCGACGAAC 


151 


TTGGATGGCA 


GGCGGTTTGA 


CGAACAGGGA 


201 


TTCGAAGGGC 


GCGAAACAGT 


TTCCCGAAAG 


251 


CACCGCATCT 


CGTGTTCTTC 


CAAGAAGGCA 


301 


AGCGATGAAG 


CCGTTTACCA 


TACCGAAAAC 


351 


CAACGTTGTG 


CTGACCAAAA 


CCGCCGACGG 


401 


TTGAAGCCGA 


AAAGCTGCAC 


GTCGATACCG 


451 


GATACGCCTG 


TCAGTTTCCA 


ATATGGTGCA 


501 


CATGACTTAC 


GACCACAAAA 


CAGGCATGTT 


551 


AAGCCACGAT 


TTATGATACA 


AAAGATATGT 



GGTTGTTGTA CGAAGTCGGC 
AAACAGGTTC TTTTTAAAAA 
CAAACGGCAG GCGGGTAAAG 
AATCTCAATA TGCCCAAACC 



This corresponds to the amino acid sequence <SEQ ID 756; ORF 215.a>: 

a215.pep 

1 MKVRWRYGI A FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSSKG AKQFPESSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNVV LTKTADGKRQ AGKVEAEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY DHKTGMLNFS SKVKATIYDT KDM* 

m2 1 5/a2 15 98.3% identity in 1 73 aa overlap 

10 20 30 40 

m215 pep SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

! M 1 I M M II I I I I I I M I I I I M I M l:M I I I I M I I 
a215 MKVRWRYGIAFPLILAVALGSLSAWLGRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 

10 20 30 4 0 50 60 

50 60 70 80 90 100 

m215.pep YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNVV 
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I M I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I II I I I I I I I I I I I I II I 
a215 YLKEHLSSKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNVV 

70 80 90 100 110 120 

110 120 130 140 150 160 

m215 . pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 
I ! I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I 

a215 LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHKTGMLNFS 

130 140 150 160 170 180 

170 

m2 1 5 . pep SKVKAT I YDTKDMX 

I I I I I I I I I I I I I I 
a215 SKVKAT I YDTKDMX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 757>: 

g216.seq (partial) 

1 . .atgatatcga tttcgagctc ggtacccagc gacgaaatca ccgccatcat 
51 ccccgcactc aaacgcaaag acattaccct cgtctgcatc accgcccgcc 
101 ccgattcaac catggcgcgc catgccgata tccacatcac cgcatcggtt 
151 tcgcaagaag cctgcccgtt ggggcttgcc ccgaccacca gcaccaccgc 
201 cgttatggct ttgggcgacg cgttggcggt cgtcctgctg cgcgcccgcg 
251 cgttcacgcc cgacgacttc gccttgatcc accctgccgg cagcctcggc 
3 01 aaacgcctgc ttttgcgcgt tgccgacatt atgcacaaag gcggcggcct 

3 51 gcccgccgtc cgactcggca cgcccttgaa aggagccatc gtcagcatga 

4 01 gcgagaaagg tttgggcatg tgggcgggaa cggacgggca aaggctgtct 
451 gaaaggcctt tttactga 

This corresponds to the amino acid sequence <SEQ ID 758; ORP 216.ng>: 

g216.pep (partial) 

1 ..MISISSSVPS DEITAIIPAL KRKDITLVCI TARPDSTMAR HADIHITASV 
51 SQEACPLGLA PT TSTTAVMA LGDALAWLL RARAFTPDDF ALIHPAGSLG 
101 KRLLLRVADI MHKGGGLPAV RLGTPLKGAI VSMSEKGLGM WAGTDGQRLS 
151 ERPFY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 759>: 

m216 . seq 

1 ATGGCAATGG CAGAAAACGG AAAATATCTC GACTGGGCAC GCGAAGTGTT 

51 GCACGCCGAA GCGGAAGGCT TGCGCGAAAT TGCAGCGGAA TTGsACAAAA 

101 ACTTCGTCCT TGCGGCAGAC GCGTTGTTGC ACTGCAAGGG CAGGGTCGTT 

151 ATCACGGGCA TGGTCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

2 01 TATGGCCTCG ACCGGCACGC CTGCGTTTTT CGTCCACCCT GCGGAAGCGG 
251 CACACGgCGA TTTGGGTATG ATTGTGGACA rCGACGTGGT CGTCGCGATT 
301 TCCAATTCCG GCGAAAGCGA CGAAATCGCC GCCATCATCC CCGCACTCAA 

3 51 ACGCAAAGAC ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

4 01 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 
4 51 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TCATGGCTTT 
501 GGGCGATGCG TTGGCGGTCG TCCtGCTGCG CgcACGCGCG TTCACGCCCG 
551 ACGATTTCGC CTTGAGCCAT CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 
601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

6 51 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

7 01 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 
751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 
801 TCTTTCGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 
851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 
901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 
951 GCACGACCTG CTGGCGGCAC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 760; ORF 216>: 

m216 .pep 

1 MAMAENGKYL DWAREVLHAE AEGLREIAAE LXKNFVLAAD ALLHCKGRW 
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51 ITGMVKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDXDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

2 51 TDGDLRRLFQ ECDNFTGLS I DEVMHTHPKT ISAERLATEA LKVMQANHVN 

3 01 GLLVTDADGV LIGALNMHDL LAARIV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 216 shows 91.8% identity over a 147 aa overlap with a predicted ORF (ORF 216.ng) 
from N. gonorrhoeae: 

m216/g216 



70 80 90 100 110 120 

m216 . pep TMASTGTPAFFVHPAEAAHGDLGMIVDXDVVVAISNSGESDE I AAI I PALKRKDITLVCI 

:::|hl I I I I : II I I I I I I I I I I I I I I 
g216 MISISSSVPSDEITAIIPALKRKDITLVCI 

10 20 30 



130 140 150 160 170 180 

m216 .pep TARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 

M II II II I II II I II I II I h I !M I ! 1 1 1 1 1 i 1 1 1 1 1 ! 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

g216 TARPDSTMARHADIHITASVSQEACPLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 

40 50 60 70 80 90 



190 200 210 220 230 240 

m216.pep ALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVTDGQGRL 

II I II I II 1 1 II I II I II I Ml 1 1 1 1 1 1 1 ! 1 1 1 1 II II MINIMI I I 1 1 1 1 

g216 ALIHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKGAIVSMSEKGLGMWAGTDGQRLS 

100 110 120 130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 76 1>: 

a216.seq 

1 ATGGCGATGG CAGGAAACGA AAAATATCTT GATTGGGCAC GCGAAGTGTT 

51 GCACACCGAA GCGGAAGGCT TGCGCGAAAT TGCGGCGGAT TTGGACGAAA 

101 ACTTCGCCCT TGCGGCGGAC GCGTTGTTGC ACTGCAAAGG CAGGGTCGTT 

151 ATCACGGGCA TGGGCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

201 CATGGCCTCG ACCGGCACGC CCGCGTTTTT CGTCCACCCT GCGGAAGCGG 

251 CACACGGCGA TTTGGGCATG ATTGTGGACA ACGACGTGGT CGTCGCGATT 

301 TCCAATTCCG GTGAAAGCGA CGAAATCGCC GCCATCATCC CCGCGCTCAA 

351 ACGCAAAGAT ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

4 01 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 

4 51 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TTATGGCTTT 

501 GGGCGATGCG TTGGCGGTTG TCCTGCTGCG CGCCCGCGCG TTCACGCCCG 

551 ACGACTTCGC CTTGAGCCAC CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

701 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 

751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 

801 TCTTTCGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 

851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 

901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 

951 GCACGACCTT TTGGCGGCGC GGATTGTATA G 



This corresponds to the amino acid sequence <SEQ ID 762; ORF 216.a>: 

a216 .pep 

1 MAMAGNEKYL DWAREVLHTE AEGLREIAAD LDENFALAAD ALLHCKGRW 

51 ITGMGKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDNDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

251 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 
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301 GLLVTDADGV LIGALNMHDL LAARIV* 

m2 1 6/a2 1 6 97.2% identity in 326 aa overlap 

10 20 30 40 50 60 

m21 6 . pep MAMAENGKYLDWAREVLHAEAEGLREIAAELXKNFVLAADALLHCKGRVVITGMVKSGHI 
Mil ! I I I I I I I I I II : I I I I I I I I I 1 : I : I I : I I I I I I I I I II I I I I I I I I I I I I 
a216 MAMAGNEKYLDWAREVLHTEAEGLREIAADLDENFALAADALLHCKGRWITGMGKSGHI 

10 20 30 40 50 60 



70 80 90 100 110 120 

m216.pep GRKMAATMASTGTPAFFVHPAEAAHGDLGMIVDXDVWAI SNSGESDE I AAI IPALKRKD 

I I I I I I I I I I I I 1 I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a216 GRKMAATMASTGT PAFFVHPAEAAHGDLGMI VDNDVWAI SNSGE S DE I AAI I PALKRKD 

70 80 90 100 110 120 

130 140 150 160 170 180 

m216.pep ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAVVLLRARA 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a216 ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARA 

130 140 150 160 170 180 



190 200 210 220 230 240 

m21 6 . pep FTPDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I i I I II I I I I i I I I I 
a216 FTPDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVT 

190 200 210 220 230 240 



250 260 270 280 290 300 

m216.pep DGQGRLKGVFTDGDLRRLFQECDNFTGLSIDEVMHTHPKTISAERLATEALKVMQANHVN 
I I 1 I I I I I I I I I I I I i I I I I I I I I I ) I I I I I 1 I I I I II I I I I 1 I I I I I I I I I I I I I I II I 
a216 DGQGRLKGVFTDGDLRRLFQECDNFTGLSIDEVMHTHPKTISAERLATEALKVMQANHVN 

250 260 270 280 290 300 



310 320 
m2 1 6 . pep GLLVTDADGVLIGALNMHDLLAARIVX 
I I I I I II I I I II I I I I i I I II I II I II 
a216 GLLVTDADGVLIGALNMHDLLAARIVX 

310 320 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 763>: 

g217 . seq 

1 atggcggatg acggtttgtt gcggcaactg tccgaaaaac ccagccaaag 

51 tgctctcttc ctgccatttg acccattcgt tttcgaggtt ttggactgcc 

101 ttttggtcat cgggcccggc ttgaaacaat gtttcaagca aatcccggca 

151 acgcgccacc cattcgccga ccgtcgcagg ttgccgccat atccgggcaa 

2 01 tatccgacag ggtttcgagg aaggcggcaa aacgtccgaa catggcggtt 

2 51 tgattcacgt cggcatacca cgcgctgaca tcctgccaca tcgggttgcc 

301 gccttcgggc agcatccagc ccaatatcat acggtctgcc gcctgcttcc 

351 aggtaaacag ctgatccgtg ccgccgcgca tttctccgtc caatccccaa 

401 tggacgttca aatcggcaac catatcgtgc aaaagcggca aatcgtcccc 

451 ggtcagtccg aaacggcgca acacgggcgc ggtttccaaa agcgcgagca 

501 ctttgccgac ttcaaaacgg ctttccagca agtcggacac gcactccaac 

551 gcataaaaaa acggttgccg gcggctgatt ttcacgtccg aaacggaata 

601 cggcaatgcc tgcgcgccgg gttgcgcctg tccgaacacg gcttccataa 

651 aaggcgtata gggttcgata ttcggggtta a 

This corresponds to the amino acid sequence <SEQ ID 764; ORF 217.ng>: 

g217.pep. . 

1 MADDGLLRQL SEKPSQSALF LPFDPFVFEV LDCLLVI GPG LKQCFKQIPA 

51 TRHPFADRRR LPPYPGNIRQ GFEEGGKTSE HGGLIHVGIP RADILPHRVA 

101 AFGQHPAQYH TVCRLLPGKQ LIRAAAHFSV QSPMDVQIGN HIVQKRQIVP 

151 GQSETAQHGR GFQKREHFAD FKTAFQQVGH ALQRIKKRLP AADFHVRNGI 
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201 RQCLRAGLRL SEHGFHKRRI GFDIRG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 765>: 

m217.seq 

1 ATGGCGGATG ACGGTGTGCG GCGGCAACTG TCCGGAAAAT TGCGCCAATT 

51 CGGTTTCCGC CTrCCATTTG ACCCATTCGT TTTCAAGGTT TTGGACTGAC 

101 TTTTGGTCAT CGGCTTCAGC TTGGAACAAT GTTTCAAGCA AATCCCGGCA 

151 ACGCGCCACC CATTCGCCGA CCGTTGCGGG CTGCCGCCAT ATCCGTACAA 

201 TATCCGTCAG GGTTTCGAGG AAGGCGGCAA AACGTCCGAA CATGGCGGTT 

251 TGATTCACGT CGGCATACCA CGCGCTGACA TCCTGCCACA TCGGATTGCC 

301 GCCTTTGGGC AGCATCCAGC CCAATATCAT GCGTTCTACC GCCTGCTTCC 

351 AGGTGAACAG CTGATCCGTG CCGCCGCGCA TTTCTCCGTC CAAACCCCAG 

401 TGGACGTTCA AATCGGCAAC CATGTCGTGC AAAAGCGGTA AATCGTCCTC 

4 51 AGTCAGTCCG AAACGGCGCA ACACGGGCGC GGTTTCTAAA AGCACAAGCA 

501 CTTTATCGAC TTCAAATCGG CTTTCCAACA AGTCGAACAG GCATGACAAA 

551 G C ATGAAAC A GCGGTTGGCG GCGGCTGATT TTCACGTCTG ACACGGAATA 

601 CGGCAATGCC TGCGCACCgG GctGCGCCTG TCCGAACACG GCTTCGATAA 

651 AAGGCGTATA GGATTCGATA TTCGGGGTTA A 

This corresponds to the amino acid sequence <SEQ ID 766; ORF 217>: 

m217 .pep 

1 MADDGVRRQL SGKLRQFGFR LPFDPFVFKV LDXLLVI GFS LEQCFKQIPA 

51 TRHPFADRCG LPPYPYNIRQ GFEEGGKTSE HGGLIHVGIP RADILPHRIA 

101 AFGQHPAQYH AFYRLLPGEQ LIRAAAHFSV QTPVDVQIGN HWQKRXIVL 

151 SQSETAQHGR GFXKHKHFID FKSAFQQVEQ AXQSMKQRLA AADFHVXHGI 

201 RQCLRTGLRL SEHGFDKRRI GFDIRG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 217 shows 80.5% identity over a 226 aa overlap with a predicted ORF (ORF 217.ng) 
from N. gonorrhoeae: 

m217/g217 

10 20 30 40 50 60 

MADDGVRRQLSGKLRQFGFRLPFDPFVFKVLDXLLVIGFSLEQCFKQIPATRHPFADRCG 

INN: I I I I I I IMMMMIM Mill : h I I I I I I I I I I I I I II I 

MADDGLLRQLSEKPSQSALFLPFDPFVFEVLDCLLVIGPGLKQCFKQIPATRHPFADRRR 
10 20 30 40 50 60 

70 80 90 100 110 120 

LPPYPYNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRIAAFGQHPAQYHAFYRLLPGEQ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I h I I I I I I I I I I h I I II I : I 
LPPYPGNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRVAAFGQHPAQYHTVCRLLPGKQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

LIRAAAHFSVQTPVDVQIGNHWQKRXIVLSQSETAQHGRGFXKHKHFIDFKSAFQQVEQ 

I ; I I I III Mhhl Ml II I HI! I II MUM HIM I - I I llhlllll : 
LIRAAAHFSVQSPMDVQIGNHIVQKRQIVPGQSETAQHGRGFQKREHFADFKTAFQQVGH 
130 140 150 160 170 180 

190 200 210 220 

AXQSMKQRLAAADFHVXHGI RQCLRTGLRLS EHGFDKRR IGFDI RGX 

I I : 1 : 1 1 llllll MIMMIMMMIIM MMIIIMI 

ALQRIKKRLPAADFHVRNGIRQCLRAGLRLSEHGFHKRRIGFDIRG 
190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 767>: 

a217 . seq 

1 GTGGCGGATG ACGGTGTGCA GCGGCAACTG TCCGGAAAAT TGCGCCAATT 

51 CGGTTTCCGC CTGCCATTTG ACCCATTCGT TTTCGAGGCT TTGGACTGCC 

101 TTTTGGTCAT CGCCTTCGAC TTGGAACAAT GTTTCAAGCA AATCCCGGCA 

151 ACGCGCCACC CATTCGTCAA CCGTCGCAGG TTGCCGCCAT ATCCGTACAA 



m217 .pep 
g217 

m217 .pep 
g217 

m217 .pep 
g217 

m217 .pep 
g217 
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201 TATCCGTCAG GGTTTCGAGG AAGGCGGCAA AACGTCCGAA CAGGGCGGTT 

251 TGGTTCACGT CGGCATACCA CGCGCTGACC CCCTGCCACA TCGGATTGCC 

301 GCCTTCGGGC AGCATCCAGC CCAATATCAT GCGTTCTACC GCCTGCTTCC 

351 AGGTGAACAG CTGATCCGTG CCGCCGCGCA TTTCTCCGTC CAAACCCCAG 

4 01 CGGACGTTCA AATCGGCAAC CATGTCGTGC AAAAGCGGCA AATCGTCCTC 

4 51 AGTCAGTCCG AAATGGCGCA ACACGGGCGC GGTTTCTAAA AGCACAAGCA 

501 CTTTATCGAC TTCAAATCGG CTTTCCAACA AGTCGAACAG GCATGACAAA 

551 GC AT GAAAC A GCGGTTGTCG GCGGCTGATT TTCACATCCG AAACGGAATA 

601 CGGCAATGCC TGCGCGCCGG GCTGCGCCTG TCCGAACACG GCTTCGATAA 

651 AAGGCGTATA GGATTCGATA TTCGGGGTTA A 

This corresponds to the amino acid sequence <SEQ ID 768; ORF 217.a>: 

a217.pep 

1 VADDGVQRQL SGKLRQFGFR LPFD PFVFEA LDCLLVIAFD L EQCFKQIPA 

51 TRHPFVNRRR LPPYPYNIRQ GFEEGGKTSE QGGLVHVGIP RADPLPHRIA 

101 AFGQHPAQYH AFYRLLPGEQ LIRAAAHFSV QTPADVQIGN HVVQKRQIVL 

151 SQSEMAQHGR GF*KHKHFID FKSAFQQVEQ A*QSMKQRLS AADFHIRNGI 
201 RQCLRAGLRL SEHGFDKRRI GFDIRG* 

m217/a217 90.3% identity in 226 aa overlap 

10 20 30 40 50 60 

MADDGVRRQLSGKLRQFGFRLPFDPFVFKVLDXLLVIGFSLEQCFKQIPATRHPFADRCG 
: I I I I I : I I I I I I II I M I I I I I I I I I I : : I I I I i I : I : I I I I I I 1 I I I I I I I I :: I 
VADDGVQRQLSGKLRQFGFRLPFDPFVFEALDCLLVIAFDLEQCFKQIPATRHPFVNRRR 
10 20 30 40 50 60 

70 80 90 100 110 120 

LPPYPYNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRIAAFGQHPAQYHAFYRLLPGEQ 
I I 111 milltll I I I ! ! I : I ! I : I I I I I I I I I I j I I I I I i I I i I M ] I I I I I | | | | | 
LPPYPYNIRQGFEEGGKTSEQGGLVHVGIPRADPLPHRIAAFGQHPAQYHAFYRLLPGEQ 
70 80 90 100 110 120 

130 140 150 160 170 180 

LIRAAAHFSVQTPVDVQIGNHVVQKRXIVLSQSETAQHGRGFXKHKHFIDFKSAFQQVEQ 
I I I I I I I I I I I M : I I I I I I I I II I I I I I I II I I I I I I I t I I I I I t I I I I I II I I I I I 
LIRAAAHFSVQTPADVQIGNHVVQKRQIVLSQSEMAQHGRGFXKHKHFIDFKSAFQQVEQ 
130 140 150 160 170 180 

190 200 210 220 

AXQSMKQRLAAADFHVXHGI RQCLRTGLRLSEHGFDKRRI GFDI RGX 
I I I I I I I I I : I I I I I : : I I I I I I I : I I I I I I I II I I I I 1 I I I I I I I 
AXQSMKQRLSAADFHIRNGIRQCLRAGLRLSEHGFDKRRIGFDIRGX 
190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 769>: 

g218 . seq 

1 atggttgcgg tggatcctta tacggcaaaa gtggtcaaca ccatgccgcg 

51 caatcagggt tggtatcaca ctatggatga aatccacggc gatatgatgc 

101 tcggtgcggc aggcgattat cttttggaaa cggcagcttc actgaccatt 

151 attatggttg tcagcggctt gtacctttgg tgggcgaaac agcgcggcat 

201 taaagcgatg ctgctgccgc caaaaagcag ggcgcgttct tggtggcgga 

251 atctgcacgg cgcgtttgga acttgggtgt cgttgatttt actgttgttc 

3 01 tgcctgtcgg gtattgcttg ggcaggtatt tggggcggca aattcgtgca 

3 51 ggcttggaat cagttcccgg ccggcaaatg gggtgtcgaa ccgaaccccg 

401 tttcaatcgt gccgacccac ggcgaggtat tgaatgacgg caaggtztaag 

451 gaagtgccgt ggattttgga gcttatgcct atgcctgtct cagggacgac 

501 tgtgggtgaa aacggcatta accccaccga gcccaataac attggaaacc 

551 gtcgaccgtt tcgcgcggga aatcggtttc aaagggcgtt atcagttgaa 

601 tttgcccaaa ggcgaggacg gggtatggac tttgtcgcag gattctatga 

651 gttatga 

This corresponds to the amino acid sequence <SEQ ID 770; ORF 21 8.ng>: 

g218.pep 

1 MVAVDPYTAK WNTMPRNQG WYHTMDEIHG DMMLGAAGDY LLETAASLTI 
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m217 . pep 
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m217 .pep 
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m2 17 . pep 
a217 
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51 IMWSGL YLW WAKQRGIKAM LLPPKSRARS WWRNLHGAFG TW VSLILLLF 

101 CLSGIAWAGI WGGKFVQAWN QFPAGKWGVE PNPVSIVPTH GEVLNDGKVK 

151 EVPWILELMP MPVSGTTVGE NGINPTEPNN IGNRRPFRAG NRFQRALSVE 

201 FAQRRGRGMD FVAGFYEL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 77 1>: 



m218 . seq 












1 


ATGGTCGCGG 


TCGATCCTTA 


TACGGCAAAA 


GTGGTCAGTA 


CCATGCCGCG 


51 


CAATCAGGGT 


TGGTATTACA 


CGATGGATOA 




un 1±\L uii J. vjL- 


101 


TCGGTGCGGC 


AGGCGATTAT 


CTTTTGGAAA 


CGGCAGCTTC 


ACTGACCATT 


151 


ATTATGGTTG 


TCAGCGGCTT 


GTACCTTTGG 


TGGGTGAAAC 


GGCGCGGCAT 


201 


CAAGGCGATG 


CTGCTGCCGT 


CAAAAGGCAr 


GGCGCGTTCT 


TGGTGGCGGA 


251 


ATCTGCACGG 


CACGTTTGGA 


ACTTGGGTGT 


CGTTGATTTT 


GCTGTTGTTC 


301 


TGCCTGTCGG 


GTATTGCTTG 


GGCGGGTATT 


TGGGGCGGCA 


AGTTCGTACA 


351 


GGCTTGGAGT 


CAGTTCCCTG 


CCGGTAAATG 


GGGTGTCGAA 


CCGAACCCCG 


401 


TTTCAGTCGT 


GCCGACCCAC 


GGCGAGGTAT 


TGAATGACGG 


CAAGGTTAAG 


451 


GAAGTGCCGT 


GGGTTTTGGA 


GCTTACGCCT 


ATGCCTGTTT 


CAGGGACGaC 


501 


yGtgGGCAAA 


GACGGCATTA 


ACCCTGACGA 


GCCGATGACA 


TTGGAAACCG 


551 


TCGACCGCTT 


TGCGCGGnGA 


AATCGGTTTC 


AAAGGGCGTT 


ATCAGTTGAA 


601 


TTTGCCCAAA 


GGCGAGGACG 


GCGTATGGAC 


TTTGTCGCAG 


GATTCTATGA 


651 


GTTA 











This corresponds to the amino acid sequence <SEQ ID 772; ORF 218>: 

m218 .pep 

1 MVAVDPYTAK WSTMPRNQG WYYTMDEIHS DMMLGAAGDY LLETAASLTI 

51 IMWSGL YLW WVKRRGIKAM LLPSKGXARS WWRNLHGTFG TW VSLILLLF 

101 CLSGIAWAGI WGGKFVQAWS QFPAGKWGVE PNPVSWPTH GEVLNDGKVK 

151 EVPWVLELTP MPVSGTTVGK DGINPDEPMT LETVDRFARX NRFQRALSVE 

2 01 FAQRRGRRMD FVAGFYEL 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 218 shows 87.2% identity over a 218 aa overlap with a predicted ORF (ORF 218.ng) 
from N. gonorrhoeae: 

m218/g218 

10 20 30 40 50 60 

MVAVDPYTAKWSTMPRNQGWYYTMDE IHSDMMLGAAGDYLLETAASLTI IMWSGLYLW 

I II I M I I I! I hli I I IN | h|| | | | | :|| | | |i | M| | M II I II I I I I I I I I I II I 
MVAVDPYTAKWNTMPRNQGWYHTMDEIHGDMMLGAAGDYLLETAASLTI IMWSGLYLW 
10 20 30 40 50 60 

80 90 100 110 120 

WVKRRG I KAMLLPSKGXARSWWRNLHGTFGTWVS LI LLLFCLSGIAWAGI WGGKFVQAWS 

hhlllllllll h II I II II 1 1 1 : 1 M II 1 1 1 II II I II 1 1 1 1 1 1 1 1 M 1 1 M I h 

WAKQRG I KAMLLPPKSRARSWWRNLHGAFGTWVS LI LLLFCLSGIAWAGI WGGKFVQAWN 
70 80 90 100 110 120 

130 140 150 160 170 180 

QFPAGKWGVE PNPVSWPTHGEVLNDGKVKEVPWVLELTPMPVSGTTVGKDGINPDEPMT 

IN II III MM I II : II I III II I MINIM hi II I III II Ml MM 1 1 1 II : 

QFPAGKWGVE PNPVS I VPTHGEVLNDGKVKEVPW I LELMPMPVSGTTVGENGINPTEPNN 
130 140 150 160 170 180 

190 200 210 

LE TVDR FARXNR FQRALS VE FAQRRGRRMD FVAGF YE L 

= = I IIIIIIIIMIIIllll Ml III III I 

I GNRRPFRAGNRFQRALSVEFAQRRGRGMD FVAGFYEL 
190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 773>: 



m218 .pep 
g218 
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g218 
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g218 
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a218 . seq 

1 ATGGTCGCGG TCGATCCTTA TACGGCAAAA GTGGTCAGTA CCATGCCGCG 

51 CAATCAGGGT TGGTATTACG CGATGGATGA AATCCACAGC GATATGATGC 

101 TCGGTTCGAC AGGTGATTAT CTTTTGGAAA CGGCTGCATC GCTGACGATT 

151 ATCATGATAA TCAGCGGTTT GTACCTTTGG TGGGTGAAAC GGCGCGGCAT 

201 CAAGGCGATG CTGCTGCCGC CAAAAGGCAG GGCGCGTTCT TGGTGGCGGA 

251 ATCTGCACGG CGCGTTTGGA ACTTGGGTGT CGTTGATTTT ACTGTTGTTC 

301 TGCCTGTCGG GTATTGCTTG GGCAGGTATT TGGGGCGGCA AGTTCGTGCA 

351 GGCTTGGAGT CAGTTCCCGG CAGGCAAATG GGGTGTCGAA CCGAACCCTG 

4 01 TTTCAGTCGT GCCGACCCAC GGCGAGGTAT TGAATGACGG CAAGGTTAAG 

4 51 GAAGTGCCGT GGGTTTTGGA GCTTACGCCT ATGCCTGTTT CAGGGACGAC 

501 TGTGGGCAAA GACGGTATTA ACCCTGACGA GCCGATGACA TTGGAAACCG 

551 TCGACCGTTT TGCGCGG.GA AATCGGTTTC AAAGGGCGTT ATCAGCTGAA 

601 TTTGCCCAAA GGCGAGGACG GCGTATGGAC TTTGTCGCAG GATTCTATGA 

651 GTTA 

This corresponds to the amino acid sequence <SEQ ID 774; ORF 218.a>: 

a218.pep 

1 MVAVDPYTAK VVSTMPRNQG WYYAMDEIHS DMMLGSTGDY LLETAASLTI 

51 IMIISGL YLW WVKRRGIKAM LLP'PKGRARS WWRNLHGAFG TWV SLILLLF 

101 CLSGIAWAGI WGGKFVQAWS QFPAGKWGVE PNPVSVVPTH GEVLNDGKVK 

151 EVPWVLELTP MPVSGTTVGK DGINPDEPMT LETVDRFARX NRFQRALSAE 

201 FAQRRGRRMD FVAGFYEL 



ra218/a218 95.9% identity in 218 aa overlap 

10 20 30 40 50 60 

m2 1 8 . pep MVAVDPYTAKVVSTMPRNQGWYYTMDEIHSDMMLGAAGDYLLETAASLTI IMVVSGLYLW 

I I I I I I ! M I I I I I I I I I I I I I I : I I I I I I I I I f I :: I I I I I j I I I I I I M I :: I I I I I I 
a218 MVAVDPYTAKVVSTMPRNQGWYYAMDEIHSDMMLGSTGDYLLETAASLT I IMIISGLYLW 

10 20 30 40 50 60 



70 80 90 100 110 120 

m218.pep WVKRRGIKAMLLPSKGXARS WW RNLHGTFGTWVSLILLLFCLSGIAWAGI WGGKFVQAWS 

M I I I I I ! I I I I I I' I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I | I 
a218 WVKRRGIKAMLLPPKGRARSWWRNLHGAFGTWVSLILLLFCLSGIAWAGI WGGKFVQAWS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 18 . pep QFPAGKWGVEPNPVSVVPTHGEVLNDGKVKEVPWVLELTPMPVSGTTVGKDGINPDEPMT 
I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 1 I I I I I I I I I I ! I I I I I I 
a2 1 8 QFPAGKWGVEPNPVS VVPTHGEVLNDGKVKEVPWVLELT PMPVSGTTVGKDGINPDEPMT 

130 140 150 160 170 180 



190 200 210 

m218 .pep LETVDRFARXNRFQRALSVEFAQRRGRRMDFVAGFYEL 
I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
a218 LETVDRFARXNRFQRALSAE FAQRRGRRMD FVAGFYEL 

190 200 210 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 775>: 

g219.seq 

1 atgacggcaa ggttaaggaa gtgccgtgga ttttggagct tatgcctatg 

51 cctgtctcag ggacgactgt gggtgaaaac ggcattaacc ccaccgagcc 

101 caataacatt ggaaaccgtc gaccgtttcg cgcgggaaat cggtttcaaa 

151 gggcgttatc agttgaattt gcccaaaggc gaggacgggg tatggacttt 

201 gtcgcaggat tctatgagtt atgacatgat cagcccgttt gccgaccgca 

251 cggtacatat cgaccagtac agcggcgaga ttcttgccga catccgtttt 

3 01 gacgattaca acccgttcgg caaatttatg gcggcaagca ttgcgctgca 

351 tatggggact ttgggctggt ggagcgtgtt ggcgaacgtc gtgttctgcc 

401 ttgccgtgat ttttatcggc atcagcggct gcgtgatgtg gtggaaacgc 

451 cgtccgtccg gcgtggcggg cattgttcct ccggcgcaaa aaatcaaact 

501 gcccgtctgg tgggcgatgg cattgccgct gctgttgatt gcactgcttt 

551 tcccgaccgc gctgcttgcc attgccgtga tttggctgtt ggataccttg 

601 ctgctgtcgc ggattcctgt gttgaggaaa tggtttaaat ga 

This corresponds to the amino acid sequence <SEQ ID 776; ORF 219.ng>: 

g219 -pep 

1 MTARLRKCRG FWSLCLCLSQ GRLWVKTALT PPSPITLETV DRFAREIGFK 

51 GRYQLNLPKG EDGVWTLSQD SMSYDMISPF ADRTVHIDQY SGEILADIRF 

101 DDYNPFGKFM AASIALHMGT LGWWSVLAN V VFCLAVIFIG ISGCVM WWKR 

151 RPSGVAGIVP PAQKIKLPVW WAMALPL LLI ALLFPTALLA IAVI WLLDTL 
2 01 LLSRI PVLRK WFK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 777>: 



m219 . seq 












1 


ATGACGGCAA 


GGTTAAGGAA 


GTGCCGTGGG 


TTTTGGAGCT 


TACGCCTATG 


51 


CCTGTTTCAG 


GGACGaCyGt 


gGGCAAAGAC 


GGCATTAACC 


CTGACGAGCC 


101 


GATGACATTG 


GAAACCGTCG 


ACCGCTTTGC 


GCGGnGAAAT 


CGGTTTCAAA 


151 


GGGCGTTATC 


AGTTGAATTT 


GCCCAAAGGC 


GAGGACGGCG 


TATGGACTTT 


201 


GTCGCAGGAT 


TCTATGAGTT 


ACGACATGAT 


CAGCCCGTTT 


GCCGACCGCA 


251 


CGGTACATAT 


CGACCAGTAC 


AGCGGCAAAA 


TCCTTGCCGA 


CATCCGTTTT 


301 


GACGATTACA 


ACCCGTTCGG 


CAAATTTATG 


GCGGCAAGCA 


TTGCGCTGCA 


351 


TATGGGGACT 


CTGGGCTGGT 


GGAGCGTGTT 


GGCGAACGTC 


TTGTTCTGCC 


401 


TTGCCGTCAT 


TTTTATCGGT 


ATCAGCGGCT 


GCGTGATGTG 


GTGGAAACGC 


451 


CGTCCGACCG 


GAGCGGTGGG 


CATCGTTCCG 


CCGGCGCAGA 


AAGTCAAGCT 


501 


GCCGGTTTGG 


TGGATGATGG 


CATTGCCGCT 


ATTGGCAATC 


GCACTGCTCT 


551 


TCCCGACCTC 


ACTGCTTGCC 


ATTGCCGTGA 


TTTGGCTGTT 


GGATACGCTG 


601 


CTGTTGTCGC 


GGATTCCTGT 


TTTGAGGAGA 


TGGTTTAAAT 


GA 



This corresponds to the amino acid sequence <SEQ ID 778; ORF 219>: 

m219 .pep 

1 MTARLRKCRG FWSLRLCLFQ GRXWAKTALT LTSRXHWKPS TALRGEIGFK 

51 GRYQLNLPKG EDGVWTLSQD SMSYDMISPF ADRTVHIDQY SGKILADIRF 

101 DDYNPFGKFM AASIALHMGT LGWWSVLAN V LFCLAVIFIG ISGCVM WWKR 

151 RPTGAVGIVP PAQKVKLPVW WMMALPL LAI ALLFPTSLLA IAVI WLLDTL 

201 LLSRI PVLRR WFK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 219 shows 86.9% identity over a 213 aa overlap with a predicted ORF (ORF 219.ng) 
from N. gonorrhoeae: 

m219/g219 

m219.pep 
g219 



m219 .pep 
g219 



10 20 30 40 50 60 

MTARLRKCRGFWSLRLCLFQGRXWAKTALTLTSRXHWKPSTALRGEIGFKGRYQLNLPKG 

IIIIMIIMIMI III Ml hlllll I : : MINIMUM 

MTARLRKCRGFWSLCLCLSQGRLWVKTALTPPSPITLETVDRFAREIGFKGRYQLNLPKG 
10 20 30 40 50 60 

70 80 90 100 110 120 

EDGVWTLSQDSMSYDMI SPFADRTVHIDQYSGKILADIRFDDYNPFGKFMAAS IALHMGT 

I li I Ml II IMII MMIIIMII Mill Ihll IMIMIMII 1 1 IMMII 1 1 1 

EDGVWTLSQDSMSYDMISPFADRTVHIDQYSGEILADIRFDDYNPFGKFMAAS IALHMGT 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m219.pep LGWWSVLANVLFCI^VIFIGISGCV^^ 

M I i 1 1 1 M hi 1 1 ; 1 1 M I Ml MM I !| M H : :|| II I II h( 1 1 ! I ' MINI I 

g219 LGWWSVLANWFCLAVIFIGISGCVMWWKRRPSGVAGIVPPAQKIKL 

130 140 150 160 170 180 

190 200 210 

m2 1 9 . pep ALLFPTSLLAI AVI WLLDTLLLSRI PVLRRWFKX 

M II I hi I II 1 1 1 1 M Ml 1 1 M M M M M I 

g2 1 9 ALLFPTALLAI AVI WLLDTLLLSRI PVLRKWFK 

190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 779>: 

a219.seq 

1 ATGACGGCAA GGTTAAGGAA GTGCCGTGGG TTTTGGAGCT TACGCCTATG 

51 CCTGTTTCAG GGACGACTGT GGGCAAAGAC GGTATTAACC CTGACGAGCC 

101 GATGACATTG GAAACCGTCG ACCGTTTTGC GCGG . GAAAT CGGTTTCAAA 

151 GGGCGTTATC AGCTGAATTT GCCCAAAGGC GAGGACGGCG TATGGACTTT 

201 GTCGCAGGAT TCTATGAGTT ACGACATGAT CAGCCCGTTT GCTGACCGCA 

251 CGGTGCATAT CGACCAGTAC AGCGGCAAGA TTCTTGCCGA CATCCGTTTT 

301 GACGATTACA ACCCGTTCGG CAAATTTATG GCGGCAAGCA TTGCGCTGCA 

351 TATGGGGACT TTGGGCTGGT GGAGCGTGTT GGCGAACGTT TTGTTCTGCC 

4 01 TTGCCGTGAT TTTTATCGGC ATCAGCGGCT GCGTGATGTG GTGGAAACGC 

4 51 CGTCCGTCCG GCGCGGTGGG CATGGTTCCG CCGGCGCAAA AAATCAAGCT 

501 GCCCGTCTGG TGGGCAATGG CGGTGCCGCT GCTGCTGATT GCATTGCTTT 

551 TCCCGACCGC GTTGCTTGCC ATTGCCGTGA TTTGGCTGTT GGATACGCTG ' 

601 CTGTTGTCGC GGATTCCTGT TTTGAGGAGA TGGTTTAAAT GA 

This corresponds to the amino acid sequence <SEQ ID 780; ORF 219.a>: 

a219.pep 

1 MTARLRKCRG FWSLRLCLFQ GRLWAKTVLT LTSR*HWKPS TVLRXEIGFK 

51 GRYQLNLPKG EDGVWTLSQD SMSYDMISPF ADRTVHIDQY SGKILADIRF 

101 DDYNPFGKFM AASIALHMGT LGWWSVLAN V LFCLAVIFIG ISGCVM WWKR 

151 RPSGAVGMVP PAQKIKLPVW WAMAVPL LLI ALLFPTALLA IAVI WLLDTL 

201 LLSRIPVLRR WFK* "~ 

m2 1 9/a2 1 9 94.8% identity in 2 1 3 aa overlap 

10 20 30 40 50 60 

m2 1 9 . pep MTARLRKCRG FWSLRLCLFQGRXWAKTALTLTSRXHWKPSTALRGEIGFKGRYQLNLPKG 

I I I I I I I II I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : I I I I I M I I I I I I I I I ! 
a219 MTARLRKCRG FWSLRLCLFQGRLWAKTVLTLTSRXHWKPSTVLRXEIGFKGRYQLNLPKG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m219.pep EDGWTLSQDSMSYDMISPFADRTVHIDQYSGKILADIRFDDYNPFGKFMAASIALHMGT 

I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a219 E DGVWTLSQDSMS YDMI SPFADRTVHI DQYSGKILADIRFDDYNPFGKFMAAS I ALHMGT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m219 . pep LGWWSVLANVLFCLAVIFIGISGCVMWWKRRPTGAVGIVPPAQKVKLPVWWMMALPLLAI 
I I I I I I I I I I I I I I ! I i I I I I I I I I I I I I I I I : I I I I : I I I I I I : I I I I I I I I : I I I I 
a219 LGW WS VLANVL FCLAVI FI G I SGCVMWWKRR P S GAVGMVP PAQKI KL PVWWAMAV PLLL I 

130 140 150 160 170 180 

190 200 210 

m219 .pep ALLFPTSLLAI AVI WLLDTLLLSRI PVLRRWFKX 

I I I I I I : I I f I I I I I I I I I I I I I I I I I I I I I I I I 
a219 ALLFPTALLAI AVIWLLDTLLLSRI PVLRRWFKX 

190 200 210 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 78 1>: 

g221 . seq 



1 atgcacgacc acggcgccat ggatcgccgc ctccccgctt tcggaagtct 

51 gatgcggcga gccgtaaatc adatcgacgc tgacggattt gaaccctgcc 

101 tcacgggcgg catcgatgac ttctttggtt tcttcgtagc tttggatgcg 

151 gttgactgcc gcctgcactt tggggtcgaa atcctgaatg ccgacgctca 

201 tgcggttgaa gccgagtctg ccgagcatga ggacggtgtc gcggctgact 

251 ttgcgcgggt cgatttcgat ggaatattcg ccggacggta tcagttcgaa 

301 atgtttgcgg atcatgcgga agacacgttc gatctgttcg tcgctcaaaa 

351 aggtcggcgt gccgccgccg aagtgcagtt gggcaagctg gtgccgtccg 

401 ttcagatgtg gagcgagcag ttccatttct ttttcaagat attcgatgta 

451 ggtatcggcg cggcttttgt ctttggtgat gattttgttg cagccgcagt 

501 agtagcagat ggtgttgcaa aacggaatgt gaatgtaaag ggaaagcggt 

551 ttgtttaa 



This corresponds to the amino acid sequence <SEQ ID 782; ORF 221.ng>: 

g221.pep 



1 MHDHGAMDRR LPAFGSLMRR AVNXIDADGF EPCLTGGIDD FFGFFVALDA 

51 VDCRLHFGVE ILNADAHAVE AESAEHEDGV AADFARVDFD GIFAGRYQFE 

101 MFADHAEDTF DLFVAQKGRR AAAEVQLGKL VPSVQMWSEQ FHFFFKIFDV 

151 GIGAAFVFGD D FY AAA WAD GVAKRNVNVK GKRFV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 783>: 



1 ATGGyGGTTT TGATGcwcmg AAGTCTGGTG CGGCAGGCCG TAAATCAAAT 

51 CGACGCTGAC GGATTTGAAC CCCGCTTCGC GCGCCGCATC GATGACTTCT 

101 TTGGTTTCTT CGTAACTTTG GATGCGGTTG ACCGCCGCCT GCACTTTGGG 

151 GTCGAAATCC TGAATGCCGA TGCTCATGCG GTTGAAGCCG AGTCTGCCGA 

201 GCATGAGGAC GGTGTCGCGG CTGACTTTGC GCGGGTCGAT TTCGATGGAG 

2 51 TATTCGCCGG TGGGGATTAA CTCGAAATGT TTGCGTATCA TGCGGAAGAC 

3 01 ACGTTCGATC TGTTCGTCGC TCAAAAAGGt GCGTGCcCCG CCGAAGTGCA 
351 GTTGGGCAAG CTGGTGCCGT CCGTTCAGAT GTGGAGCGAG CAGTTCCATT 

4 01 TCTTTTTCAA GATATTCGAT GTAGGCATCG GCGCGGCTTT TGTCTTTGGT 
4 51 GATGATTTTG TTGCAGCCGC AGTAGTAGCA GATGGTGTTG CAGAACGGAA 
501 TGTGAATGTA AAGGGAAAGC GGTTTGTTTA A 



This corresponds to the amino acid sequence <SEQ ID 784; ORF 22 1>: 

m221 .pep 

1 MXVLMXRSLV RQAVNQIDAD GFEPRFARRI DDFFGFFVTL DAVDRRLHFG 



51 VE I LNADAHA VEAESAEHED GVAADFARVD FDGVFAGGDX LEMFAYHAED 
101 TFDLFVAQKG ACPAEVQLGK LVPSVQMWSE QFHFFFKIFD VGIGAAFVFG 
151 DP FVAAAW A DGVAERNVNV KGKRFV* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 221 shows 87.6% identity over a 170 aa overlap with a predicted ORF (ORF 221.ng) 
from N. gonorrhoeae: 

m221/g221 



m221 . seq 



m221 .pep 



10 20 30 40 50 

MX VLMXRS LVRQAVNQ I DADG FE PRF ARR I DD F FG F FVTLD AVDRRLHFGVE 



g221 




10 20 30 40 50 60 



m221 .pep 



60 70 80 90 100 110 

ILNADAHAVEAESAEHEDGVAADFARVDFDGVFAGGDXLEMFAYHAEDTFDLFVAQKGA- 



g221 




70 80 90 100 110 120 



120 130 140 150 160 170 
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m221 .pep 
g221 



m221.pep 
g221 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 785>: 

a221.seq 

1 ATGGTGGTTT TGATGCTCCG . AAGTCTGGTG CGGCAGGCCG TAAATCAAAT 

51 CGACGCTGAC GGATTTGAAC CCCGCTTCGC GCGCCGCATC GATGACTTCT 

101 TTGGTTTCTT CGTAACTTTG GATGCGGTTG ACCGCCGCCT GCACTTTGGG 

151 GTCGAAATCC TGAATGCCGA TGCTCATGCG GTTGAAGCCG AGTCTGCCGA 

201 GCATGAGGAC GGTGTCGCGG CTGACTTTGC GCGGGTCGAT TTCGATGGAG 

251 TATTCGCCGG TGGGGATTAA CTCGAAATGT TTGCGTATCA TGCGGAAGAC 

301 ACGTTCGATT TGGTCGTCGC TCAAAAAGGT CGGCGTGCCG CCGCCGAAGT 

351 GCAGTTGGGC AAGCTGGTGC CGTCCGTTCA GATGTGGAGC GAGCAGTTCC 

401 ATTTCTTTTT CAAGAAATTC GATGTAGGCA TCGGCGCGGC TTTTGTCTTT 

4 51 GGTGATGATT TTGTTGCAGC CGCAGTAGTA GCAGATGGTG TTGCAGAACG 

501 GAATGTGAAT GTAAAGGGAA AGCGGTTTGT TTAA 

This corresponds to the amino acid sequence <SEQ ID 786; ORF 221.a>: 

a221.pep 

1 MVVLMLRSLV RQAVNQIDAD GFEPRFARRI DDFFGFFVTL DAVDRRLHFG 

51 VEILNADAHA VEAESAEHED GVAADFARVD FDGVFAGGD* LEMFAYHAED 

101 TFDLVVAQKG RRAAAEVQLG KLVPSVQMWS EQFHFFFKKF DVGIGAAFVF 

151 GDDFVAAAV V ADGVAERNVN VKGKRFV* 

m221/a221 95.5% identity in 177 aa overlap 

10 20 30 40 50 60 

MXVLMXRSLVRQAVNQIDADGFEPRFARRIDDFFGFFVTLDAVDRRLHFGVEILNADAHA 
I Ml I ! I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
MVVLMLRSLVRQAVNQIDADGFEPRFARRIDDFFGFFVTLDAVDRRLHFGVEILNADAHA 
10 20 30 40 50 . 60 

70 80 90 100 110 119 

VEAESAEHEDGVAADFARVDFDGVFAGGDXLEMFAYHAEDTFDLFVAQKGA-CPAEVQLG 

I I i I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MUM 
VEAESAEHEDGVAADFARVDFDGVFAGGDXLEMFAYHAEDTFDLWAQKGRRAAAEVQLG 

70 80 90 100 110 120 

120 130 140 150 160 170 

KLVPSVQMWSEQFHFFFKIFDVGIGAAFVFGDDFVAAAWADGVAERNVNVKGKRFVX 

II I I I I M I M I M I I I I I M II I I I II I I I M i I I I I II M II I M M M M M I I 
KLVPSVQMWSEQFHFFFKKFDVGIGAAFVFGDDFVAAAWADGVAERNVNVKGKRFVX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 787>: 

g223.seq 

1 atggaattca ggcaccaggt agtggtagtt ggtgtcgaac catttggtca 

51 tttcgatggc gaattggtct ttgttgccgc gcgccagttg gaagaattgt 

101 tccaaaggca ggttttggct atcgaagccg aaacgggcgg gaatcgcgcc 

151 cgtggatact tgcaggtcga ggatgtgatg gtagaaagtg aaatcacgta 

201 cagcaacgta atcagcgtta ggagcagctt ggtgtttcca gtttttctcg 

251 cgcaggtctt tggcaacgtc gagcagctct tgttcactga tctctttgcg 

301 ccagtatttt tcttgggcga atttcaattc acggaaggcg ccgacacgcg 

351 ggaagcctga 

This corresponds to the amino acid sequence <SEQ ID 788; ORF 223.ng>: 

g223.pep. . 



CPAEVQLGKLVPSVQMWSEQFHFFFKIFDVGIGAAFVFGDDFVAAAWADGVAERNVNVK 

M 1 1 III Ml II II II MINI IN II II I II II Mi I III M II 1 1 1 Mh: I MM 

AAAEVQLGKLVPSVQMWSEQFHFFFKIFDVGIGAAFVFGDDFVAAAWADGVAKRNVNVK 
130 140 150 160 170 180 



GKRFVX 

MINI 
GKRFVX 



m221.pep 
a221 

m221 .pep 
a221 

m221.pep 
a221 
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1 MEFRHQWW GVEPFGHFDG ELVFVAARQL EELFQRQVLA IEAETGGNRA 
51 RGYLQVEDVM VESEITYSN V ISVRSSLVFF VFLAQV FGNV EQLLFTDLFA 
101 PVFFLGEFQF TEGADTREA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 789>: 

m223. seq 

1 GTGGAATTCA GGCACCAAGT AGTGGTAGTT GGTGTCGAAC CATTTGGTCA 

51 TTTCGATAGC GAATTGGTCT TTGTTACCGC GCGCCAGTTG GAAGAATTGT 

101 TCCAAAGACA GGTTTTGGCT GTCGAAGCCG AAGCGGGCGG GAATCGCGCC 

151 GGTGGCGACT TGCAGGTCGA GGATGTGGTC GTAGAAAGTG AAATCsCTAC 

201 GGCAACGAAA TCGGCGTTGG CAGCGACCTG GTGTTTCCAG TTTTTCTCGC 

. 251 GCAAGTCTTT AGCAACAGCC AGCAATTCTT GCTCGCTGAT TTCTTTGCGC 

301 CAGTATTTTT CTTGTGCGAA TTTCAATTCG CGGAAGGCGC CGACACGCGG 

351 GAAGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 790; ORF 223>: 

m223.pep 

1 VEFRHQWW GVEPFGHFDS ELVFVTARQL EELFQRQVLA VEAEAGGNRA 

51 GG DLQVEDW VESEIXYGNE IGVGSDLVFP VFLAQV FSNS QQFLLADFFA 

101 PVFFLCEFQF AEGADTREA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 223 shows 80.7% identity over a 1 19 aa overlap with a predicted ORF (ORF 223.ng) 
from N. gonorrhoeae: 

m223/g223 

10 20 30 40 50 60 

m223 . pep VEFRHQWWGVEPFGHFDSELVFVTARQLEELFQRQVLAVEAEAGGNRAGGDLQVEDVV 
: I I I I I I I I I I I I I i I t I I : I I I i I : I I I I I I I I I I I I 1 I : I I I : I I I I I I I I I I I I : 
g223 MEFRHQVVWGVEPFGHFDGELVFVAARQLEELFQRQVLAIEAETGGNRARGYLQVEDVM 

10 20 30 40 50 60 



70 80 90 100 110 120 

m223 . pep VESEIXYGNE IGVGSDLVFPVFLAQVFSNSQQFLLADFFAPVFFLCEFQFAEGADTREAX 

11111:1:1 1:1 I : I I I I I I I I I I I : I : I : I : : I : I I I I I I I I i II : I I I I ! t I I I 
g223 VESEITYSNVISVRSSLVFPVFLAQVFGNVEQLLFTDLFAPVFFLGEFQFTEGADTREAX 

70 80 90 . 100 110 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 791>: 

a223.seq 

1 GTGGAATTCA GGCACCAAGT AGTGGTAGTT GGTGTCGAAC CATTTGGTCA 

51 TTTCGATAGC GAATTGGTCT TTGTTACCGC GCGCCAGTTG GAAGAATTGT 

101 TCCAAAGATA GGTTTTGGCT GTCGAAGCCG AAGCGGGCGG GAATCGCGCC 

151 GGTGGCGACT TGCAGGTCGA GGATGTGGTC GTAGAAAGTG AAATCGCCTA 

201 CGGCAACGTA ATCGGCGTTG GCAGCGGCCT GGTGTTTCCA GTTTTTCTCG 

251 CGCAAGTCTT TAGCAACAGC CAGCAATTCT TGCTCGCTGA TTTCTTTGCG 

301 CCAGTATTTT TCTTGTGCGA ATTTCAATTC GCGGAAGGCA CCGACACGCG 

351 GGAAGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 792; ORF 223. a>: 

a223.pep 

1 VEFRHQWW GVEPFGHFDS ELVFVTARQL EELFQR*VLA VEAEAGGNRA 

51 GG DLQVEDW VESEIAYGN V IGVGSGLVFP VFLAQV FSNS QQFLLADFFA 

101 PVFFLCEFQF AEGTDTREA* 



m223/a223 95.8% identity in 1 19 aa overlap 

10 20 30 40 50 60 

m223 . pep VEFRHQVVVVGVEPFGHFDSELVFVTARQLEELFQRQVLAVEAEAGGNRAGGDLQVEDVV 

I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I i I I I I I 

a223 VEFRHQVVVVGVEPFGHFDSELVFVTARQLEELFQRXVLAVEAEAGGNRAGGDLQVEDVV 
10 20 30 40 50 60 



70 80 90 100 110 120 

m223.pep VESEIXYGNEIGVGSDLVFPVFLAQVFSNSQQFLLADFFAPVFFLCEFQFAEGADTREAX 
I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I | | 
a223 VESEIAYGNVIGVGSGLVFPVFLAQVFSNSQQFLLADFFAPVFFLCEFQFAEGTDTREAX 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 793>: 



y^/ij . bey 








a t*aaattctt 


ttttcaaacc 


51 
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201 


cggcgcgatg 


gggcttaacg 


251 


ccgcccggcg 


ggcgggcaat 


301 


cttttgggta 


ttgcctaccg 


351 


tgactgcagc 


ggattcatgc 


401 


acctgccgcg 


cacgtcggcg 


451 


cgaagcgaat 


tgcagcccgg 


501 


cagccgcatt 


tcccatgtcg 


551 


acgcgccgcg 


cacggggaaa 


601 


tattggagcg 


gcaaatatgc 


651 


gtcacgcttt 


ctgaactga 



This corresponds to the amino acid sequence <SEQ ID 794; ORF 225. ng>: 

g225 .pep 

1 MDSFFKPAVW AVLWLMFAVR PALA DELTNL LSSREQILRQ FAEDEQPVLP 

51 VNRAPARRAG NADELIGGAM GLNEQPWRV NRAXARRAGN ADKLIGSAMR 

101 LLGIAYRYGG TSVSTGFDCS GFMQHIFKRA MGINLPRTSA EQARMGAPVA 

151 RSELQPGDMV FFRTLGGSRI SHVGLYIGNN RFIHAPRTGK NIEITSLSHK 

201 YWSGKYAFAR RVKKNDPSRF LN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 795>: 

m225.seq (partial) 

1 . . TTTTCAAACC CGGCAGTTTG GGCGGTTTTG TGGCTGAwGT TTGCCGTCCG 

51 CCCCGCCCTT GCCGACGAGT TGACCAACCT GCTCAGCAGC CGCGAGCAGA 

101 TTCTCAGACA GTTTGCCGAA GACGAACAGC CCGTTTTACC CATCAACCGA 

151 GCCCCCGCCC GGCGGGCGGG CAATGCCGAC GAACTCATCG GCAGCGCGAT 

2 01 GGGGCTTAAC GAACAGCCCG TTTTACCCGT CAACCGAGTC CCCGCCCGGC 

2 51 GGGCGGGCAA TGCCGACGAA CTCATCGGCA ACGCGATGGG GCTTAACGAA 

3 01 CAGCCCGTTT TACCCGTCAA CCGAGCCCCC GCCCGGCGGG CGGGCAATGC 

3 51 CGACGAACTC ATCGGCAACG CGATGGGACT TTTGGGTATT GCCTACCGCT 

4 01 ACGGCGGCAC ATCGGTTTCT ACCGGTTTTG ACTGCAGCGG CTTCATGCAG 
4 51 CACATCTTCA AACGCGCCAT GGGCATCAAC CTGCCGCGCA CGTCGGCAGA 
501 ACAGGCACGG ATGGGTACGC CGGTTGCCCG AAGCGAATTG CAGCCCGGAG 
551 ATATGGTGTT TTTCCGCACG CTCGGCGGCA GCCGCATTTC CCATGTCGGA 
601 CTTTATATCG GCAACAACCG CTTCATCCAC GCGCCGCGCA CGGGGAAAAA 
651 TATCGAAATC ACCAGCCTGA GCCACAAATA TTGGAGCGGC AAATACGCGT 
701 TCGCCCGCCG GGTCAAGAAA AACGACCCGT CCCGCTTTCT GAACTGA 

This corresponds to the amino acid sequence <SEQ ID 796; ORF 225>: 

m225.pep (partial) 

1 . . FSNPAVWAVL WLXFAVRPAL AD ELTNLLSS REQILRQFAE DEQPVLPINR 

51 APARRAGNAD ELIGSAMGLN EQPVLPVNRV PARRAGNADE LIGNAMGLNE 

101 QPVLPVNRAP ARRAGNADEL IGNAMGLLGI AYRYGGTSVS TGFDCSGFMQ 

151 HIFKRAMGIN LPRTSAEQAR MGTPVARSEL QPGDMVFFRT LGGSRISHVG 

2 01 LYIGNNRFIH APRTGKNIEI TSLSHKYWSG KYAFARRVKK NDPSRFLN* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 225 shows 83.5% identity over a 248 aa overlap with a predicted ORF (ORF 225.ng) 
from K gonorrhoeae: 

m225/g225 

10 20 30 40 50 

m225 . pep FSNPAVWAVLWLXFAVRPALADELTNLLSSREQ I LRQ FAEDEQPVLP I NRAPARRAG 

I :|IMIIIM II I I I III i I I I I I I II I I I I i II Ml I I I I I IM I I I I il I ! 
g225 MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
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10 20 30 40 50 60 

60 70 80 90 100 110 

m2 2 5 . pep NADELIGSAMGLNEQPVLPWRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

IIIIIM 'MINIMI: MM MMMI 

g 2 2 5 NADELIG GAMGLNEQPWRVNRAXARRAGNA 

70 80 90 

120 130 140 150 160 170 

m225 . pep DELI GNAMGLLGI AYR YGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

I MMMI 1 1 1 M 1 1 M M M M M I M 1 1 1 1 1 1 1 1 1 M 1 1 M I M 1 1 M 1 1 1 1 M 1 1 1 

g2 2 5 DKLIGSAMRLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGAPVAR 
100 110 120 130 140 150 

180 190 200 210 220 230 

m225 . pep SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 

MMMMMMMMMMMMMMMIMMMMMMMMMIMMMMM 

9225 SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
160 170 180 190 200 210 



240 249 
m2 2 5 . pep VKKNDPSRFLNX 

MMMIMM 

g2 25 VKKNDPSRFLN 
220 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 797>: 

a225.seq 

1 ATGGATTCTT TTTTCAAACC GGCAGTTTGG GCGGTTTTGT GGCTGATGTT 

51 TGCCGTCCGC CCCGCCCTTG CCGACGAGTT GACCAACCTG CTCAGCAGCC 

101 GCGAGCAGAT TCTCAGACAG TTTGCCGAAG ACGAACAGCC CGTTTTACCC 

151 ATCAACCGAN CCCCCGCCCG GCGGGCGGGC AATGCCGACG AACTCATCGG 

201 CAGCGCGATG GGGCTTAACG AACAGCCCGT TTTACCCGTC AACCGANTCC 

251 CCGCCCGGCG GGCGGGCAAT GCCGACNAAC TCATCGGCAA CGCGATGGGG 

301 CTTAACGAAC AGCCCGTTTT ACCCGTCAAC CGAGTCCCCG CCCGGCGGGC 

351 GGGCAATGCC GACGAACTCA TCGGCAACGC GATGGGGCTT AACGAACAGC 

4 01 CCGTTTTACC CGTCAACCGA GCCCCCGCCC GGCGGGCGGG CAATGCCGAC 

4 51 GAACTCATCG GCAACGCGAT GGGACTTTTG GGTATTGCCT ACCGCTACGG 

501 CGGCACATCG ATTTCTACCG GTTTTGACTG CAGCGGCTTC ATGCAGCACA 

551 TCTTCAAACG CGCCATGGGC ATCAACCTGC CGCGCACGTC GGCAGAACAG 

601 GCGCGGATGG GTACGCCGGT TGCCCGAAGC GAATTGCAGC CCGGGGATAT 

651 GGTGTNTTTC CGCACGCTCG GCGGCAGCCG CATTTCCCAT GTCGGACTTT 

701 ATATCGGCAA CAACCGCTTC ATCCACGCGC CGCGCACGGG GAAAAATATC 

751 GAAATCACCA GCCTGAGCCA CAAATATTGG AGCGGCAAAT ACGCGTTCGC 

801 CCGCCGGGTC AAGAAAAACG ACCCGTCCCG CTTTCTGAAC TGA 

This corresponds to the amino acid sequence <SEQ ID 798; ORP 225. a>: 

a225.pep 

1 MDSFFKPAVW AVLWLMFAVR PALA DELTNL LSSREQILRQ FAEDEQPVLP 

51 INRXPARRAG NADELIGSAM GLNEQPVLPV NRXPARRAGN ADXLIGNAMG 

101 LNEQPVLPVN RVPARRAGNA DELIGNAMGL NEQPVLPVNR APARRAGNAD 

151 ELIGNAMGLL GIAYRYGGTS ISTGFDCSGF MQHIFKRAMG INLPRTSAEQ 

201 ARMGTPVARS ELQPGDMVXF RTLGGSRISH VGLYIGNNRF IHAPRTGKNI 

251 EITSLSHKYW SGKYAFARRV KKNDPSRFLN * 

m225/a225 87.4% identity in 277 aa overlap 

10 20 30 40 50 

m225 . pep FSNPAVWAVLWLXFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | I | | | | | MINI 
a225 MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQ I LRQFAEDEQPVLP INRXPARRAG 

10 20 30 40 50 60 



60 70 79 80 
m225 . pep NADELIGSAMGLNEQPVLPVNR VPARRAGNA 
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I I I I I I I I I I I I I i I I I I I I I I MINIMI 
a225 NADELIGSAMGLNEQPVLPVNRXPARRAGNADXLIGNAMGLNEQPVLPVNRVPARRAGNA 

70 80 90 100 110 120 

90 100 110 120 130 140 

m225 . pep DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 

I I ! I I I I I I I I M I i I I I I I I I I M i I I ! I I I I I I I I I I M I I I I I I I I I : I M I M I I I 
a225 DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 

130 140 150 160 170 180 

150 160 170 180 190 200 

m225.pep MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
I I I I I I I I I I I I I II I I I I I I I 11 I I I I I I I I I I I I I I II I I I I I I I I I M I I I M I I I 
a225 MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVXFRTLGGSRISHVGLYIGNNRF 

190 200 210 220 230 240 

210 220 230 240 249 

m225.pep IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
a225 IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 

250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 799>: 

g225-l.seq 

1 atggattctt ttttcaaacc ggcagtttgg gcggttttgt ggctgatgtt 

51 tgccgtccgc cccgcccttg ccgacgagtt gaccaacctg ctcagcagcc 

101 gcgagcagat tctcagacag tttgccgaag acgaacagcc cgttttaccc 

151 gtcaaccgag cccccgcccg gcgggcgggc aatgccgacg aactcatcgg 

201 cggcgcgatg gggcttaacg aacagcccgt tgtacgcgtc aaccgagccn 

251 ccgcccggcg ggcgggcaat gccgacaaac tcatcggcag cgcgatgcgg 

301 cttttgggta ttgcctaccg ctacggcggc acatcggtgt ctaccggttt 

351 tgactgcagc ggattcatgc agcacatctt caaacgcgcc atgggcatca 

4 01 acctgccgcg cacgtcggcg gaacaggcgc ggatgggcgc acccgttgcc 

4 51 cgaagcgaat tgcagcccgg ggatatggtg tttttccgca cgctcggcgg 

501 cagccgcatt tcccatgtcg gactttatat cggcaacaac cgcttcatcc 

551 acgcgccgcg cacggggaaa aatatcgaaa tcaccagcct gagccacaaa 

601 tattggagcg gcaaatatgc gttcgcccgc cgggtcaaga aaaacgaccc 

651 gtcacgcttt ctgaactga 

This corresponds to the amino acid sequence <SEQ ID 800; ORF 225-l.ng>: 

g225-l.pep 

1 MDSFFKPAVW AVLWLMFAVR PALAD ELTNL LSSREQILRQ FAEDEQPVLP 

51 VNRAPARRAG NADELIGGAM GLNEQPWRV NRAXARRAGN ADKLIGSAMR 

101 LLGIAYRYGG TSVSTGFDCS GFMQHIFKRA MGINLPRTSA EQARMGAPVA 

151 RSELQPGDMV FFRTLGGSRI SHVGLYIGNN RFIHAPRTGK NIEITSLSHK 

201 YWSGKYAFAR RVKKNDPSRF LN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 80 1>: 

m225-l.seq 

1 ATGGATTCTT TTTTCAAACC GGCAGTTTGG GCGGTTTTGT GGCTGATGTT 

51 TGCCGTCCGC CCCGCCCTTG CCGACGAGTT GACCAACyTG CTCAGCAGCC 

101 GCGAGCAGAT TCTCAGACAG TTTGCCGAAG ACGAACAGCC CGTTTTACCC 

151 ATCAACCGAG CCCCCGCCCG GCGGGCGGGC AATGCCGACG AACTCATCGG 

2 01 CAGCGCGATG GGGCTTAACG AACAGCCCGT TTTACCCGTC AACCGAGTCC 

251 CCGCCCGGCG GGCGGGCAAT GCCGACGAAC TCATCGGCAA CGCGATGGGG 

301 CTTAACGAAC AGCCCGTTTT ACCCGTCAAC CGAGCCCCCG CCCGGCGGGC 

351 GGGCAATGCC GACGAACTCA TCGGCAACGC GATGGGACTT TTGGGTATTG 

4 01 CCTACCGCTA CGGCGGCACA TCGGTTTCTA CCGGTTTTGA CTGCAGCGGC 

4 51 TTCATGCAGC ACATCTTCAA ACGCGCCATG GGCATCAACC TGCCGCGCAC 
501 GTCGGCAGAA CAGGCACGGA TGGGTACGCC GGTTGCCCGA AGCGAATTGC 

5 51 AGCCCGGAGA TATGGTGTTT TTCCGCACGC TCGGCGGCAG CCGCATTTCC 
601 CATGTCGGAC TTTATATCGG CAACAACCGC TTCATCCACG CGCCGCGCAC 
651 GGGGAAAAAT ATCGAAATCA CCAGCCTGAG CCACAAATAT TGGAGCGGCA 
7 01 AATACGCGTT CGCCCGCCGG GTCAAGAAAA ACGACCCGTC CCGCTTTCTG 
751 AACTGA 
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This corresponds to the amino acid sequence <SEQ ID 802; ORF 21 7>: 

m225-l.pep 

1 MDSFFKPAVW AVLWLM FAVR PAIAD ELTNL LSSREQILRQ FAEDEQPVLP 
51 INRAPARRAG NADELIGSAM GLNEQPVLPV NRVPARRAGN ADELIGNAMG 
101 LNEQPVLPVN RAPARRAGNA DELIGNAMGL LGIAYRYGGT SVSTGFDCSG 
151 FMQHIFKRAM GINLPRTSAE QARMGTPVAR SELQPGDMVF FRTLGGSRIS 
201 HVGLYIGNNR FIHAPRTGKN IEITSLSHKY WSGKYAFARR VKKNDPSRFL 
251 N* 

m225-l/g225-l 84.9% identity in 251 aa overlap 

10 20 30 40 50 60 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

I II Ml I lllllllll 1 1 !M I MM III MM INI I MM INI II I hlllM II II 

MDS FFKPA VWAVLWLMFAVRP ALADELTNLLS SREQI LRQFAEDEQ PVL PVNRAPARRAG 
10 20 30 40 50 60 

70 80 90 100 110 120 

NAD ELI G S AMGLNEQ P VL P VNRV PARRAGNADE L I GN AMG LNEQ P VL P VNRA P ARRAGNA 

MM M I M M I M M I • IMI MMMI 

NADE LIGGAMGLNEQPWRVNRAXARRAGNA 

70 80 90 

130 140 150 160 170 180 

DELIGNAMGLI^I AYR YGGTSVSTGFDCSGFMQHI FKRAMG INLPRTSAEQARMGTPVAR 

IMMMI M M M M M M M M M M I M M 1 1 M M I M M M M M I M I M M I 

DKL I GS AMRLLG I AYR YGGTS VS TG FD CSGFMQH I FKRAMG I NL PRT S AEQ ARMG AP VAR 
100 110 120 130 140 150 

190 200 210 220 230 240 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 

1 1 1 1! 1 1 1 1 ! 1 1 1 1 II 1 1 1 ! 1 1 1! f ! I M I [ 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 

SELQPGDMVF FRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
160 170 180 190 200 210 

250 

VKKNDPSRFLNX 

IIMIIMIMI 

VKKNDPSRFLNX 
220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 803>: 

a225-l .seq 

1 ATGGATTCTT TTTTCAAACC GGCAGTTTGG GCGGTTTTGT GGCTGATGTT 

51 TGCCGTCCGC CCCGCCCTTG CCGACGAGTT GACCAACCTG CTCAGCAGCC 

101 GCGAGCAGAT TCTCAGACAG TTTGCCGAAG ACGAACAGCC CGTTTTACCC 

151 ATCAACCGAN CCCCCGCCCG GCGGGCGGGC AATGCCGACG AACTCATCGG 

201 CAGCGCGATG GGGCTTAACG AACAGCCCGT TTTACCCGTC AACCGANTCC 

251 CCGCCCGGCG GGCGGGCAAT GCCGACNAAC TCATCGGCAA CGCGATGGGG 

301 CTTAACGAAC AGCCCGTTTT ACCCGTCAAC CGAGTCCCCG CCCGGCGGGC 

351 GGGCAATGCC GACGAACTCA TCGGCAACGC GATGGGGCTT AACGAACAGC 

401 CCGTTTTACC CGTCAACCGA GCCCCCGCCC GGCGGGCGGG CAATGCCGAC 

4 51 GAACTCATCG GCAACGCGAT GGGACTTTTG GGTATTGCCT ACCGCTACGG 

501 CGGCACATCG ATTTCTACCG GTTTTGACTG CAGCGGCTTC ATGCAGCACA 

551 TCTTCAAACG CGCCATGGGC ATCAACCTGC CGCGCACGTC GGCAGAACAG 

601 GCGCGGATGG GTACGCCGGT TGCCCGAAGC GAATTGCAGC CCGGGGATAT 

651 GGTGTNTTTC CGCACGCTCG GCGGCAGCCG CATTTCCCAT GTCGGACTTT 

701 ATATCGGCAA CAACCGCTTC ATCCACGCGC CGCGCACGGG GAAAAATATC 

751 GAAATCACCA GCCTGAGCCA CAAATATTGG AGCGGCAAAT ACGCGTTCGC 

801 CCGCCGGGTC AAGAAAAACG ACCCGTCCCG CTTTCTGAAC TGA 

This corresponds to the amino acid sequence <SEQ ID 804; ORF 225-1. a>: 

a225-l.pep 

1 MDSFFKPAVW AVLWLMFAVR PALA DELTNL LSSREQILRQ FAEDEQPVLP 

51 INRXPARRAG NADELIGSAM GLNEQPVLPV NRXPARRAGN ADXLIGNAMG 

101 LNEQPVLPVN RVPARRAGNA DELIGNAMGL NEQPVLPVNR APARRAGNAD 

151 ELIGNAMGLL GIAYRYGGTS ISTGFDCSGF MQH I FKRAMG INLPRTSAEQ 

201 ARMGTPVARS ELQPGDMVXF RTLGGSRISH VGLYIGNNRF IHAPRTGKNI 

251 EITSLSHKYW SGKYAFARRV KKNDPSRFLN * 



m225-l .pep 
g225-l 

m225-l .pep 
g225-l 

m225-l .pep 
g225-l 

m225-l -pep 
g225-l 

m225~l .pep 
g225-l 
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a225-l/m225-l 88.6% identity in 280 aa overlap 

10 20 30 40 50 60 

a22 5 - 1 . pep MD S F FK P AVW AVLWLM FAVRP ALAD E LTNL LS S REQ I LRQFAED EQ P VL P I NRX PARRAG 

lllllllllllll IIIIIIIIIIIIIIIIIIMIIillllllllllllllill HUM 

m225-l MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
10 20 30 40 50 60 

70 80 90 100 110 120 

a225-l. pep NADELIGSAMGLNEQPVLPVNRXPARRAGNADXLIGNAMGLNEQPVLPVNRVPARRAGNA 

IIIIIHMIIMIII IIIIIIIIIIMIII 

m22 5 - 1 NADELIGSAMGLNEQP VLPVNR V P ARRAGN A 

70 80 90 

130 140 150 160 170 180 

a22 5 - 1 .pep DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLIiGIAYRYGGTSISTGFDCSGF 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I : I I I I I I I I I 
m22 5 - 1 DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 

100 110 120 130 140 150 

190 200 210 220 230 240 

a22 5 - 1 . pep MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVXFRTLGGSRISIIVGLYIGNNRF 

I I i 1 1 iiiiiiii ii ii 1 1 M ii mum 1 1 imi i imimmiiiiiiiiii 

m22 5 - 1 MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
160 170 180 190 200 210 

250 260 270 280 

a225 - 1 . pep I HA PRTG KN I E IT S LSHKYWSG KYAF ARRVKKND PS RF LNX 

1 1 MM II II I M I II I II II Mill Ml 1 1 III M 1 1 1 M 

m2 2 5 - 1 IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 
220 230 240 250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 805>: 

g226 . seq 

1 ATGAGCGAAA TCCTCAGGCA GCCCAGCGTT CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGTGCGCAC GCGCACGGGC AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAAT TCATTGATTT 

2 01 TCGGCTGAAA cccGccgtCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

2 51 GCCGTAAAAT CTTCAACCAG TGGCTGCCCG TCATCGTTTC GCAGCTTGCG 

301 GGCAGCGTTA cggGCATTGT tacggggATG TATTTTgccg cttggctcgg 

351 gccggatacc caattctcct tcccgcctcg tcttcaatat ctgttattta 

401 caccctctgg aatcccaatt cacaccctgt atgcgcgggt tctcccgcca 

451 tttctgttgc ctccgcctct cctgccgcgc ctcggcccgc atacattgcg 

501 ccggttcaca atacttccaa aaaaactacg gccgtttaag cccctcctcc 

551 cagttgtggt cctttctcct Ccgggcctcg cccctcccct cttataa 

This corresponds to the amino acid sequence <SEQ ID 806; ORF 226.ng>: 

g226.pep 

1 MSEILRQPSV LLFLTLAVYA L AIIVRTRTG NIFCNP VLVS TIVLIAYLKI 

51 LGIDYAVYHN AAQFIDFRLK PAWVLAVPL YQNRRKI FNQ WLPVIVSQLA 

101 GSVTGIVTGM YFAAWLGPDT QFSFPPRLQY LLFTPSGIPI HTLYARVLPP 

151 FLLPPPLLPR LGPHTLRRFT ILPKKLRPFK P LLPVWLSP PGLAPPLL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 807>: 

m226 . seq 

1 ATGAACGAAA TCCTCAGGCA GCCCAGCGTT CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGtGCGCAC GCGCACGGGC AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAAT TCATTGATTT 

2 01 TTGGCTGAAA CCCGCCGTCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

251 GCCGTAAAAT CTTCAACCAG TGGCTGCCCG TCATCGTTTC ACAGCTTGCG 

301 GGCAGCGTTA CGGGCATTGT TACAGGGATG TATTTTGCCA AATGGCTGGG 

351 CGCGGAACGC GAAGTCGTCC TCTCGCTCGC GTCCAAATCT GTTACCAACC 

4 01 CCATCGCTAT TGAAATCACC CGCTCCATCG GCGGCATTCC CGCCATTACC 

451 GCCGCCACCG TCATCATTGC CGGTCTGGTC GGACAGATTG CCGGTTACAA 
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501 AATGCTGAAG AACACGGTCG TCATGCCCTC GTCCGTGGGT ATGTCGCTCG 
551 GCACGGCTTC GCACGCGATG GGGATTGCCG CCTCGCTCGA ACGCAGCCGC 
601 CGTATGGCGG CATACGCGGG GCTGGGGCTG ACGTTCAACG GCGTACTGAC 
651 CGCGCTGATT GCGCCGCTGC TCATCCCCGT TTTGGGATTT TGA 

This corresponds to the amino acid sequence <SEQ ID 808; ORF 226>: 

m22 6 .pep 

1 MNEILRQPSV LLFLTLAVYA L AI I VRTRTG NIFCNP VLVS TIVLIAYLKI 

51 LGIDYAVYHN AAQFIDFWLK PAWVLAVPL YQNRRKI FNQ WLPVIVSQLA 

101 GSVTGIVTGM YFAKWLGAER EWLSLASKS VTNPIAIEIT RSIGGIPAIT 

151 AATVIIAGLV GQIAGYKMLK NTWMPSSVG MSLGTASHAM GIAASLERSR 

2 01 RMAAYAGLGL TFNGVLTALI APLLIPVLGF * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 226 shows 94.2% identity over a 121 aa overlap with a predicted ORF (ORF 226.ng) 
from N. gonorrhoeae: 

m226/g226 

10 20 30 40 50 60 

m226 . pep MNE I LRQP S VLL FLTLAVYALAI I VRTRTGNI FCNP VLVS T I VL I A YL KI LG I D YAVYHN 

h I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I 
g226 MSE I LRQPSVLLFLTLAVYALAIIVRTRTGNIFCNPVLVS TIVLIAYLKI LGIDYAVYHN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m226 . pep AAQFIDFWLKPAVWLAVPLYQNRRKI FNQWLPVIVSQLAGSVTGIVTGMYFAKWLGAER 

I MM II 1 1 f 1 1 1 1 1 1 1 1 ! I ! 1 1 1 r I 1 I J I I I I f I I J I 1 I I I I 1 I I I I 1 I 1 I III : 
g226 AAQFIDFRLKPAVWLAVPLYQNRRKI FNQWLPVIVSQLAGSVTGIVTGMYFAAWLGPDT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m226 . pep EVVLSLASKSVTNPIAIEITRSIGGIPAITAATVIIAGLVGQIAGYKMLKNTVVMPSSVG 

g226 QFSFPPRLQYLLFTPSGIPIHTLYARVLPPFLLPPPLLPRLGPHTLRRFTILPKKLRPFK 

130 140 150 160 170 180 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 809>: 

a226. seq 

1 ATGAACGAAA TCCTCAGGCA GCCGAGCATC CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGTGCGCAC GCGCACGGGT AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAGT TTATCGATTT 

201 CTGGCTCAAG CCCGCCGTCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

251 GCCGTAAAAT CTTCAACCAA TGGCTGCCCG TCATCGTTTC GCAGCTTGCG 

301 GGCAGCGTTA CGGGCATTGT TACGGGGATG TATTTTGCCA AATGGCTGGG 

351 CGCGGAACGC GAAGTCGTCC TCTCGCTCGC GTCCAAATCT GTTACCAATC 

4 01 CTATCGCCAT CGAAATCACC CGCTCCATCG GCGGCATTCC CGCCATTACC 

451 GCCGCCACCG TCATCATTGC CGGCCTGGTC GGACAGATTG CCGGTTACAA 

501 AATGTTGAAA AACACGGTCG TTATGCCCTC ATCTGTCGGA ATGTCGCTCG 

551 GCACGGCTTC GCACGCGATG GGCATTGCCG CCTCGCTCGA ACGCAGCCGC 

601 CGCATGGCGG CATACGCGGG GCTGGGGCTG ACGTTCAACG GCGTACTGAC 

651 CGCGCTGATT GCGCCGCTGC TTATCCCCGT TTTGGGATTT TGA 

This corresponds to the amino acid sequence <SEQ ID 810; ORF 226.a>: 

a226.pep 

1 MNE I LRQP SI LLFLTLAVYA LA IIVRTRTG NIFCNP VLVS TIVLIAYLKI 

51 LGIDYAVYHN AAQ FIDFWLK PAWVLAVPL YQNRRKI FNQ WLPVIVSQLA 

101 GSVTGIVTGM YFAKWLGAER EWLSLASKS VTNPIAIEIT RS IGGIPAIT 

151 AATVIIAGL V GQIAGYKMLK NTWMPSSVG MSLGTASHAM GIAASLERSR 

201 RMAAYAGLGL TFN GVLTALI APLLIPVLGF * 

m226/a226 99.6% identity in 230 aa overlap 

10 20 ' 30 40 50 6.0 
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m22 6.pep MNEILRQPSVLLFLTLAVYALAIIVRTRTGNIFCN PVLVSTIVLIAYLKILGIDYAVYHN 

I I I I I I I I I : I I I I I I I I I I I I I I i I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
a22 6 MNEILRQPSILLFLTLAVYALAIIVRTRTGNIFCNPVLVSTIVLIAYLKILGIDYAVYHN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m22 6 - pep AAQFI DFWLKPAVWLAVPLYQNRRKI FNQWLPVIVSQLAGSVTGIVTGMYFAKWLGAER 

I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I 
a226 AAQFI DFWLKPAVWLAVPLYQNRRKI FNQWLPVIVSQLAGSVTGIVTGMYFAKWLGAER 

70 80 90 100 110 120 

130 140 150 160 170 180 

m22 6.pep EWLSLASKSVTNPIAIEITRSIGGIPAITAATVIIAGLVGQIAGYKMLKNTVVMPSSVG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
a22 6 EVVLSLASKSVTNPIAIEITRSIGGIPAITAATVIIAGLVGQIAGYKMLKNTWMPSSVG 

130 140 150 160 170 180 

190 200 210 220 230 

m22 6 . pep MS LGTASHAMGI AASLERSRRMAAYAGLGLT FNGVLTALI APLLI PVLGFX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a226 MS LGTASHAMGI AASLERSRRMAAYAGLGLT FNGVLTALI APLLI PVLGFX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 81 1>: 

g227.seq 

1 atgaacatca tccgcgcgct cctcatcatc ctcggctgcc tcgccgccgg 

51 cgaaaccgcc gttttcctag caggcatcaa actgcccggc agcatcgtcg 

101 gcatgggcgt gctgtttgcg cttttgcagg cgggttggct caaaacgtct 

151 tggctgcaac agcttaccga cgcgctgatg gcaaacctga cgctgttcct 

201 cgtgccgccc tgcgtggcgg tcatcagcta tttggatttg attgccgacg 

251 attggttttc gatactggtt tccgcctccg ccagcacttt gtgcgtactg 

301 ctggttacgg gcaaggttca ccgctggata cggagcatta tctga 

This corresponds to the amino acid sequence <SEQ ID 812; ORF 227.ng>: 

g227 .pep 

1 MNIIRALLII LGCLAAGETA VFLAGIKLPG SIVGMGVLFA LLQAGWLKT S 

51 WLQQLTD ALM ANLTLFLVPP CVAVI SYLDL IADDW FSILV SASASTLCVL 

101 LVTGKVHRWI RSI I* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 813>: 

m227 . seq (partial) 

1 . . ACGTCTTJcGC TGCAACAGCT TACCGACGCG CTGATGTCGA ACCTGACGCT 

51 GTtCCTCGTG CCgCC.TGCG TGGCGGTCAT CAGCTATTTG GATTTGATTG 

101 CCGACGATTG GTTTTCGATA CTGGTTTCCG CCTCCGCCAG cACTTTGTGC 

151 GTACTGCTGG TTACGGGCAA AGTCCACCGG TGGATACGGG GTATTATCCG 

201 ATGA 

This corresponds to the amino acid sequence <SEQ ID 814; ORF 227>: 

m227 .pep (partial) 

1 . . TSXLQQLTD A LMSNLTLFLV PPCVAVI SYL DLIADDW FSI LVSASASTLC 
51 VLLV TGKVHR WIRGIIR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 227 shows 95.5% identity over a 66 aa overlap with a predicted ORF (ORF 227.ng) 
from N. gonorrhoeae: 

m227/g227 

10 20 30 

m227 .pep TSXLQQLTDALMSNLTLFLVPPCVAVISYL 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I ! 
g227 T AVFLAG I KL PG S I VGMG VLFALLQAG WLKT S WLQQLT DALMAN LTL FLVP PC VAV I S Y L 

20 30 40 50 60 70 

40 50 60 
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m227 .pep DLIADDWFS I LVSASASTLCVLLVTGKVHRWIRGI IRX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I : I I I 
g2 2 7 DLIADDWFS I LVS ASASTLCVLLVTGKVHRWIRS I IX 

80 90 100 110 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 815>: 

a227 . seq 

1 ATGAACATCA TCCGCGCGCT CCTCATCATC CTCGGCTGCC TCGCCACCGG 

51 CGAAACCGCC GTTTTCCTAG CAGGCATCAA ACTGCCCGGC AGCATCGTCG 

101 GCATGGGCGT ACTGTTTGCG CTTTTGCAGG CGGGTTGGGT CAAAACGTCT 

151 TGGCTGCAAC AGCTTACCGA CGCGCTGATG GCGAATCTGA CGTTGTTTCT 

201 CGTGCCGCCC TGCGTGGCGG TCATCAGCTA TTTGGATTTG ATTGCCGACG 

251 ATTGGTTTTC GATACTGGTT TCCGCCTCCG CCAGCACTTT GTGCGTACTG 

301 CTGGTTACAG GCAAGGTTCA CCGCTGGATA CGGAGCATTA TCTGA 

This corresponds to the amino acid sequence <SEQ ID 816; ORF 227.a>: 

a227 .pep 

1 MNIIRALLII LGCLATGETA VFLAGIKLPG SIVGMGVLFA LLQAGWVKT S 

51 WLQQLTD ALM ANLTLFLVPP CVAVI SYLDL IADDW FSILV SASASTLCVL 

101 LVTGKVHRWI RSII* 

m227/a227 95.5% identity in 66 aa overlap 

10 20 30 

m227 .pep TSXLQQLTDALMSNLTLFLVPPCVAVISYL 

II I I I I I I I I I : I I I I I I I I I I I I I I I I I 
a227 TAVFLAGIKLPGSIVGMGVLFALLQAGWVKTSWLQQLTDALMANLTLFLVPPCVAVISYL 
20 30 40 50 60 70 

40 50 60 

m227 .pep DLIADDWFS ILVSASASTLCVLLVTGKVHRWIRGI IRX 

I I I I I I I I I I I I I I I i I II I I I I I I I I I I I I I I : I I 
a227 DLIADDWFS ILVSASASTLCVLLVTGKVHRWIRS I IX 

80 90 100 110 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 817>: 

m228 . seq 

1 ATGAAAAAAT TATTGATTGC CGCAATGATG GCGGCTGCCT TGGCAGCTTG 

51 TTCGCAAGAA GCCAAACAGG AGGTTAAGGA AGCGGTTCAA GCCGTTGAGT 

101 CCGATGTTAA AGACACTGCG GCTTCTGCCG CCGAGTCTGC CGCTTCTGCC 

151 GTCGAAGAAG CGAAAGACCA AGTCAAAGAT GCTGCGGCTG ATGCAAAGGC 

201 AAGTGCCGAG GAAGCTGTAA CTGAAGCCAA AGAAGCTGTA ACTGAAGCAG 

251 CTAAAGATAC TTTGAACAAA GCTGCCGACG CGACTCAGGA AGCGGCAGAC 

3 01 AAAATGAAAG ATGCCGCCAA ATAA 

This corresponds to the amino acid sequence <SEQ ID 818; ORF 228>: 

m228.pep 

1 MKKLLIAAMM AAAIAAC SQE AKQEVKEAVQ AVESDVKDTA ASAAESAASA 
51 VEEAKDQVKD AAADAKASAE EAVTEAKEAV TEAAKDTLNK AADATQEAAD 
101 KMKDAAK* 

Computer analysis of this amino acid sequence gave the following results: 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 819>: 

a228 . seq 

1 ATGAAAAAAT TATTGATTGC CGCAATGATG GCGGCTGCCT TGGCAGCTTG 

51 TTCGCAAGAA GCCAAACAGG AGGTTAAGGA AGCGGTTCAA GCCGTTGAGT 

101 CCGATGTTAA AGACACTGCG GCTTCTGCCG CCGAGTCTGC CGCTTCTGCC 

151 GTCGAAGAAG CGAAAGACCA AGTCAAAGAT GCTGCGGCTG ATGCAAAGGC 

201 AAGTGCCGAG GAAGCTGTAA CTGAAGCCAA AGAAGCTGTA ACTGAAGCAG 

251 CTAAAGATAC TTTGAACAAA GCTGCCGACG CGACTCAGGA AGCGGCAGAC 

301 AAAATGAAAG ATGCCGCCAA ATAA 

This corresponds to the amino acid sequence <SEQ ID 820; ORF 228.a>: 
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a228 .pep 

1 MKKLLIAAMM AAALAA CSQE AKQEVKEAVQ ' AVESDVKDTA ASAAESAASA 

51 VEEAKDQVKD AAADAKASAE EAVTEAKEAV TEAAKDTLNK AADATQEAAD 

101 KMKDAAK* 

m228/a228 100.0% identity in 107 aa overlap 

10 20 30 40 50 60 

m228 . pep MKKLLIAAMMAAALAACSQEAKQEVKEAVQAVESDVKDTAASAAESAASAVEEAKDQVKD 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I II I I I I I ! I I I I I I I I I I 
a228 MKKLLIAAMMAAALAACSQEAKQEVPCEAVQAVESDVKDTAASAAESAASAVEEAKDQVKD 

10 20 30 40 50 60 

70 80 90 100 

m2 2 8 . pep AAADAKAS AEEAVTEAKEAVTEAAKDTLNKAADATQEAADKMKDAAKX 

I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
a22 8 AAADAKASAEEAVTEAKEAVTEAAKDTLNKAADATQEAADKMKDAAKX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 82 1>: 

g229.seq 

1 atggctgccg tatcgggcgg cggtgcggtc ttcctgataa tgcttccaca 

51 tattgcccgc gttcagcgtc agccgccagc gttcgcccaa gcgtcgggag 

101 aaatcggcat tgaagccgcc ggcgaaattg tatcggctgc cgcccaagag 

151 'gttttgcccg acaaacggca cggtgccgaa cgagcgcgtt accgaacggt 

201 tttgatggcc gaacgacagg cgcaggttct gttcgctgaa atctttgtta 

251 tcccaataat gcacgccgcg gctgatgccg ccgtagagga aatgatgccc 

301 gcccgcattg atttcgcgcg acacgcccaa gccgtagcgc aaaccgtgtg 

3 51 ccttttgcgg caggctgtcg gcggttttcg tccagcttct gcccgcaaat 

401 tcaatcgttt tttcggacga agcgttgttt atagcggatt aacaaaaatc 

451 aggacaaggc ggcgggccgc aggcagtacg gatggtacgg aaccggttcg 

501 cccggtgctt ggacgcctta gggaaccgtt ccctttgagc cggggcgggg 

551 caacccgtac cggtttttgt tcatccgcca tattgtgttg a 

This corresponds to the amino acid sequence <SEQ ID 822; ORF 229.ng>: 

g22 9 .pep 

1 MAAVSGGGAV FLIMLPHIAR VQRQPPAFAQ ASGEIGIEAA GEIVSAAA QE 

51 VLPDKRHGAE RARYRTVLMA ERQ AQVLFAE IFVIPIMHAA ADAAVEEMMP 

101 ARIDFARHAQ AVAQTVCLLR QAVGGFRPAS ARKFNRFFGR SWYSGLTKI 

151 RTRRRAAGST DGTEPVRPVL GRLREPFPLS RGGATRTGFC SSAILC* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 823>: 

m229.seq (partial) 

1 . . GCTCAAGCGT TGGGAGAAAT CGGCATTGAA GCCGCCGACG AAATTGTATC 

51 GGCTGCCGCC TAAGAGGTTT TGCTCGACAA ACGGCACGAT GCCGAACGAG 

101 CGCGTTACCG AACGGTTTTT ATAGCCGAAC GACAGGCGCA GGCTCTGTTC 

151 GCTGAAATCT TTGTTATCCC AATAATGCAC GCCGCCGCCG CTGATGCCGC 

2 01 CGTAGAGGAA ATGATGCCTG CCCGCATTGA TTTCGCGCGA CACGCCTAAG 
251 CCCTAGCGCA AACCGTGTGC CTTTTGCGGC AGGCTGTCGG CGGTTTTCGT 

3 01 CCAGCTTCTG CCCGCAAATT CAATCGTTTT TTCGGACGAA GCGTTGTTTA 

3 51 TAGCGGATTA ACAAAAATCA GGACAAGGCA ACGAAGCCGC AGACAGTACA 

4 01 AATAGTACGG AACCGATTCA CTTGGTGCTT CAGCACcTTA GAGAATCGTT 
4 51 CTCTTTTTTG TTCATCCGCT ATATTGTGTT GA 

This corresponds to the amino acid sequence <SEQ ID 824; ORF 229>: 

m229.pep (partial) 

1 . . AQALGEIGIE AADEIVSAAA XEVLLDKRHD AERARYRTVF IAERQ AQALF 

51 AEIFVIPIMH AA AADAAVEE MM PAR ID FAR HAXALAQTVC LLRQAVGGFR 

101 PASARKFNRF FGRSWYSGL TKIRTRQRSA DSTNSTEPIH LVLQHLRESR 

151 SLFCSSAILC * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 229 shows 80.5% identity over a 169 aa overlap with a predicted ORF (ORF 229.ng) 
from TV. gonorrhoeae: 

m229/g209 

10 20 30 

m2 2 9 . pep AQALGEIGIEAADEIVSAAAXEVLLDKRHDAE 

III IIIIMII lllllll III I I I I II 
g229 MAAVS GGGAVFLI ML PH I ARVQRQ P PAFAQAS GE I G I EAAGE I VS AAAQE VLPDKRHGAE 

10 20 30 40 50 60 

40 50. 60 70 80 90 

m229 .pep RARYRTVF I AERQ AQALFAE I FVI P I MHAAAADAAVEEMMPARI DFARHAXALAQTVCLL 

IIIMI|::|IIII|:|||IMIIIIIMII II I II I I I I I I I I I I I I I 1 = 1111111 
g2 2 9 RARYRTVLMAERQAQVLFAEI FVI PIMHAAA-DAAVEEMMPARIDFARHAQAVAQTVCLL 

70 80 90 100 110 

100 110 120 130 140 
m229 .pep RQAVGGFRPASARKFNRFFGRSWYSGLTKIRTRQRSADSTNSTEPIHLVLQHLRE 

Mill II lllllll I I II II I I II II I II MM I : I : I I MM II- II MM 
g229 RQAVGGFRPASARKFNRFFGRSWYSGLTKIRTRRRAAGSTDGTEPVRPVLGRLREPFPL 
120 130 140 150 160 170 

150 160 
m22 9.pep SRSLFCSSAILCX 

:h IIIIMII 
g 2 2 9 SRGGATRTGFCS S AI LC 

180 190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 825>: 

a229.seq (partial) 

1 ATGGCTGTCG TATCGGGCGG CGGTGCGGTC TTCCTGATAA CGCTTCCACA 

51 TATTGCCCAC GTTCAGCGTC AGCCGCCA. . GTTCGCTCAA GCGTCGGGAG 

101 AAATCGGCAT TGAAGCCGCC GACGAAATTG TATCGGCTGC CGCCTAAGAG 

151 GTTTTGCTCG ATAAACGGCA CGATGCCGAA TGAGCGCGTT ACTGAACGGT 

201 TTTTATAGCC GAGCGACAGG CGCAGGCTCT GTTCGCTGAA ATCTTTGTTA 

251 TCCTAATAGT GCACGCCGCC GCCGCTGATG TCTCCGTAGA GGAAATGATG 

301 CCCGCCCGCA TTGATTTCGC GCGACACGCC CAAGCCGTAG CGCAAACCGT 

351 GTGCCTTTTG CGGCAGGCTG TCGGCGGTTT TCGTCCAGCT TCTGCCTGCA 

401 AATTCAATCG TTTTTTCGGA CGAAGCGTTG TTTATAGCGG ATTAACAAAA 

451 ATCAGGACAA GGCGACGAAG CGCAGACAGT ACAGATAGTA CGGAACCGAT 

501 TCACTTGGTG CTTCAGCACC TTAGAGAATC GTCTCTTTGA GCTAAGGCGA 

551 GGCAACGCCG TACTGGTTTT TGTTCATCCA CTATA 

This corresponds to the amino acid sequence <SEQ ID 826; ORF 229.a>: 

a229.pep (partial) 

1 MAWSGGGAV FLITLPHIAH VQRQPPXFAQ ASGEIGIEAA DEIVSAAA *E 

51 VLLDKRHDAE *ARY*TVFIA ERQ AQALFAE IFVILIVHAA AADVSVEEMM 

101 PARIDFARHA QAVAQTVCLL RQAVGGFRPA SACKFNRFFG RSWYSGLTK 

151 IRTRRRSADS TDSTEPIHLV LQHLRESSL* AKARQRRTGF CSSTI 

m229/a229 85.6% identity in 167 aa overlap 

10 20 30 

m22 9 . pep AQALGEIGIEAADEIVSAAAXEVLLDKRHDAE 

Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a22 9 MAWSGGGAVFLITLPHIAHVQRQPPXFAQASGEIGIEAADEIVSAAAXEVLLDKRHDAE 

10 20 30 40 50 60 

40 50 60 70 80 90 

m22 9.pep RARYRTVFIAERQAQALFAEIFVIPIMHAAAADAAVEEMMPARIDFARHAXALAQTVCLL 
III I I I I I I I I I I I I I I I I I I I I : I I I I I I :: I I I I 1 I I I I I I I I I I I : i I I I I I I 
a22 9 XARYXTVFIAERQAQALFAEIFVILIVHAAAADVSVEEMMPARIDFARHAQAVAQTVCLL 

70 80 90 100 110 120 
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100 110 120 130 140 149 

m229.pep RQAVGGFRPASARKFNRFFGRSWYSGLTKIRTRQRSADSTNSTEPIHLVLQHLRES 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I : I I I I I I : || I M I I I I I I I I I I 
a22 9 RQAVGGFRPASACKFNRFFGRSWYSGLTKIRTRRRSADSTDSTEPIHLVLQHLRESSLX 

130 140 150 160 170 180 

150 160 

m229.pep RSLFCSSAILCX 

I : I I I I: I 

a229 AKARQRRTGFCSSTI 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 827>: 

g230 . seq 

1 atgttccatt ccatcgaaaa atacagaaca cccgcccaag tcttattagg 

51 cctgattgca ttaacttttg tcggcttcgg cgtcagcacg * gtttcccatc 

101 cgggcgccga ctacatcgtc caagtgggcg acgaaaaaat cagcgagcac 

151 tcaatcaaca acgccatgca gaacgagcag gcggacggcg gcagcccttg 

201 gcgcgacgcg gtgttccaat ccctgctgca acgcgcctac ctgaaacagg 

251 gcgcgaagct gatgggcatt tcggtttctt ccgaacaaat caagcagatg 

301 attgtggacg atcccaattt ccacgacgca aacggcaaat tcagtcacgc 

351 gcttttgagt caatacctgt cgcaacgcca tatgtctgaa gaccagtttg 

401 tcgaagaaat ccgcgatcag tttgccttgc agaatttggt aagcctcgtc 

4 51 caaaacggcg tattggtcgg cgacgcgcag gcggaacagc tgatcaggct 

501 gacgcaggtc aaccgcacca tccgttcgca cactttcaac cccgacgagt 

551 tcatcgccca agtcaaagcg tctgaagccg atttgcagaa attttataat 

601 gcgaacaaaa aagactatct gctgccgcag gcggtcaaat tggaatatgt 

651 cgccttgaat ctgaaggatt ttgcagacaa gcagaccgtc agtgaaacgg 

701 aagtgaaaaa tgcgtttgaa gagcgcgtgg cgcgtttgcc ggcacatgaa 

751 gccaaacctt ctttcgagca ggaaaaagcc gccgtcgaaa acgaattgaa 

801 aatgaaaaag gcggttgccg acttcaacaa ggcaaaagaa aagctgggcg 

851 acgatgcgtt caatcatccc tcctcgcttg ccgaagccgc caaaaacagc 

901 ggtttgaaag tggaaaccca agaaacttgg ctgagcaggc aggacgcaca 

951 aatgtccggc atgcccgaaa acctaatcaa tgccgtattc agcgacgacg 

1001 tattgaagaa aaaacacaat tccgaagtgc tgaccatcaa cagcgaaacc 

1051 gcgtgggtcg tccgcgccaa agaagtccgc gaagaaaaaa acctactgtt 

1101 tgaagaagcc aaagatgcgg tgcgtcaggc ctatatccgt accgaagccg 

1151 ccaaactttt gaaaacaatg taa 

This corresponds to the amino acid sequence <SEQ ID 828; ORF 230.ng>: 

g230 .pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISEH 

51 SINNAMQNEQ ADGGSPWRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQM 

101 IVDDPNFHDA NGKFSHALLS QYLSQRHMSE DQFVEEIRDQ FALQNLVSLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKA SEADLQKFYN 

2 01 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPAHE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWWRAKEVR EEKNLLFEEA KDAVRQAYIR TEAAKLLKTM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 829>: 

m23 0.seq (partial) 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAaT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

2 01 GCC.GACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 
251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAAATT 
301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

3 51 GCTTTTAAAC CGCTACCTTT CCCAACGCCA TATGTCTGAA GACCAGTTTG 
401 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 
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451 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACACAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAGT 

551 TCATCGCCCA AGTCAAAGTG TCTGAAGCCG ATTTGCAGAA ATTTTATAAT 

601 GCGAACAAAA AAGACTATCT GCTGCCGCAG gCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAGGATT TTGCAGACAA GCAGACCGTC AGTGAAACGg 

701 AAGTGAAAAA TGCATTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAACAA GGCAAAAGAA AAATTGGGCG 

851 ACGATGC.GT cAACCATCCT TCyTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGTAGGC AGGACGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTT 



This corresponds to the amino acid sequence <SEQ ID 830; ORF 230>: 

m230.pep (partial) 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 



51 SINNAIQNEQ ADGGGPSPDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDD AYNH P SSLAEAAKNS 

3 01 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWWRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKL. ... 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 230 shows 95.9% identity over a 386 aa overlap with a predicted ORF (ORF 230.ng) 
from N. gonorrhoeae: 

m230/g230 



m23 0.pep 



10 20 30 40 50 60 

MFHS I EKYRTPAQVLLGLI ALTFVGFGVSTVSHPGADY I VQVGDEKI SDHS INNAI QNEQ 



g230 




10 20 30 40 50 60 



m230 .pep 



70 80 90 100 110 120 

ADGGGPSPDAVFQSLLQRAYLKQGAKLMGISVSSEQI KQI IVDDPNFHDANGKFDHALLN 



g230 




70 80 90 100 110 120 



m230 .pep 



130 140 150 160 170 180 

RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 



g230 




130 140 150 160 170 



m230 .pep 



190 200 210 220 230 240 

PDEFIAQVKVSEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 



g230 




190 200 210 220 230 240 



m230 .pep 



250 260 270 280 290 300 

ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAVNHPSSLAEAAKNS 



g230 




250 260 270 280 290 300 
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310 320 330 340 350 360 

m230 .pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 

I M 1 1 1 1 1 1 1 1 1 1 1 1 111 Ml I i 1 1 M Ml M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II M I , 

g23 0 GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 

310 320 330 340 350 360 



370 380 

m2 3 0 . pep E E KTL P FAE AKD AVRQ AY I RTE AAKL 

! I h I I 1 1 M 1 1 1 M II I II M 1 1 

g2 3 0 EEKNLLFEEAKDAVRQAYIRTEAAKLLKTM 

370 380 390 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 831: 

a230.seq (partial) 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTACA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAGATT 

301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

351 GCTTTTAAAC CGCTACCTTT CCCAACGTCA TATGTCTGAA GACCAGTTTG 

4 01 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 

4 51 CAAAAC GGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACGCAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAAT 

551 TCATCGCCCA AGTCAAAGTG TCTGAAGCCG ATTTGCAGAA GTTTTATAAC 

601 GCAAACAAAA AAGACTACCT GCTTCCCAAA GCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAAGACT TTGCAGACAA ACAGACCGTC AGCGAAACAG 

701 AAGTGAAAAA TGCGTTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAATAA GGCAAAAGAA AAGCTGGGCG 

851 ATGACGCGTT CAACCATCCT TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGCAGGC AGGATGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTT 



This corresponds to the amino acid sequence <SEQ ID 832; ORF 230.a>: 

a230 .pep (partial) 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPK AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLT INSET 

351 AWVVRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKL 

m230/a230 99.2% identity in 386 aa overlap 

10 20 30 40 50 60 

m230.pep MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 
I I ! I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I | | | I | | | | | | | | | | | | | l | | | | | 
a230 MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 

10 20 30 40 50 60 



70 80 90 100 110 120 

m230.pep ADGGG PS PDAVFQSLLQRAYLKQGAKLMG I SVSSEQIKQI I VDDPNFHDANGKFDHALLN 

MINI! M I I I I I I I II I I I I I I I I I M I I I I I I I I I ! I I | | | | | | | | | | | | | | | | | | 
a230 " ADGGGPSRDAVFQSLLQRAYLKQGAKLMGI SVSSEQIKQI I VDDPNFHDANGKFDHALLN 

70 80 90 100 110 120 



130 140 150 160 170 180 
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RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 
130 140 150 160 170 180 

190 200 210 220 230 240 

PDEFIAQVKVSEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 
I I I I I I I I I I M I I I I I I I I I I I I I I I I I : I I i I I I I I I I I I II I I I I I I I I I I I I I | | I 
PDEFIAQVKVSEADLQKFYNANKKDYLLPKAVKLEYVALNLKDFADKQTVSETEVKNAFE 

190 200 210 220 230 240 

250 260 270 280 290 300 

ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAVNHPSSLAEAAKNS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 

250 260 270 280 290 300 

310 320 330 340 350 360 

GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I | I I 
GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLT INSET AWVVRAKEVR 

310 320 330 340 350 360 

370 380 
EEKTLPFAEAKDAVRQAYIRTEAAKL 
I I I I I I I I M I I I I I I I I I I I I I I I I 

EEKTLPFAEAKDAVRQAYIRTEAAKL 
370 380 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 833>: 

g230-l. seq 

1 ATGTTCCATT CCATCGAAAA ATACAGAACA CCCGCCCAAG TCTTATTAGG 

51 CCTGATTGCA TTAACTTTTG TCGGCTTCGG CGTCAGCACG GTTTCCCATC 

101 CGGGCGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGAGCAC 

151 TCAATCAACA ACGCCATGCA GAACGAGCAG GCGGACGGCG GCAGCCCTTG 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAGATG 

301 ATTGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCAGTCACGC 

351 GCTTTTGAGT CAATACCTGT CGCAACGCCA TATGTCTGAA GACCAGTTTG 

401 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAGCCTCGTC 

4 51 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACGCAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAGT 

551 TCATCGCCCA AGTCAAAGCG TCTGAAGCCG ATTTGCAGAA ATTTTATAAT 

601 GCGAACAAAA AAGACTATCT GCTGCCGCAG GCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAGGATT TTGCAGACAA GCAGACCGTC AGTGAAACGG 

701 AAGTGAAAAA TGCGTTTGAA GAGCGCGTGG CGCGTTTGCC GGCACATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAACAA GGCAAAAGAA AAGCTGGGCG 

851 ACGATGCGTT CAATCATCCC TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TGGAAACCCA AGAAACTTGG CTGAGCAGGC AGGACGCACA 

951 AATGTCCGGC ATGCCCGAAA ACCTAATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAAAAAA ACCTACTGTT 

1101 TGAAGAAGCC AAAGATGCGG TGCGTCAGGC CTATATCCGT ACCGAAGCCG 

1151 CCAAACTTGC CGAAAACAAG GCAAAAGAAG TGCTTACCCA ACTGAACGGC 

1201 GGCAAGGCAG TTGACGTGAA ATGGTCGGAA GTGTCCGTTT TGGGCGCGCA 

1251 GCAGGCAAGG CAGTCCATGC CGCCCGAGGC TTATGCGGAA CTGCTGAAAG 

1301 CAAAACCGGC AAACGGCAAA CCCGCCTATG TCAGACTGAC CGGTCTGCCG 

1351 GCACCCGTGA TTGTCGAGGC GCAGGCAGTC ACGCCTCCGG AGGATATTGC 

1401 CGCACAGCTT CCTCCTGCGA AACAGGCTTT GGCGCAACAG CAGTCTGCCA 

14 51 ATACTTTCGA CCTGCTGATC CGCTATTTCA ACGGAAAAAT CAAACAGACT 

1501 AAAGGAGCAC AATCGGTTGA CAACGGCGAT GGTCAGTAA 

This corresponds to the amino acid sequence <SEQ ID 834; ORF 230-l.ng>: 

g230-l.pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISEH 

51 SINNAMQNEQ ADGGSPWRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQM 

101 IVDDPNFHDA NGKFSHALLS QYLSQRHMSE DQFVEEIRDQ FALQNLVSLV 



m230 . pep 
a230 

m230 .pep 
a230 

m230.pep 
a230 

m230.pep 
a230 

m230 .pep 
a230 
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151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKA SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPAHE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWVVRAKEVR EEKNLLFEEA KDAVRQAYIR TEAAKLAENK AKEVLTQLNG 

4 01 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLTGLP 

451 APVIVEAQAV TPPEDIAAQL PPAKQALAQQ QSANTFDLLI RYFNGKIKQT 

501 KGAQSVDNGD GQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 835>: 

m230-l .seq 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 

2 51 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAAATT 

301 ATCGTGGACG ATCCCAATTT CCACGAC.GCA AACGGCAAAT TCGACCACGC 

351 GCTTTTAAAC CGCTACCTTT CCCAACGCCA TATGTCTGAA GACCAGTTTG 

401 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 

451 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACACAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAGT 

551 TCATCGCCCA AGTCAAAGTG TCTGAAGCCG ATTTGCAGAA ATTTTATAAT 

601 GCGAACAAAA AAGACTATCT GCTGCCGCAG GCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAGGATT TTGCAGACAA GCAGACCGTC AGTGAAACGG 

701 AAGTGAAAAA TGCATTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

7 51 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAACAA GGCAAAAGAA AAATTGGGCG 

851 ACGATGCGTT CAACCATCCT TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGTAGGC AGGACGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTTGC CGAAAACAAG GCAAAAGACG TGCTTACCCA ACTGAACGGC 

1201 GGCAAGGCTG TTGACGTGAA ATGGTCGGAA GTGTCCGTTT TGGGCGCACA 

1251 GCAGGCAAGG CAGTCCATGC CGCCCGAGGC TTATGCGGAA CTGCTGAAAG 

1301 CAAAACCGGC AAACGGCAAA CCCGCCTACG TCAGGCTGAT CGGTCTGCCG 

1351 GCACCCGTGA TTGTCGAAGT ACAGGCTGTA ACCCCGCCGG ATGATATCGC 

1401 CGCACAGCTT CCGCTTGCAA AACAGGCTTT GGCGCAACAG CAGTCTGCCA 

1451 ATACTTTCGA CTTGTTGATA CGTTATTTCA ACGGCAAAAT CAAACAGACC 

1501 AAAGGAGCGC AATCGGTCGA CAACGGCGAC GGTCAGTAA 

This corresponds to the amino acid sequence <SEQ ID 836; ORF 230- 1>: 

m230-l.pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWVVRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKLAENK AKDVLTQLNG 

401 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLIGLP 

451 APVIVEVQAV TPPDDIAAQL PLAKQALAQQ QSANTFDLLI RYFNGKIKQT 

501 KGAQSVDNGD GQ* 

m230-l/g230-l 96.3% identity in 512 aa overlap 



10 20 30 40 50 60 

MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 
I I I M I I I I I I I ! I I I I I 11! I II I I i I I ! I I I t I I I I I I I I I I i I I I : i M I I I :] I I I 
MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISEHSINNAMQNEQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

ADGGGPSRDAVFQSLLQRAYLKQGAKLMGI SVSSEQIKQI I VDDPNFH DANGKFDHALLN 
I I I I : I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I : I I I I I I I I I I I I I | : | | I | : 
ADGGSPWRDAVFQSLLQRAYLKQGAKLMGISVSSEQIKQMIVDDPNFHDANGKFSHALLS 

70 80 90 100 110 120 



m230-l . pep 
g230-l 

m230-l.pep 
g230-l 



m230-l .pep 



130 140 150 160 170 180 

RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 
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: I I I I I I I I I I I I I I I I i I i t I I I I I I : t I M I I I I I I I I t I I I I I I I I I I I I M I I I I I 
g2 3 0 - 1 QYLSQRHMS EDQFVEEI RDQFALQNLVSLVQNGVLVGDAQAEQLI RLTQVNRT I RSHTFN 

130 140 150 160 170 180 



190 200 210 220 230 240 

m230-l.pep PDEFIAQVKVSEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 
I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I ! I I I I I I I I I I I 
g230-l PDEFIAQVKASEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 

190 200 210 220 230 240 



250 260 270 280 290 300 

m230-l . pep ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 
I I I I I I I I : I I I I I I I I I I I I I I I I I I I I ! I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g230-l ERVARLPAHEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 

250 260 270 280 290 300 

310 320 330 340 350 360 

m230-l -pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 1 I ! I I I I I I 
g230-l GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 

310 320 330 340 350 360 



370 380 390 400 410 420 

m230-l.pep EEKTLPFAEAKDAVRQAYIRTEAAKLAENKAKDVLTQLNGGKAVDVKWSEVSVLGAQQAR 
I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I : II I I I I I I I I I I I I I I I I I I I M I I I I 
g2 3 0 - 1 EEKNLLFEE AKDAVRQA Y I RTEAAKLAENKAKEVLTQLNGGKAVDVKWSEVS VLGAQQAR 

370 380 390 400 410 420 

430 440 450 460 470 480 

m230-l .pep QSMPPEAYAELLKAKPANGKPAYVRLIGLPAPVIVEVQAVTPPDDIAAQLPLAKQALAQQ 
I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I : I I I I I I I I I I I I I I I 
g230-l QSMPPEAYAELLKAKPANGKPAYVRLTGLPAPVIVEAQAVTPPEDIAAQLPPAKQALAQQ 

430 440 450 460 470 480 



490 500 510 

m230~l .pep QSANTFDLLIRYFNGKIKQTKGAQSVDNGDGQX 
I I I I I I I I I I I I I I I I I i 1 I I I I I I I I I I I I I I 
g230-l QSANTFDLLIRYFNGKIKQTKGAQSVDNGDGQX 

490 500 510 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 837>: 



a230-l. seq 

1 ATGTTCCATT 

51 CCTGATTGCA 

101 CGGGTGCCGA 

151 TCCATCAACA 

201 GCGCGACGCG 

251 GCGCGAAGCT 

301 ATCGTGGACG 

351 GCTTTTAAAC 

401 TCGAAGAAAT 

4 51 CAAAACGGCG 

501 GACGCAGGTC 

551 TCATCGCCCA 

601 GCAAACAAAA 

651 CGCCTTGAAT 

701 AAGTGAAAAA 

751 GCCAAACCTT 

801 AATGAAAAAG 

851 ATGACGCGTT 

901 GGTTTGAAAG 

951 AATGTCCGGT 

1001 TATTGAAGAA 

1051 GCGTGGGTCG 

1101 TGCCGAAGCC 

1151 CCAAACTTGC 

1201 GGCAAGGCTG 

1251 GCAGGCAAGG 

1301 CAAAACCGGC 

1351 GCACCCGTGA 

1401 CGCACAGCTT 

1451 ATACTTTCGA 

1501 AAAGGAGCGC 



CCATCGAAAA 
TTAACCTTCG 
CTACATCGTC 
ACGCCATACA 
GTGTTCCAAT 
GATGGGCATT 
ATCCCAATTT 
CGCTACCTTT 
CCGCGATCAG 
TATTGGTCGG 
AACCGCACCA 
AGTCAAAGTG 
AAGACTACCT 
CTGAAAGACT 
TGCGTTTGAA 
CTTTCGAGCA 
GCGGTTGCCG 
CAACCATCCT 
TCGAAACCCA 
ATGCCCGAAA 
AAAACACAAT 
TCCGCGCCAA 
AAAGACGCGG 
CGAAAACAAG 
TTGACGTGAA 
CAGTCCATGC 
AAACGGCAAA 
TTGTCGAAGT 
CCGCTTGCAA 
CTTGTTGATA 
AATCGGTCGA 



ATACAGAACG 
TCGGCTTCGG 
CAAGTGGGCG 
GAACGAACAG 
CCCTGCTACA 
TCGGTTTCTT 
CCACGACGCA 
CCCAACGTCA 
TTTGCCTTGC 
CGACGCGCAG 
TCCGTTCGCA 
TCTGAAGCCG 
GCTTCCCAAA 
TTGCAGACAA 
GAGCGCGTGG 
GGAAAAAGCC 
ACTTCAATAA 
TCCTCGCTTG 
AGAAACTTGG 
ACCTGATCAA 
TCCGAAGTGC 
AGAAGTCCGC 
TACGTCAGGC 
GCAAAAGACG 
ATGGTCGGAA 
CGCCCGAGGC 
CCCGCCTACG 
ACAGGCTGTA 
AACAGGCTTT 
CGTTATTTCA 
CAACGGCGAC 



CCCGCCCAAG 
GGTCAGCACG 
ACGAAAAAAT 
GCGGACGGCG 
ACGCGCCTAC 
CCGAACAAAT 
AACGGCAAAT 
TATGTCTGAA 
AGAATTTGGT 
GCGGAACAGC 
CACTTTCAAC 
ATTTGCAGAA 
GCGGTCAAAT 
ACAGACCGTC 
CGCGTTTGCC 
GCCGTCGAAA 
GGCAAAAGAA 
CCGAAGCCGC 
CTGAGCAGGC 
TGCCGTATTC 
TGACCATCAA 
GAAGAGAAAA 
TTATATCCGT 
TGCTTACCCA 
GTGTCCGTTT 
TTATGCGGAA 
TCAGGCTGAT 
ACCCCGCCGG 
GGCGCAACAG 
ACGGCAAAAT 
GGTCAGTAA 



TCCTTTTGGG 
GTATCCCATC 
CAGCGACCAC 
GCGGCCCTTC 
CTGAAACAGG 
CAAGCAGATT 
TCGACCACGC 
GACCAGTTTG 
AAACCTCGTC 
TGATCAGGCT 
CCCGACGAAT 
GTTTTATAAC 
TGGAATATGT 
AGCGAAACAG 
GGCAAATGAA 
ACGAATTGAA 
AAGCTGGGCG 
CAAAAACAGC 
AGGATGCGCA 
AGCGACGACG 
CAGCGAAACC 
CCCTGCCGTT 
ACCGAAGCCG 
ACTGAACGGC 
TGGGCGCACA 
CTGCTGAAAG 
CGGTCTGCCG 
ATGATATCGC 
CAGTCTGCCA 
CAAACAGACC 
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This corresponds to the amino acid sequence <SEQ ID 838; ORF 230-La>: 

a230-l.pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPK AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWVVRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKLAENK AKDVLTQLNG 

4 01 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLIGLP 

451 APVIVEVQAV TPPDDIAAQL PLAKQALAQQ QSANTFDLLI RYFNGKIKQT 
501 KGAQSVDNGD GQ* 



a230-l/m230-l 99.8% identity in 512 aa overlap 



10 20 . 30 40 50 60 

a230-l.pep MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 
I I I I I I I I I I I I I I 1 I I I I I I II I I I I I! I I I I I I I 1 I I I I I I I ! I I I I I I I I I I I I I I I 
m230-l MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 

10 20 30 40 50 60 



70 80 90 100 110 120 

a230-l . pep ADGGGPSRDAVFQSLLQRAYLKQGAKLMGI SVSSEQIKQI I VDDPNFHDANGKFDHALLN 
I I M I I I I I I I I I ! I I I I I I I I I I i M I I I I I I I II I I I I I I II i i I I I I I I I I I I I ! I I 

m2 3 0 - 1 ADGGGPSRDAVFQSLLQRAYLKQGAKLMGI SVSSEQI KQI I VDD PNFH DANGKFDHALLN 

70 80 90 100 110 120 



130 140 150 160 170 180 

a2 3 0 - 1 . pep RYLSQRHMSEDQFVEEI RDQFALQNLVNLVQNGVLVGDAQAEQLI RLTQVNRT I RSHTFN 
I I I I I I I I I I I I \ I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m230-l RYLSQRHMSEDQFVEEI RDQFALQNLVNLVQNGVLVGDAQAEQLI RLTQVNRT I RSHTFN 

130 140 150 160 170 180 



190 200 210 220 230 240 

a230-l . pep PDEFIAQVKVSEADLQKFYNANKKDYLLPKAVKLEYVALNLKDFADKQTVSETEVKNAFE 
I I I I I I I II I I I I I I I I I I I I II I I I I I I : I I I I I I I I ! I I I I I I I I I I I ! I I I I M I I I 
m230~l PDEFIAQVKVSEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 

190 200 210 220 230 240 



250 260 270 280 290 300 

a230-l.pep ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 
I I I I I I I I I I I I II I I I I! I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
m230-l ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 

250 260 270 280 290 300 



310 320 330 340 350 360 

a230-l.pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m230-l GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 
310 320 330 340 350 360 



370 380 390 400 410 420 

a230-l.pep EEKTLPFAEAKDAVRQAYIRTEAAKLAENKAKDVLTQLNGGKAVDVKWSEVSVLGAQQAR 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I 
m230-l EEKTLPFAEAKDAVRQAYIRTEAAKLAENKAKDVLTQLNGGKAVDVKWSEVSVLGAQQAR 

370 380 390 400 410 420 

430 440 450 460 470 480 

a230-l.pep QSMPPEAYAELLKAKPANGKPAYVRLIGLPAPVIVEVQAVTPPDDIAAQLPLAKQALAQQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I II 
m230-l QSMPPEAYAELLKAKPANGKPAYVRLIGLPAPVIVEVQAVTPPDDIAAQLPLAKQALAQQ 

430 440 450 460 470 480 



490 500 510 

a230-l.pep QSANTFDLLI RYFNGKIKQTKGAQSVDNGDGQX 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m230-l QSANTFDLLI RYFNGKIKQTKGAQSVDNGDGQX 

490 500 510 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 839>: 



1 atgtcaaaac gaaaatccat aaaccgtccg tatcaaaaac cggcggaact 

51 gccgccgttg caaaataatc cgccatttta ccgtaaaaac cgccgcctga 

101 acttttttat cgcggcagac ggcggttgcg cgtctccgca aaaatgcagg 

151 gcgcgcggtt ttcagacggc atttgccgtt caaggccgtg cggtgtcttt 

201 accaaatgcc caaccattcg cccacggaat ccatccaatc cttattgccc 

251 ccgccgctcc tgcctgcccg gcggtacgcc cacggcgctt gcggattttt 

301 agctttccac aatcctttgc gttccctttc cgcctgaatt tgagcgtcgg 

351 catagtcggc aaaatccgcc ttatcctgct gttctttagc ataactttta 

401 taatgccacg ccgccccgtc ctgcacctgc atcaggttca aatcggtttt 

451 gccggcggat acctgcgcca cttcgcgctg atagcggtcg gtttcaaaca 

501 cacgtacact gactttccta ccctccgccg ccgcgcgcag gttgtcgcgc 

551 gaacgtgtac cgtaagcctg tttcatctcc ggtgcgtcga tatacgccat 

601 ccgaatttta tgtttcgcgc cgtcgccgtc gatgacgtga agggtatcgc 

651 cgtcatagac tttggacacc gtgcctgtgt agctgtggcc ggatttcgcc 

701 gatgcccgtc ggcgaacggg cgcgtcgaaa cccacgtccc ctgcagtgcc 

751 gagtacgtcg agtacggcaa ccgccgtccg caccgcctca ctgtcatatc 

801 ccgtataacc caacgcgccc aaaagcgaca gggcgacggg aagccatttc 

851 atgatttttt taatctgcat atttttcaaa tgccgatgcc gtctgaacat 

901 ctctga 



This corresponds to the amino acid sequence <SEQ ID 840; ORF 231.ng>: 

g231 .pep 



1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 

51 ARGFQTAFAV QGRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 

101 SFPQSFAFPF RLNLSVG IVG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 

151 AGGYLRHFAL IAVGFKHTYT DFPTLRRRAQ WARTCTVSL FHLRCVDIRH 

201 PNFMFRAVAV DDVKGIAVID FGHRACVAVA GFRRCPSANG RVETHVPCSA 

251 EYVEYGNRRP HRLTVISRIT QRAQKRQGDG KPFHDFFNLH IFQMPMPSEH 

301 L* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 84 1>: 

m231.seq (partial) 



1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG GC 



This corresponds to the amino acid sequence <SEQ ID 842; ORF 23 1>: 

m231.pep (partial) 



1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 
51 ARGFQTAFAV QSRAVSLPNA QPFG 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 23 1 shows 98.6% identity over a 73 aa overlap with a predicted ORF (ORF 23 1 .ng) 
from N. gonorrhoeae: 



g231 . seq 



m231/g231 



m231.pep 



10 20 30 40 50 60 

MSKRKS INRPYQKPAELPPLQNNPPFYRKNRRLNFFI AADGGCAS PQKCRARGFQTAFAV 



g231 




10 20 30 40 50 60 



m231 .pep 



70 

QSRAVSLPNAQPFG 



g23l 




70 80 90 100 110 120 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 843>: 
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a231.seq (partial) 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 

101 ACTTTTTTAT CGNGGCAGAC •GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGCAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCCCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAATCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

4 51 GCCGACAGAA ACCTGCGCCA CTTCGCGCTG GTAGCGGTCG GTGTCGAACA 

501 CGCGGACGCT GACTTTCCTG CCTTCCGCCG CCGCGCGCAG GTTGTCGCGC 

551 GAACGCGTGC CGTAAGCCTG TTTCATCTCC GGCGCGTCGA TATACGCCAT 

601 CCGGATTTTG TGTTTCGCGC CGTCGCCGTC GATAACGTGA AGGGTGTCGC 

651 CGTCATAGAC TTTGGACACC GTGCCTGTGT AGCGGTGGCC GGATTTCGCC 

701 GATGCTCGGC GGCGGGCGGG CGCGTCGGAA CCCGCGTCCC CTGCCGCGCC 

751 GAGTACGTCG AGTACGGCAA CCGCCGTCCG CACCGCCTCG CTGCCGTACC 

801 CCGTATAACC CAACGCACCC AAAAGCGACA AGGCGACGGG AAGCCATTTC 

851 ATGATTTTTT TAATCTGCAT ATTTTTCAAA TGCCGATGCC GTCTGAACAT 

901 ATC 

This corresponds to the amino acid sequence <SEQ ID 844; ORF 217.a>: 

a231.pep (partial) 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIXAD GGCASPQKCR 

51 ARGFQTAFAV QSRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 

101 SFPQSFAFPF RLNLSVG IIG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 

151 ADRNLRHFAL VAVGVEHADA DFPAFRRRAQ VVARTRAVSL FHLRRVDIRH 

201 PDFVFRAVAV DNVKGVAVID FGHRACVAVA GFRRCSAAGG RVGTRVPCRA 

251 EYVEYGNRRP HRLAAVPRIT QRTQKRQGDG KPFHDFFNLH IFQMPMPSEH 

301 I 

m231/a231 98.6% identity in 73 aa overlap 

10 20 30 40 50' 60 

m231 . pep MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 
M I I I I I M I I I I I I I I I I I I | | I I I I I I I I I I I I I I | | | | | | | | | | | | | | j | j | | | | | 
a231 MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIXADGGCASPQKCRARGFQTAFAV 

10 20 30 40 50 60 

■ * 70 
m231 .pep QSRAVSLPNAQPFG 
I I I I I I I I I I I I I : 

a231 QSRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRIFSFPQSFAFPFRLNLSVGIIG 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 845>: 

g231-l . seq 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGcCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAGGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGAAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCCCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAGTCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

451 GCCGGCGGAT ACCTGCGCCA CTTCGCGCTG ATAGCGGTCG GTTTCAAACa 

501 CaCgTaCaat gagtttcgtA ccctccGCCG ccgcgcgCAG GTTGtcgcGC 

551 GAACgTGTAC CGTAagcgtg TTtcatctcc GGTGCgtcGA TATACGCCaT 

601 cCgAATTTta tGTttcgcgc cgtcgcCgtc gATGACGTGA AGGGtatcGC 

651 CgtcATAGAC TTTGGACACC Gtgcctgcgt AGctGTGGCC GGATttcgc 

This corresponds to the amino acid sequence <SEQ ID 846; ORF 231-l.ng>: 

g231-l.pep 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 
51 ARGFQTAFAV QGRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 
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101 SFPQSFAFPF RLNLSVG IVG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 
151 AGGYLRHFAL IAVGFKHTYN EFRTLRRRAQ VVARTCTVSV FHLRCVDIRH 
201 PNFMFRAVAV DDVKGIAVID FGHRACVAVA GFR 

The following partial DNA sequence was identified in N. meningitidis <SEO ID 847>- 

m231-l.soq 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGCAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCTCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAATCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

4 51 GCCGACAGAA ACCTGCGCCA CTTCGCGCTG GTAGCGGTCG GTATCGAACA 

501 CGCGCACGCT GACTTTCCTG CCTTCCGCCG CCGCGCGCAG GTTGTCGCGC 

551 GAACGCGTGC CGTAAGCCTG TTTCATCTCC GGCGCGTCGA TATACGCCAT 

601 CCGGATTTTG TGTTTCGCGC CGTCGCCGTC GATAACGTGA AGGGTGTCGC 

651 CGTCATAGAC TTTGGACACC GTGCCTGTGT AGCGGTGGCC GGATTTCGCC 

701 GATGCTCGGC GGCGGGCGGG CGCGTCGGAA CCCGCGTCCC CTGCCGCGCC 

7 51 GAGTACGTCG AGTACGGCAA CCGCCGTCCG CACCGCCTCG CTGCCGTACC 
801 CCGTATAACC CAACGCACCC AAAAGCGACA GGGCGACGGG AAGCCATTTC 

8 51 ATGATTTTTT TAATCTGCAT ATTTTTCAAA TGCCGATGCC GTCTGAACAT 
901 ATCGGAATCG GATTTCAGAC GGCATCTTAA 

This corresponds to the amino acid sequence <SEQ ID 848; ORF 231-1>: 

m231-l.pep 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 

51 ARGFQTAFAV QSRAVSLPNA QPFAHGIHPI LIAPAAPACS AVRPRRLRIF 

101 SFPQSFAFPF RLNLSVG IIG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 

151 ADRNLRHFAL VAVGIEHAHA DFPAFRRRAQ VVARTRAVSL FHLRRVDIRH 

201 PDFVFRAVAV DNVKGVAVID FGHRACVAVA GFRRCSAAGG RVGTRVPCRA 

251 EYVEYGNRRP HRLAAVPRIT QRTQKRQGDG KPFHDFFNLH IFQMPMPSEH 

301 IGIGFQTAS* 

g231-l/m231-l 87.0% identity in 262 aa overlap 

10 20 30 40 50 60 

g231-l.pep MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 

I 1 M I I I i I I I I I I I I I I I I I I I I I I I I (I | | | | | | | | i | f | | | | { | | | | | | | | | | | ), | 

m231-l MSKRKSINRP YQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 

10 20 30 40 N 50 60 

70 80 90 100 110 120 

g231-l .pep QGRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRIFSFPQSFAFPFRLNLSVGIVG 
IHIIIIIIIIIIIIIIIIIIIIIIIIII I Ml I M II I M III f I I I I I I M t I I I : I 
m231-l QSRAVSLPNAQPFAHGIHPILIAPAAPACSAVRPRRLRIFSFPQSFAFPFRLNLSVGIIG 

70 80 90 100 110 120 

130 140 150 160 " 170 180 

g231-l.pep KIRLILLFFSITFIMPRRPVLHLHQVQIGFAGGYLRHFALIAVGFKHTYNEFRTLRRRAQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | | | : | | | : : | : : : | : : | M I I 
KIRLILLFFSITFIMPRRPVLHLHQVQIGFADRNLRHFALVAVGIEHAHADFPAFRRRAQ 
130 140 150 160 170 180 



m231-l 



190 200 210 220 230 240 

g231-l . pep VVARTCTVSVFHLRCVDIRHPNFMFRAVAVDDVKGIAVIDFGHRACVAVAGFRXCPSANG 
Mill :||:|||| illll|:|:M!!lli:!l!:||il||||||||||||| j : \ : \ 
m2 3 1 - 1 VVARTRAVSLFHLRRVDI RHPDFVFRAVAVDN VKGVAVI DFGHRAC VAVAGFRRCS AAGG 

190 • 200 210 220 230 240 

250 260 
g23l-l .pep CVETHVPCSAEYVVXGNRRPHR 
11:1111111 I I I I I I I 
RVGTRVPCRAEYVEYGNRRPHRLAAVPRITQRTQKRQGDGKPFHDFFNLHIFQMPMPSEH 
250 260 270 280 290 300 



m231-l 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 849>: 

a231-l.seq 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 
51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 
101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 
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151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGCAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCCCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

3 51 CATAATCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

4 01 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 
451 GCCGACAGAA ACCTGCGCCA CTTCGCGCTG GTAGCGGTCG GTGTCGAACA 
501 CGCGGACGCT GACTTTCCTG CCTTCCGCCG CCGCGCGCAG GTTGTCGCGC 
551 GAACGCGTGC CGTAAGCCTG TTTCATCTCC GGCGCGTCGA TATACGCCAT 
601 CCGGATTTTG TGTTTCGCGC CGTCGCCGTC GATAACGTGA AGGGTGTCGC 
651 CGTCATAGAC TTTGGACACC GTGCCTGTGT AGCGGTGGCC GGATTTCGCC 
701 GATGCTCGGC GGCGGGCGGG CGCGTCGGAA CCCGCGTCCC CTGCCGCGCC 
751 GAGTACGTCG AGTACGGCAA CCGCCGTCCG CACCGCCTCG CTGCCGTACC 
801 CCGTATAACC CAACGCACCC AAAAGCGACA AGGCGACGGG AAGCCATTTC 
851 ATGATTTTTT TAATCTGCAT ATTTTTCAAA TGCCGATGCC GTCTGAACAT 
901 ATCGGAATCG GATTTCAGAC GGCATCTTAA 

This corresponds to the amino acid sequence <SEQ ID 850; ORF 231-l.a>: 

a231-l .pep 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 

51 ARGFQTAFAV QSRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 

101 SFPQSFAFPF RLNLSVG IIG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 

151 ADRNLRHFAL VAVGVEHADA DFPAFRRRAQ VVARTRAVSL FHLRRVDIRH 

201 PDFVFRAVAV DNVKGVAVID FGHRACVAVA GFRRCSAAGG RVGTRVPCRA 

251 EYVEYGNRRP HRLAAVPRIT QRTQKRQGDG KPFHDFFNLH IFQMPMPSEH 

301 IGIGFQTAS* 

a231-l/m231-l 99.0% identity in 309 aa overlap 

10 20 30 40 50 60 

a231-l . pep MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 
I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m231-l MSKRKSINRP YQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 

10 20 30 40 50 60 

70 80 90 100 110 120 

a231-l .pep QSRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRI FSFPQSFAFPFRLNLSVGIIG 
I I 1 I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I II I I I I I I I I 11 I I I I I I I I I I I I I 
m231-l QSRAVSLPNAQPFAHGIHPILIAPAAPACSAVRPRRLRIFSFPQS FAFPFRLNLSVGIIG 

70 80 90 100 110 120 

130 140 150 160 170 180 

a231-l . pep KIRLILLFFS I TFIMPRRPVLHLHQVQIGFADRNLRHFALVAVGVEHADADFPAFRRRAQ 
I M I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I II I 
m2 3 1 - 1 KIRLILLFFSI T FIMPRRPVLHLHQVQIGFADRNLRHFAL V AVG I EHAH ADFPAFRRRAQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

a231-l.pep VVARTRAVSLFHLRRVDIRHPDFVFRAVAVDNVKGVAVIDFGHRACVAVAGFRRCSAAGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m2 3 1 - 1 VVARTRAVSLFHLRRVDI RHPDFVFRAVAVDN VKGVAVI DFGHRAC VAVAGFRRCS AAGG 

190 200 210 220 230 240 

250 260 270 280 290 300 

a231-l.pep RVGTRVPCRAEYVEYGNRRPHRLAAVPRITQRTQKRQGDGKPFHDFFNLH IFQMPMPSEH 
I M I I I I I I I I I I I ( I I I M 1 I M I I I I I II I I I I I I II II I I I I I I I I I I I I I i I I I I I 
m231-l RVGTRVPCRAEYVEYGNRRPHRLAAVPRITQRTQKRQGDGKPFHDFFNLHIFQMPMPSEH 

250 260 270 280 290 300 

310 

a231-l .pep IGIGFQTASX 
I I I I I I I I I I 
m231-l IGIGFQTASX 
310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 85 1>: 

g232 . seq 

1 atgatgggca acagcctgat tgaatccggt acgtttgtcg ccatcctgtt 

51 tggtcagatt ttgggaacgg cggttgccgg cgcgccgcct tatattgtcg 

101 ggatactggt tttgctggtc gccgtcggag gaacggccgg cagcctgttt 
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151 atgccgtccg tacccgccaa ggctgccgat acccaaatcg agtggaatat 

201 tgtccgtggt acaaaatccc tgctgcgtga aacggtgcgg cacaatcccg 

251 tttttaccgc cattatcggc atctcgtggt tttggtttgt cggcgcggtt 

301 tataccacgc aactgccgac ctttacccaa atccatttgg gcggcaacga 

351 taatgttttt aacctgatgc ttgctttgtt ttccatcggt attgccgccg 

401 gttcggtact gtgtgccaag ttcggcaggg aacggctgat gttggcttgg 

451 gtaacggttg gtgcgttggg ttcgacggtt tgcgg.cctgg ttttggtgtg 

501 gctgacgcac ggacaccgtt ttgaagggct gaacggcatt ttttggtttt 

551 tatcgcaagg atgggcatac cccgtgatgg cggtgatgac gctgatcggc 

601 tttttcggcg gatttttctc cgttccgctc tatacctggc tgcaaaccgc 

651 cagcagcgag actttccgcg cccgcgccgt tgccgccaac aatatcgtta 

701 acggcatctt tatggtttcc gccgccgttt tgagcgcggt attgctgttt 

751 ttgtttgaca gcatttccct gctgtatctg attgtcgcct tgggcaatat 

801 tccgttggcg gtatttttga ttaagcgcga aaggcggttt ttaggcgcgg 

851 cggcaatcag gaaaaaacct tga 



This corresponds to the amino acid sequence <SEQ ID 852; ORF 232.ng>: 

g232 .pep 



1 MMGNSLIESG TFVAILFGQI LGTAVAGAPP YIVGILVLLV AVGGTAGSLF 

51 MPSVPAKAAD TQIEWNIVRG TKSLLRETVR HNP VFTAIIG ISWFWFVGAV 

101 YTTQLPTFTQ IHLGGNDNVF NLMLALFSIG IAAGSVL CAK FGRERLMLAW 

151 VTVGALGSTV CGLVLVW LTH GHRFEGLNGI FWFLSQGWAY P VMAVMTLIG 

201 FFGGFFSVPL YTWLQTASSE TFRARAVAAN NIVNG IFMVS AAVLSAVLLF 

251 LFDSISLLYL IVALGNIPLA VFLIKRERRF LGAAAIRKKP * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 853>: 



1 ATGATGGGCA ACAGCCTGAT TGAATCGGGT ACGTTTGTCG CCATCCTGTT 

51 CGGTCAGATT TTGGGAACGG CGGTGGCAGG TGTACCGCCT TATATTGTCG 

101 GGATACTGGT TTTGCTGGTC GCCGTCGGAG GCACGGTCGG CAGCCTGTTT 

151 ATGCCGTCCG TACCCGCCAA GGCTGCCGAT ACACAAATTG AGTGGAATAT 

201 TGTCCGTGGC ACAAAATCCC TGCTGCGTGA AACGGTGCGG CACAAGCCCG 

251 TTTTTACCGC CATTATCGGT ATTTCGTGGT TTTGGTTTGT CGGCGCGGTT 

301 TATACCACGC AACTGCCGAC CTTTACCCAA ATCCATCTGG GCGGCAACGA 

351 CAATGTTTTC AACCTGATGC TTGCTCTGTT TTCCATCGGT ATTGCCGCCG ' 

401 GTTCGGTACT GTGTGCCAAG TTCAGCAkGG AACGCCTGAT GTTGGCTTGG 

451 GTAACGGTTG GTGCGTTGGG TTTGACGGTT TGCGGCTTGG TTTTGGTGTG 

501 GCTGACGCAC GGACACCGTT TTGAAGGGCT GAACGGCATT TTTTrGTTTT 

551 TATCGCAAGG ATGGGCATAT CCCGTGATGG CGGTGATGAC GCTGATCGGC 

601 TTTTTCGGCG GATTTTTCTC CGTTCCGCTC TATACCt (g) TG CAAACCGCCa 

651 TAGCGAGaTT TCCGCGCCCg GCCGTTGCCG CCAACAATAT CGTTAACGGT 

701 ATTTTTATGG TTTCCGCTGC CGTTTTGAGC GCGGTGTTGC TGTTTTTGTT 

751 TGACAGCATT TCCTTGTTGT ATCTGATTGT CGCTTTGGGC AATATTCCGT 

801 TGTCGGTATT TTTGATTAAG CGCGAAAGGC GGTTTTTAGG CGCGGCGGCA' 

851 ATCAGGAAAA AACCTTGA 



This corresponds to the amino acid sequence <SEQ ID 854; ORF 232>: 

m232 .pep 



1 MMGNSLIESG TFVAILFGQI LGTAVAGVPP YIVGILVLLV AVGGTVGSLF 

51 MPSVPAKAA D TQIEWNIVRG TKSLLRETVR HKP VFTAIIG ISWFWFVGAV 

101 YTTQLPTFTQ IHLGGNDNVF N LMLALFSIG IAAGSVL CAK FSXERLMLAW 

151 VTVGALGLTV CGLVLV WLTH GHRFEGLNGI FXFLSQGWAY P VMAVMTLIG 

201 FFGGFFSV PL YTVQTAIARF PRPAVAANNI VNG IFMVSAA VLSAVLLFLF 

251 DSISLLYLIV ALGNIPLSVF LIKRERRFLG AAAIRKKP* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 232 shows 94.1% identity over a 290 aa overlap with a predicted ORF (ORF 232.ng) 
from N. gonorrhoeae: 



m232 . seq 



m232/g232 



m23 2 .pep 



10 20 30 40 50 60 

MMGNSLIESGTFVAILFGQILGTAVAGVPPYIVGILVLLVAVGGTVGSLFMPSVPAKAAD 



g232 
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10 



20 



30 



40 



50 



60 



70 80 90 100 110 120 

m232 .pep TQIEWNIVRGTKSLLRETVRHKPVFTAIIGISWFWFVGAVYTTQLPTFTQIHLGGNDNVF 

II I II IMIMMIMI IIMMIII MMMI 1 1 MM MM I MM IMIIIMi Ml 

g23 2 TQIEWNIVRGTKSLLRETVRHNPVFTAIIGISWFWFVGAVYTTQLPTFTQIHLGGNDNVF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 3 2 . pep NLMLALFSIGIAAGSVLCAKFSXERLMLAWVTVGALGLTVCGLVLWLTHGHRFEGLNGI 

I I I I I I II I I I I I I I I I II I I : II II M M 1 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

g2 3 2 NLMLALFS IGI AAGSVLCAKFGRERLMLAWTVGALGSTVCGLVLVWLTHGHRFEGLNGI 

130 140 150 160 170 180 

190 200 210 220 230 

m2 3 2 . pep FXFLSQGWAYPVMAVMTLIGFFGGFFSVPLYT - VQTAI ARFPRP - AVAANNI VNGI FMVS 

I II I M 1 1 1 II I Ml I M II I ill M II M I MM - I 1 1 II M 1 1 1 1 1 1 1 1 1 

g232 FWFLSQGWAYPVMAVMTLIGFFGGFFSVPLYTWLQTASSETFRARAVAANNI VNGI FMVS 

190 200 210 220 230 240 

240 250 260 270 280 289 

m232 . pep AAVLSAVLLFLFDS I SLLYLI VALGNI PLSVFLI KRERRFLGAAAI RKKPX 

M MM Ml II Mill MMM III I! II Ml I M I II I M MMMM I 

g23 2 AAVLSAVLLFLFDS I SLLYLI VALGNI PLAVFL I KRERRFLGAAAIRKKP 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 85 5>: 



a232.seq 
1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 



ATGTACGCTA 
ACCTCTTTTC 
AAACCGCGCT 
. TTCCTGCCTG 
GCCGTATTTC 
ACAAGGCCGT 
GCGGTGGCGG 
GTGTCTGTTT 
ACGCCATCCT 
AGCCTGATTG 
GGGGACTGCG 
TGCTGGTCGC 
CCCGCCAAGG 
AAAATCCCTG 
TTATCGGTAT 
CTGCCGACCT 
CCTGATGCTT 
GTGCCAAGTT 
GCGTTGGGTT 
ACACCGTTTT 
GGGCATATCC 
TTTTTCTCCG 
TTTCCGCGCC 
TGGTTTCCGC 
ATTTCCTTGT 
ATTTTTGATT 
AAAAACCTTG 



AAAAAGGCGG 
GCTACGCAGT 
GTTTGTGATG 
CCGGACAGAT 
CTGTTTTCCT 
TTTGGCGCGT 
CATACGGGTT 
TGCATGGGCG 
GCCCGATTAT 
AATCGGGTAC 
GTGGCAGGTG 
CGTAGGAGGC 
CTGCCGATAC 
CTGCGTGAAA 
TTCGTGGTTT 
TTACCCAAAT 
GCCCTGTTTT 
CAGCAGGGAA 
TGACGGTTTG 
GAAGGGCTGA 
CGTGATGGCG 
TTCCGCTCTA 
CGCGCCGTTG 
TGCCGTTTTG 
TGTATCTGAT 
AAGCGCGAAA 
A 



TTTGGGACTG 
TTCTCGGCGC 
ATTGGGTTTT 
GTTGAACTTG 
CGCTGTCGGG 
TGGGCCAAGG 
TTATATCCGG 
CGCAATCGAC 
CTCGACGACA 
GTTTGTCGCC 
TACCGCCTTA 
ACGGTCGGCA 
ACAAATTGAG 
CGGTGCGGCA 
TGGTTTGTCG 
CCATCTAGGC 
CCATCGGTAT 
CGGCTGAGGT 
CGGCTTGGTT 
ACGGCATTTT 
GTGATGACGC 
TACCTGGCTG 
CCGCCAACAA 
AGCGCGGTGT 
TGTCGCTTTG 
GGCGGTTTTT 



GTTAAAAGCC 
GTTCAACGAC 
ACGGTTTGGG 
GGCGCGTTGC 
GCAGTTGGGT 
TGCTGGAAAT 
TCTGCCCCGC 
GCTGTTCGGG 
AAGAGTTGAT 
ATCCTGTTCG 
TATTGTCGGG 
GCCTGTTTAT 
TGGAATATTG 
CAAGCCCGTT 
GCGCGGTTTA 
GGCAACGACA 
TGCCGCCGGT 
TGGCTTGGGT 
TTGGTGTGGC 
TTGGTTTTTA 
TGATCGGCTT 
CAAACCGCCA 
TATCGTTAAC 
TGCTGTTTTT 
GGCAATATTC 
AGGCGCGGCG 



GCCGTTTCGC 
AATGTGTTCA 
GCAAAACGGC 
TGTTTATTTT 
AACAAATTCG 
GATCATTATG 
TGCTTTTGGC 
CCGCTGAAAT 
GATGGGCAAC 
GTCAGATACT 
ATACTGGTTT 
GCCGTCCGTA 
TCCGGGGTAC 
TTTACCGCCA 
TACCACGCAA 
ATGTTTTCAA 
TCGGTACTGT 
AACGGTTGGT 
TGACGCACGG 
TCGCAAGGAT 
TTTCGGCGGA 
GTAGCGAGAC 
GGTATTTTTA 
GTTTGACAGC 
CGTTGTCGGT 
GCAATCAGGA 



This corresponds to the amino acid sequence <SEQ ID 856; ORF 232. a>: 

a232 .pep 

1 MYAKKGGLGL VKSRRFAPLF ATQFLGAFND NVFKTALFVM IGFYGLGQNG 

51 FLPAGQ MLNL GALLFILPYF LFS SLSGQLG NKFD KAVLAR WAKVLEMIIM 

101 AVAAYGFYIR SAP LLLACLF CMGAQSTLFG PLKYAILPDY LDDKELMMGN 

151 SLIESGT FVA ILFGQILGTA VAGV PPYIVG ILVLLVAVGG TVGSLFM FSV 

201 PAKAADTQIE WNIVRGTKSL LRETVRHKP V FTAIIGISWF WFVGAV YTTQ 

251 LPTFTQIHLG GNDNVFNLML ALFSIGIAAG SVLCAKFSRE RLRLAWVTVG 



FCT/UfcOT/tWIO 
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301 ALGLTVCGLV LV WLTHGHRF EGLNGIFWFL SQGWAYP VMA VMTLIGFFGG 

351 FFSVPLYTWL QTASSETFRA RAVAANNIVN G IFMVSAAVL SAVLLFLF DS 

401 ISL LYLIVAL GNIPLSVFLI KRERRFLGAA AIRKKP* 

m232/a232 95.9% identity in 290 aa overlap 

10 20 30 

m232 . pep MMGNSLIESGTFVAILFGQILGTAVAGVPP 

I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I 
a232 ACLFCMGAQSTLFGPLKYAILPDYLDDKELMMGNSLIESGTFVAILFGQILGTAVAGVPP 
120 130 140 150 160 170 

40 50 60 70 80 90 

m232 . pep YIVGILVLLVAVGGTVGSLFMPSVPAKAADTQIEWNIVRGTKSLLRETVRHKPVFTAIIG 
I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a232 YIVGILVLLVAVGGTVGSLFMPSVPAKAADTQIEWNIVRGTKSLLRETVRHKPVFTAIIG 
180 190 200 210 220 230 

100 110 120 130 140 150 

m232 .pep ISWFWFVGAVYTTQLPTFTQIHLGGNDNVFNLMLALFSIGIAAGSVLCAKFSXERLMLAW 
I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I III III 
a232 ISWFWFVGAVYTTQLPTFTQIHLGGNDNVFNLMLALFSIGIAAGSVLCAKFSRERLRLAW 
240 250 260 270 280 290 

160 170 180 190 200 210 

m232 . pep VTVGALGLTVCGLVLVWLTHGHRFEGLNGIFXFLSQGWAYPVMAVMTLIGFFGGFFSVPL 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a232 VTVGALGLTVCGLVLVWLTHGHRFEGLNGIFWFLSQGWAYPVMAVMTLIGFFGGFFSVPL 

300 310 320 330 340 350 

220 230 240 250 260 

m232 . pep YT-VQTAIARFPRP-AVAANNIVNGIFMVSAAVLSAVLLFLFDSISLLYLIVALGNIPLS 

II : I I I : : I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I II I II 
a232 YTWLQTAS SET FRARAVAANN I VNGI FMVS AAVLS AVLLFLFDS I SLL YL I VALGN I PLS 

360 370 380 390 400 410 



270 280 289 

m232 .pep VFLIKRERRFLGAAAIRKKPX 
I I I I I I I I I I I I I I I I I I I I I 
a232 VFLIKRERRFLGAAAIRKKPX 
420 430 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 85 7>: 

g233 . seq 

1 atgaaacgca aaaatatcgc gctgattccc gccgccggca tcggggtgcg 

51 tttcggtgcg gacaaaccca agcaatatgt cgaaatcgga agcaaaaccg 

101 ttttagaaca tgtacttggg atttttgaac ggcatgaggc cgtcgatttg 

151 accgtcgttg tcgtctcgcc cgaagacacg tttgccgata aggttcagac 

201 ggcatttcca caggttcggg tgtggaaaaa cggtggacag acccgcgccg 

251 aaactgtccg caacggtgtg gcaaaactgt tggaaaccgg tttggcggcg 

3 01 gaaaccgaca atattctggt acacgatgcc gcccgctgct gcctgccgtc 

3 51 tgaagctctg gcgcggttga tagaacaggc gggcaacgcc gccgaaggcg 

4 01 ggattttggc agttcccgtt gccgatacgc tcaagcgcgc agaaagcgga 
451 caaatcagtg caactgtcga ccgttcgggg ctttggcagg cgcaaacgcc 
501 gcagcttttt caagcgggtt tgctgcaccg cgcattggct gcggaaaact 
551 tgggcggcat taccgatgaa gcgtccgccg tggaaaaact gggtgtgcgt 
601 ccgctactga tacagggcga cgcgcgcaat ttgaaactga cgcagccgca 
651 ggacgcatac atcgtcaggc tgctgctcaa tgccgtctga 

This corresponds to the amino acid sequence <SEQ ID 858; ORP 233.ng>: 

g233 .pep 

1 MKRKNIALIP AAGIGVRFG A DKPKQYVEIG SKTVLEHVLG IFERHEAVDL 

51 TWWSPEDT FADKVQTAFP QVRVWKNGGQ TRAETVRNGV AKLLETGLAA 

101 ETDNILVHDA ARCCLPSEAL ARLIEQAGNA AEGGILAVPV ADTLKRAESG 
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151 QISATVDRSG LWQAQTPQLF QAGLLHRALA AENLGGITDE ASAVEKLGVR 
201 PLLIQGDARN LKLTQPQDAY IVRLLLNAV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 859>: 

m233.seq (partial) 



1 


ATGAAGCGCA 


51 


TTTCGGTGCG 


101 


TTTTAGAACA 


151 


ACCGTCGTTG 


201 


GGCATTTCCA 


251 


AAACCGTCCG 


301 


GAAACCGACA 


351 


TGAAGCTTTG 


401 


GGATTTTGGC 


451 


AACATT .... 



This corresponds to the amino acid sequence <SEQ ID 860; ORF 233>: 

m233.pep (partial) 

1 MKRKNIALIP AAGIGARFG A DKPKQYVEIG SKTVLEHTIG I FERHEAVDL 
51 TWWSPEDT FADKVQTAFP QVRVWKNGGQ TRAETVRNGV AKLLETGLAA 
101 ETDNILVHDA ARCCLPSEAL TRLIEQAGNA AEGGILAIPI ADTLKCADGG 
151 NI 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 233 shows 93.4% identity over a 152 aa overlap with a predicted ORF (ORF 233.ng) 
from N. gonorrhoeae: 

m233/g233 



10 20 30 40 50 60 

m233 .pep MKRKNIALIPAAGIGARFGADKPKQYVEIGSKTVLEHTIGIFERHEAVDLTWWSPEDT 

II 1 1 Ml I III Mlhl II I ill I III Mill MM !::| II Ml Mil I II 1 1 MINI 

g233 MKRKNIALIPAAGIGVRFGADKPKQWEIGSKWLEHVLGIFERHEAVDLTVVVVSPEDT 

10 20 30 40 50 60 



70 80 90 100 110 120 

m23 3 . pep FADKVQTAFPQVRWKNGGQTRAETVRNGVAKLLETGLAAETDNILVHDAARCCLPSEAL 

MMMMIMMIMMMMMMMMMMMMMMMMMMMMMMM 

g23 3 FADKVQTAFPQVRWKNGGQTRAETVRNGVAKIjLETGLJUVETDNILVHDAARCCLPSETVL 

70 80 90 100 110 120 



130 140 150 

TRL I EQAGNAAEGGI LAI P I ADTLKCADGGNI 

: II II 1 1 1 M 1 1 II 1 1 h h 1 1 1 1 1 h M M 

ARLIEQAGNAAEGGILAVPVADTLKRAESGQISATVDRSGLWQAQTPQLFQAGLLHRALA 
130 140 150 160 170 180 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 861 >: 

a233 . seq 

1 ATGAAGCGCA AAAATATCGC GCTGATTCCC GCCGCCGGCA TCGGGGCGCG 

51 TTTCGGTGCG GACAAACCCA AGCAATATGT CGAAATCGGA AGCAAAACCG 

101 TTTTAGAACA TACGATTGGG ATTTTTGAAC GGCATGAGGC CGTCGATTTG 

151 ACCGTCGTTG TCGTCTCGCC CGAAGACACG TTTGCCGATA AGGTTCAGAC 

201 GGCATTTCCA CAGGTTCGGG TGTGGAAAAA CGGCGGACAG ACCCGCGCCG 

251 AAACTGTCCG CAACGGTGTG GCAAAATTGT TGGAAACCGG TTTGGCGGCG 

301 GAAACCGACA ATATTCTGGT ACACGATGCC GCGCGTTGCT GCCTGCCGTC 

351 TGAAGCTTTG ACGCGGTTGA TAGAACAGGC GGGCAACGCT GCCGAAGGTG 

4 01 GGATTTTGGC AATTCCCGTT GCCGATACGC TCAAGTGCGC GGACGGTGGG 

451 AACATTAGTG CAACCGTCGA GCGGACGAGC CTTTGGCAGG CGCAAACGCC 

501 GCAGCTTTTC CGCGCCGGGC TGCTGCACCG CGCATTGGCT GCGGAAAACT 

551 TGGACGGCAT TACCGATGAA GCGTCCGCCG TGGAAAAATT GGGCATCCGC 

601 CCTTTGCTGG TGCAGGGCGA CGCGCGCAAT TTGAAACTGA CGCAGCCGCA 

651 GGACGCATAC ATCGTCAGGC TGCTGCTCGA TGCCGTCTGA 



m233 -pep 
g233 
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This corresponds to the amino acid sequence <SEQ ID 862; ORF 233.a>: 

a233.pep 

1 MKRKNIALIP AAGIGARFG A DKPKQYVEIG SKTVLEHTIG IFERHEAVDL 

51 TVWVSPEDT FADKVQTAFP QVRVWKNGGQ TRAETVRNGV AKLLETGLAA 

101 ETDNILVHDA ARCCLPSEAL TRLIEQAGNA AEGGILAIPV ADTLKCADGG 

151 N I SAT VERTS LWQAQTPQLF RAGLLHRALA AENLDGITDE ASAVEKLGIR 

201 PLLVQGDARN LKLTQPQDAY IVRLLLDAV* 

m233/a233 99.3% identity in 1 52 aa overlap 

10 20 30 40 50 60 

MKRKNIALIPAAGIGARFGADKPKQYVEIGSKTVLEHTIGIFERHEAVDLTWWSPEDT 
IIMIIIIIIIllIIIIIIIMIIIIMIIIIIIIIIlllllijiiMMIIIIIIIIII 
MKRKN I ALI PAAGIGARFGADKPKQYVEIGSKTVLEHT IG I FERHEAVDLTVWVS PE DT 
10 20 30 40 50 60 

7 0 ■ 80 90 100 110 120 

FADKVQTAFPQVRVWKNGGQTRAETVRNGVAKLLETGLAAETDNILVH DAARCCLPSEAL 

I I I M I I I I I I I I II I 1 1 1 1 1 1 1 I 1 1 II I I I I I I I I I I I I I I | | | | | | | | | | | | | 1 1 | | j 

FADKVQTAFPQVRVWKNGGQTRAETVRNGVAKLLETGLAAETDN I LVH DAARCCLPSEAL 
70 . 80 90 100 110 120 

130 140 150 

TRLIEQAGNAAEGGILAIPIADTLKCADGGNI 
MMIIIIIIIIIIIIMl:|||||||||||| 

TRLIEQAGNAAEGGILAIPVADTLKCADGGNISATVERTSLWQAQTPQLFRAGLLHRALA 
130 140 150 160 170 180 

AENLDGITDEASAVEKLGIRPLLVQGDARNLKLTQPQDAYIVRLLLDAVX 
190 200 210 220 230 



m233 . pep 
a233 

m233 . pep 
a233 

m233 .pep 
a233 

a233 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 863>: 

g234.seq 

1 atgaaaaccg tttccgccgc catcgctttt gccgccgctg ccgtttcact 

51 gaccggctgt gcgaccgagt cctcacgcag cctcgaggtt gcaaaagtcg 

101 cctcctgcaa tacgcaatat cacggtgttc gcaccccgat ttccgtcgga 

151 acattcgaca accgctccag cttccaaaaa ggcattttct ccgacagtga 

201 agaccgtctg ggcagccagg caaaaaccat cctggtaaca cacctgcaac 

251 aaaccaaccg cttcaacgta ctgaaccgca ccaaccttag cgcattgaaa 

301 caggaatccg gcatttccgg caaagcgcag aacctgaaag gcgcagatta 

351 tgtcgttacc ggcgatgtaa ccgaattcgg acgcagagat gtcggcgatc 

401 atcagctctt cggcattttg ggtcgcggca aatcgcaaat cgcctatgca 

451 aaagtggctc tgaatatcgt caacgtcaat acttccgaaa tcgtctattc 

501 cacacagggc gcgggcgaat acgcactttc caaccgcgaa atcatcggtt 

551 tcggcggcac ttccggctac gatgcgactt tgaacggcaa agttttagac 

601 ttggcaatcc gcgaagccgt cgacaacttg gttcaggctg tcgacaacgg 

651 cgcatggcaa tccaaccgtt aa 

This corresponds to the amino acid sequence <SEQ ID 864; ORF 234.ng>: 

g234 .pep 

1 MKTVSAAIAF AAAA VSLTGC ATESSRSLEV AKVASCNTQY HGVRTPISVG 

51 TFDNRSSFQK GIFSDSEDRL GSQAKTILVT HLQQTNRFNV LNRTNLSALK 

101 QESGISGKAQ NLKGADYWT GDVTEFGRRD VGDHQLFGIL GRGKSQIAYA 

151 KVALNIVNVN TSEIVYSTQG AGEYALSNRE IIGFGGTSGY DATLNGKVLD 

2 01 LAI REAVDNL VQAVDNGAWQ SNR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 865>: 

m234.seq (partial) 

1 . . . GGCGCGGGCG AATACGCACT TTCCAACCGt GAAATCATCG GTTTCGGCGG 

51 CACTTCCGGC TACGATGCGA CTTTGAACGG CAAAGTTTTA GACTTGGCAA 

101 TCCGCGAAGC . gTCAACAGC CTGGTTCAGG CTGTTGACAA CGGCGCATGG 

151 CAACCCAACC GTTAA 

This corresponds to the amino acid sequence <SEQ ID 866; ORF 234>: 
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m234 .pep (partial) 

1 . . GAGEYALSNR EIIGFGGTSG YDATLNGKVL DLAIREAVNS LVQAVDNGAW 
51 QPNR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 234 shows 94.4% identity over a 54 aa overlap with a predicted ORF (ORF 234.ng) 
from N. gonorrhoeae: 

m234/g234 

10 20 30 

m2 3 4 pep GAGEYALSNREI IGFGGTSGYDATLNGKVL 

I I I I 1 I I I I I I I I I I I I I I II I I I I I I I I I 
g2 34 LGRGKSQIAYAKVALNIVNVNTSEIVYSTQGAGEYALSNRE I IGFGGTSGYDATLNGKVL 

140 150 160 170 180 190 

40 50 
m2 3 4 . pep DLAI REAVNS LVQAVDNGAWQPNRX 

I I II I II |::| MM I lllll Ml 
g 2 3 4 DLAIREAVDNLVQAVDNGAWQSNRX 

200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 867>: 

a234 . seq (partial) 

1 AACCGCACCT ATTTGAACGC ATTAAAACAG GAATCCGGCA TTTCCGGCAA 

51 AGCGCATAAC CTGAAAGGCG CAAATTATGT CGNNACCGGC GATGTAACCG 

101 AATTCGGACG CANAGATGTC GGCGATCATC AGCTCTTCGG CATTTTGGGT 

151 CGCGGCAAAT CGCAAATCGC CTATGCAAAA GTGGCTCTGA ATATCGTCAA 

201 CGTCAATACT TCCGAAATCG TCTATTCCGC ACAGGGCGCG GGCGAATACG . 

251 CACTTTCCAA CCGTGAAATC ATCGGTTTCG GCGGCACTTC CGGCTACGAT 

301 GCGACTTTGA ACGGCAAAGT TTTAGACTTG GCAATCCGCG AAGCCGTCAA 

351 CAGCCTGGTT CAGGCTGTTG ACAACGGCGC ATGGCAACCC AACCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 868; ORF 234,a>: 

a234 .pep (partial) 

1 NRTYLNALKQ ESGISGKAHN LKGANYVXTG DVTEFGRXDV GDHQLFGILG 

51 RGKSQIAYAK VALNIVNVNT SEIVYSAQGA GEYALSNREI IGFGGTSGYD 

101 ATLNGKVLDL AI REAVNS LV QAVDNGAWQP NR* 

m234/a234 1 00.0% identity in 54 aa overlap 

10 

30 

m234 .pep 

GAGEYALSNREI IGFGGTSGYDATLNGKVL 

I I I I I I I I I I 1 I I t I I I I I I I I I I I I I I I I 

a234 

LGRGKSQIAYAKVALNIVNVNTSEIVYSAQGAGEYALSNRE I IGFGGTSGYDATLNGKVL 

50 60 70 80 90 100 

40 50 
m234 .pep DLAI REAVNS LVQAVDNGAWQPNRX 

I I I I I I I I II I I I I I I I I I I I I I I 1 
a 234 DLAI REAVNS LVQAVDNGAWQPNRX 

110 120 130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 869>: 

g235 . seq 
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1 atgaaacctt tgattttagg gcttgccgcc gtgttggctc tgtctgcctg 

51 ccaagttcga aaagctcccg acctcgacta cacgtcattc aaagaaagca 

101 aaccggcttc aattttggtg gttccgccgc tgaacgagtc gcctgatgtc 

151 aacggcactt gggggatgct ggcttcgacc gccgcgccga tttccgaagc 

201 cggctattac gtctttcccg ccgcagtcgt ggaggaaacc ttcaaagaaa 

251 acggcttgac caatgccgcc gatattcacg ccgtccggcc ggaaaaactg 

301 catcaaattt tcggcaatga tgcggttttg tacattacgg ttaccgaata 

351 cggcacttca tatcaaattt tagacagcgt gacgaccgta tccgccaaag 

401 cacggctggt cgattcccgc aacgggaaag agttgtggtc gggttcggcc 

451 agcatccgcg aaggcagcaa caacagcaac agcggcctgt tgggggcttt 

501 ggtcggcgca gtggtcaatc agattgccaa cagcctgacc gaccgcggtt 

551 atcaggtttc caaaaccgcc gcatacaacc tactgtcgcc ctattcccgc 

601 aacggtatct tgaaaggtcc gagattcgtc gaagagcagc ccaaataa 

This corresponds to the amino acid sequence <SEQ ID 870; ORF 235.ng>: 

g235.pep 

1 MKPLILGLAA VLALS ACQVR KAPDLDYTSF KESKPASILV VPPLNESPDV 

51 NGTWGMLAST AAPISEAGYY VFPAAWEET FKENGLTNAA DIHAVRPEKL 

101 HQIFGNDAVL YITVTEYGTS YQILDSVTTV SAKARLVDSR NGKELWSGSA 

151 SIREGSNNSN SGLLGALVGA WNQIANSLT DRGYQVSKTA AYNLLSPYSR 

201 NGILKGPRFV EEQPK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 871 >: 

m235 . seq 

1 ATGAAACCTT TGATTTTAGG GCTTGCCGCC GTGTTGGCGC TGTCTGCCTG 

51 CCAAGTTCAA AAAGCGCCCG ATTTCGACTA CACGTCATTC AAGGAAAGCA 

101 AACCGGCTTC AATTTTGGTG GTTCCGCCGC TGAACGAATC GCCCGATGTC 

151 AACGGAACAT GGGGTGTACT GGCTTCGACC GCCGCGCCGC TTTCCGAAGC 

201 CGGCTATTAC GTCTTCCCCG CCGCAGTCGT GGAGGAAACC TTCAAAGAAA 

251 ACGGCTTGAC CAATGCCGCC GATATTCACG CCGTCCGGCC GGAAAAACTG 

301 CATCAGATTT TCGGCAATGA TGCGGTTTTG TACATTACGG TTACCGAATA 

351 CGGCACTTCA TATCAAATTT TAGACAGCGT GACGACCGTA TCCGCCAAAG 

4 01 CACGGCTGGT CGATTCCCGC AACGGAAAAG AGTTGTGGTC GGGTTCGGCC 

4 51 AGCATCCGCG AAGGCAGCAA CAACAGCAAC AGCGGCCTGT TGGGGGCTTT 

501 GGTCAGCGCA GTGGTCAATC AGATTGCCAA CAGCCTGACC GACCGCGGTT 

551 ATCAGGTTTC CAAAACCGCC GCATACAACC TGCTGTCGCC CTATTCTCAC 

601 AACGGCATCT TGAAAGGTCC GAGATTCGTT GAAGAGCAGC CCAAATAA 

This corresponds to the aminq acid sequence <SEQ ID 872; ORF 23 5>: 

m235.pep 

1 MKPLILGLAA VLALS ACQVQ KAPDFDYTSF KESKPASILV VPPLNESPDV 

51 NGTWGVLAST AAPLSEAGYY VFPAAWEET FKQNGLTNAA DIHAVRPEKL 

101 HQIFGNDAVL YITVTEYGTS YQILDSVTTV SAKARLVDSR NGKELWSGSA 

151 SIREGSNNSN SGLLGALVSA WNQIANSLT DRGYQVSKTA AYNLLSPYSH 

201 NGILKGPRFV EEQPK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 235 shows 96.7% identity over a 215 aa overlap with a predicted ORF (ORF 235.ng) 
from N. gonorrhoeae: 

m235/g235 

m235 .pep 
g235 



m23 5 .pep 
g235 

70 80 90 100 110 120 

130 140 150 160 170 180 



10 20 30 40 50 . 60 

MKPLI LGLAAVLALS ACQVQKAPDFDYTS FKESKPAS I LWP PLNES PDVNGTWGVLAST 

1 1 1 M M Mi M Ml i I Ihlllhll !,l II II I Ml I ! i III 1 1 II iiil Mhl M I 

MKP L I LGLAAVLALS ACQVRKAPDLDYTS FKE S KP AS I LWP PLNE S P DVNGTWGMLAS T 
10 20 30 40 50 60 

70 80 90 100 110 120 

AAPLS E AG YWFPAAWEETFKQNGLTNAAD I HAVRPEKLHQ I FGNDAVL YITVTEYGTS 

MhM I III M II I II IMMM III Ml II III M 1 1 M 1 1 1 M II M IM III Ml 

AAP I SEAGY YVF PAAWEETFKENGLTNAADI HAVRPEKLHQ I FGNDAVLY I TVTE YGTS 
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m23 5 . pep YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 

II IIMIII M II IIIIIMIIMill II I II II Ml 1 1 Ml I II I MM i i II i! ! 

g235 YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 

130 140 150 160 170 180 



190 200 210 

m235 .pep DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPKX 

M Mi MM I Ml IMIIhlllli II 1 1 1 M II : I 

g23 5 DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPKX 

190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 873>: 

a235 . seq 

1 ATGAAACCTT TGATTTTAGG GCTTGCCGCC GTGTTGGCGC TGTCTGCCTG 

51 CCAAGTTCAA AAAGCGCCCG ATTTCGACTA CACGTCATTC AAGGAAAGCA 

101 AACCGGCTTC AATTTTGGTG GTTCCGCCGC TGAACGAATC GCCCGATGTC 

151 AACGGAACAT GGGGTGTACT GGCTTCGACC GCCGCGCCGC TTTCCGAAGC 

201 CGGCTATTAC GTCTTCCCCG CCGCAGTCGT GGAGGAAACC TTCAAACAAA 

251 ACGGCTTGAC CAATGCCGCC GATATTCACG CCGTCCGGCC GGAAAAACTG 

301 CATCAGATTT TCGGCAATGA TGCGGTTTTG TACATTACGG TTACCGAATA 

351 CGGCACTTCA TATCAAATTT TAGACAGCGT GACGACCGTA TCCGCCAAAG 

4 01 CACGGCTGGT CGATTCCCGC AACGGAAAAG AGTTGTGGTC GGGTTCGGCC 

4 51 AGCATCCGCG AAGGCAGCAA CAACAGCAAC AGCGGCCTGT TGGGGGCTTT 

501 GGTCAGCGCA GTGGTCAATC AGATTGCCAA CAGCCTGACC GACCGCGGTT 

551 ATCAGGTTTC TAAAACCGCC GCATACAACC TGCTGTCGCC CTATTCTCAC 

601 AACGGCATCT TGAAAGGTCC GAGATTCGTC GAAGAGCAGC CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 874; ORF 235. a>: 

a235 . pep 

1 MKPLILGLAA VLALSA CQVQ KAPDFDYTSF KESKPASILV VPPLNESPDV 

51 NGTWGVLAST AAPLSEAGYY VFPAAWEET FKQNGLTNAA DIHAVRPEKL 

101 HQIFGNDAVL YITVTEYGTS YQILDSVTTV SAKARLVDSR NGKELWSGSA 

151 SIREGSNNSN SGLLGALVSA WNQIANSLT DRGYQVSKTA AYNLLSPYSH 

201 NGILKGPRFV EEQPK* 

m235/a235 100.0% identity in 215 aa overlap 

10 20 30 40 50 60 

m235.pep MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
I I M I I I I I I I I I 1 I I I I I I I I I I ! I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I 
a235 MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 

10 20 30 40 50 60 

70 80 90 100 110 120 

m235 . pep AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 ii 1 1 r 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

a235 AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m235.pep YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 
a235 YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 

130 140 150 160 170 180 



190 200 210 

m235 .pep DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPKX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a235 DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPKX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 875>: 

g236 . seq 
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1 ATGGCGCGTT TCGCCTTCTC CGCCGACATT CTCCGCACAG CGTTTGCAGA 

51 CGGTTTCATA ACCTGCAACC GCGCCCACAT CGCGGGTGTA ATGCCAGCAG 

101 CGTTCGCATT TTTCGCCGTC GCTGGCTTTG GCGGCAACGG CAAGTTCATC 

151 ACCGACTTTC ACTTCTGCTT TAGACACCAG CAGGGCAAAG CGCAATTCTT 

201 CGCCCAAAGC ATTCAGATAG CCGGCCATTT CTTCCGGCGC GGTAATTTCG 

251 GCTTCCGCCT GCAAggacga accgacagTT TTGTcggcGC GCAAAGGCTC 

3 01 GAtagcggcg gTTACTGCTT CGCGCGCTTC GCGGATTGCC GTCCATTTTT 

3 51 TCACCAGTTC GGCTTCGGCT TTTTCGTTGA TGGCCGGGAA CTCGTGCCAA 
401 GTATGGAAGA GGACGCTGTC TTCTTCGCCG CCGCCGATGA TGTCCCACGC 

4 51 TTCTTCGCCG GTGAAGCACA AAATCGGTGC AATCAAGAGA ACCAGGCTGC 
501 GCGTGATGTG GTACAGGGCG GTTTGCGCGC TGCGGCGGGC GCGGCTGTCG 
551 GCTTTGGTGG TGTAGAGGCG GTCTTTCAGG ATGTCGAGGT AGAACGCGCC 
601 CAAGTCTTCC GAGCAGAAAG AAACAATGTC TTTCACGGCG AAGTGGAAGG 
651 CATAGCGCGG ATAGTAACCG CCTGCCAAAC GCTCTTGCAG CCGCCGCGCC 
701 AATACCAAGG CGTAGCGGTC GATTTCCACC ATATCCGCCT . GTTGCACGGC 
751 . ATCTTCAATC GGATTAAAGT CGCTCAAATT GGCAAAcagG AAGCTCAAGG 
801 TATTGCGGAT GCGGCGGTAG CTTTCGGTAA CGCGTTTGAG GATTTCTTTG 
851 GAAatcgCCA ATtcgccgct gTAATCGGTG GATGCCGCCC ACAGGCGCAG 
901 GATGTCCGCG CCGAATTCGT TATAGACTTC CTGCGGCGCG ACGACGTTGC 
951 CGATGGATTT CGACATTTTG CGGCCGTTTT GGTCAACCAC GAAACCGTGG 

10 01 GTCAGCAGCT GTTTATACGG TGCGCGTCCC ATGGATGA 

This corresponds to the amino acid sequence <SEQ ID 876; ORF 236.ng>: 

g236.pep 

1 MARFAFSADI LRTAFADGFI TCNRAH IAGV MPAAFAFFAV AGF GGNGKFI 

51 TDFHFCFRHQ QGKAQFFAQS IQIAGHFFRR GNFGFRLQGR TDSFVGAQRL 

101 DSGGYCFARF ADCRPFFHQF GFGFFVDGRE LVPSMEEDAV FFAAADDVPR 

151 FFAGEAQNRC NQENQAARDV VQGGLRAAAG AAVGFGGVEA VFQDVEVERA 

201 QVFRAERNNV FHGEVEGIAR IVTACQTLLQ PPRQYQGVAV DFHHIRLLHG 

251 IFNRIKVAQI GKQEAQGIAD AAVAFGNAFE DFFGNRQFAA VIGGCRPQAQ 

301 DVRAEFVIDF LRRDDVADGF RHFAAVLVNH ETVGQQLFIR CASHG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 877>: 

m236.seq (partial) 

1 . . TTGCACGGAC GAACCGACGG TTTTGTCGGC GCGCAAAGGC TCGATGGCGG 

51 CGGTTACCGC TTCGCGGGCT TCGCGGATTG CCGTCCATTT TTTCACCAGT 

101 TCGGCTTCGG TTTTTTCGTT GATGGTCGGG AACTCGTGCC AAGTATGGAA 

151 GAGGACGCTG TCkTCTTCGC CGCCGCCGwT GAyGTCCCAC GCTTCTTCGC 

201 CGGTGAAGCA CAAAATCGGT GCAATCAAGA GAACCAAACT GCGTGTGATG 

251 TGATACAGGG CAGTTTGTGC GCTGCGGCGT GCATGGCTGT CTGCTTTGGT 

301 GGTGTAGAGG CGGTCTTTCA GGATGTCGAG GTAGAACGCA CCCAAGTCTT 

351 CCGAGCAGAA AGAAACArTG TCTTTTACGG CAAAGTGGaA kGCATAACGC 

401 GGATAGTAAT CGCCTGCCAG ACACTCTTGC AGCTGACGTG CCAATACCAC 

451 GGCGTAGCGG TCGATTTCCA CCATATCCGC CTGTTGCACG GCATCTTCAA 

501 TCGGATTAAA GTCGCTCAAG TTGGCAAACA AAAAGCTCAA GGTATTGCGG 

551 ATACGGCGGT AgCTTTCGGT TACGCGTTTG AGGATTTCTT TGGAAATCGC 

601 CAATTCGCCG CTGTAATCGG TAGATGCCGC CCACAGGCGC AGGATGTCTG 

651 CGCCGAATTC GTTATAAACC TCTTGCGGTG CAACGACGTT GCCGATGGAT 

701 TTCGACATTT TTTTGCCTTC GCCGTCGACA ACGAAACCAT GGGTCAGCAG 

751 CTGTTTATAC GGCGCGCGAC CCATTGA 

This corresponds to the amino acid sequence <SEQ ID 878; ORF 236>: 

m236 .pep (partial) 

1 . . LHGRTDGFVG AQRLDGGGYR FAGFADCRPF FHQFGFGFFV DGRELVPSME 
51 EDAVXFAAAX DVPRFFAGEA QNRCNQENQT ACDVIQGSLC AAACMAVCFG 
101 GVEAVFQDVE VERTQVFRAE RNXVFYGKVE XITRIVIACQ TLLQLTCQYH 
151 GVAVDFHH I R LLHGIFNRIK VAQVGKQKAQ GIADTAVAFG YAFEDFFGNR 
201 QFAAVIGRCR PQAQDVCAEF VINLLRCNDV ADGFRHFFAF AVDNETMGQQ 
251 LFIRRATH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 236 shows 82.9% identity over a 258 aa overlap with a predicted ORF (ORF 236.ng) 
from N. gonorrhoeae: 

m236/g236 



FCi/usyy/uyj4o 
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10 20 30 

m236 . pep LHGRTDGFVGAQRLDGGGYRFAGFADCRPF 

hlllhlllllllhlll II MIIMI 

9 236 FRHQQGKAQFFAQSIQIAGHFFRRGNFGFRLQGRTDSFVGAQRLDSGGYCFARFADCRPF 
60 70 80 90 100 110 

40 50 60 70 80 90 

m23 6 . pep FHQFGFGFFVDGRELVPSMEEDAVXFAAAXDVPRFFAGEAQNRCNQENQTACDVIQGSLC 

MM M I II 1 1 II MM II Mill MM 1 1 1 1 1 1 1 1 1 1 II I M 1 1 1 1 : 1 I hi hi 

g236 FHQFGFGFFVDGRELVPSMEEDAVFFAAADDVPRFFAGEAQNRCNQENQAARDWQGGLR 
120 130 140 150 160 170 

!00 HO 120 130 140 150 

m23 6 . pep AAACMAVCFGGVEAVFQDVEVERTQVFRAERNXVFYGKVEXITRIVIACQTLLQLTCQYH 
Ml I I I I I I | M I I II I I I h II I II I I I I h h I I h I I I I I II | | | | |: 
g2 3 6 AAAGAAVGFGGVEAVFQDVEVERAQVFRAERNNVFHGEVEGI ARI VTACQTLLQPPRQYQ 

180 190 200 210 220 230 

1^0 170 180 190 200 210 

m23 6 . pep GVAVDFHHIRLLHGIFNRIKVAQVGKQKAQGIADTAVAFGYAFEDFFGNRQFAAVIGRCR 
M I M I I I I I I II I I II II II I h I I h I I II I I : II I I I II I I I II I I II M II I II 
9236 GVAVDFHHIRLLHGIFNRIKVAQIGKQEAQGIADAAVAFGNAFEDFFGNRQFAAVIGGCR 
240 250 260 270 280 290 

220 230 240 250 259 

m2 3 6 . pep PQAQDVCAE FV I NLLRCNDVADG FRH FFAFAVDNE TMGQQLF I RRATHX 

M 1 1 1 1 1 1 1 1 h M I M M M II II I MM hill I III hi 

923 6 PQAQDVRAE FVIDFLRRDDVADGFRHFAAVLVNHETVGQQLF I RCASHG 

300 310 320 330 340 



TCGCCTTCTC CGCCGACATT CTCTGCACAG CGTTTGCAGA 
GCCTGCAACC GCGCCCACAT CGCGGGTGTA GTGCCAGCAG 
TTTCACCATC ACTGGCTTTA GCGGCAACGG CAAGTTCGCT 
ACTTCTGCTT TAGACACCAG CAAAGCAAAG CGCAATTCTT 
ATTCAGATAG CCGGCCATTT CTTCCGGCGC GGTAATTTCG 
GCAAGGACGA ACCGACGGTT TTGTCGGCGC GCAAAGGCTC 
GTTACCGCTT CGCGGGCTTC GCGGATTGCC GTCCATTTTT 
GGCTTCGGCT TTTTCGTTGA TGGTCGGGAA CTCGTGCCAA 
GCACGCTGTC TTCTGCGCCG CCGCCGATGA TGTCCCACGC 
GTGAAGCACA AAATCGGTGC AATCAAGAGA ACCAGGCTGC 
GTACAGGGCG GTTTGCGCGC TGCGGCGGGC GCGGCTGTCG 
TATAGAGGCG GTCTTTCAGG ATATCGAGGT AGAACGCGCC 
GAGCAGAAAG AAACCATTTC TTTCACGGCA AAGTGGAAGG 
AT AAAAAT C A CCGGCAACGC GTTCTTGCAG CCGCCTTGCC 
CATAGCGGTC GATTTCCACC ATATCCGCCT GTTGCACGGC 
GGATTGAAGT CGCTCAAGTT GGCAAACAAA AAGCTCAAGG 
ACGGCGGTAG CTTTCGGTTA CGCGCTTGAG GATTTCTTTG 
ATTCGCCGCT GTAATCGGTG GATGCCGCCC ACAGGCGCAG 
CCGAACTCGT TATACACTTC TTGCGGCGCG ACGACGTTGC 



This corresponds to the amino acid sequence <SEQ ID 880; ORF 236.a>: 

a236.pep 

1 MARFAFSADI LCTAFADGFM ACNRAH IAGV VPAAFAFFTI TGF SGNGKFA 

51 AYFHFCFRHQ QSKAQFFAQS IQIAGHFFRR GNFGFGLQGR TDGFVGAQRL 

101 DGGGYRFAGF ADCRPFFHQF GFGFFVDGRE LVPSMEKHAV FCAAADDVPR 

151 FFAGEAQNRC NQENQAARDV VQGGLRAAAG AAVGFGGIEA VFQDIEVERA 

201 QVFRAERNHF FHGKVEGITR IKITGNAFLQ PPCQHQGIAV DFHHIRLLHG 

251 IFNRIEVAQV GKQKAQGIAD TAVAFGYALE DFFGNRQFAA VIGGCRPQAQ 

301 DVRAELVIHF LRRDDVADGF RHFAPVLIHH ETMGQQLFVR RATH* 



a236 . seq 




1 


ATGGCGCGTT 


51 


CGGTTTCATG 


101 


CGTTCGCATT 


151 


GCCTACTTTC 


201 


CGCCCAAAGC 


251 


GCTTCGGCTT 


301 


GATGGCGGCG 


351 


TCACCAGTTC 


401 


GTATGGAAAA 


451 


TTCTTCGCCG 


501 


GCGTGATGTG 


551 


GCTTTGGTGG 


601 


CAAGTCTTCC 


651 


CATAACGCGG 


701 


AACACCAAGG 


751 


ATCTTCAATA 


801 


TATTGCGGAT 


851 


GAAATCGCCA 


901 


GATGTCCGCG 


951 


CGATGGATTT 


1001 


GTCAGCAGCT 



WU yy/57280 
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m236/a236 8 1 .0% identity in 258 aa overlap 

10 20 30 

m2 3 6 . pep LHGRTDGFVGAQRLDGGGYRFAGFADCRPF 

I : I I I M II I I I I I I I I I I I | | | | | | M M 
a236 FRHQQSKAQFFAQSIQIAGHFFRRGNFGFGLQGRTDGFVGAQRLDGGGYRFAGFADCRPF 
60 70 80 90 100 110 

40 50 60 70 80 90 

m236 . pep FHQFGFGFFVDGRELVPSMEEDAVXFAAAXDVPRFFAGEAQNRCNQENQTACDVIQGSLC 
I I I I I I I I I I I I I I I I I I I I : II Ml | | | | I | | | | | | | | | | | | j | : j ||:||:| 

FHQFGFGFFVDGRELVPSMEKHAVFCAAADDVPRFFAGEAQNRCNQENQAARDWQGGLR 
120 130 140 150 160 170 



a236 



100 HO 120 130 140 150 

m236 . pep AAACMAVCFGGVEAVFQDVEVERTQVFRAERNXVFYGKVEXITRIVIACQTLLQLTCQYH 
Ml M 111:111111:1111:11111111 I : I II I I I I I I : : : : I , | j : : 
AAAGAAVGFGGIEAVFQDIEVERAQVFRAERNHFFHGKVEGITRIKITGNAFLQPPCQHQ 
180 190 200 210 220 230 



a236 



160 170 180 190 200 210 

m23 6 . pep GVAVDFHHIRLLHGIFNRIKVAQVGKQfCAQGIADTAVAFGYAFEDFFGNRQFAAVIGRCR 

I : I I I I I I I I I I I : I i I I I I I I I I I I I I I I I i I I IT : I I I I I I I I I I f I I I II 

GIAVDFHHIRLLHGIFNRIEVAQVGKQKAQGIADTAVAFGYALEDFFGNRQFAAVIGGCR 
240 250 260 270 280 290 



a236 



220 230 240 250 259 

m236 . pep PQAQDVCAEFVINLLRCNDVADGFRHFFAFAVDNETMGQQLFIRRATHX 
Mill! Il:||::|| :IMIilll| : : I I I I f 1 I I : I I I I I I 
a236 PQAQDVRAELVIHFLRRDDVADGFRHFAPVLIHHETMGQQLFVRRATHX 
300 310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 88 1>: 

g237 . seq 

1 atgcgggaca aggttggcgg taatatcgca ctccccgccc cacgaatatt 

51 cgattctaac atcggcaagc tgcggaaaaa ctttaagcat atcttggcgg 

101 acaagctcgg tcatacgcgc aggattgtcg ataaattcgt tatccttacc 

151 gccgaaaagc agcctgccgt ccgcgctgag gcggtaataa tccaaaatat 

201 ggcggttgtc gcatactgcc atattgttgc ggataagccc ttttgtgcgc 

251 gcgcccaagg gttcggtggc aataataaag gtgctgacgg caatcgcctt 

301 gcgttccaaa ggccggaata tcgggttcaa accgacataa gtattgacgg 

351 catagaccac atttttacac tcgacgctgc cttcgggcgt gtaaaccagc 

401 caaccgtttt gatacggttc gatgcgcgtc atcggggatt gctcgaaaat 

451 ctgcgcgccg gcttcggcag cggcgctggc aacacccaac gtgtaattga 

501 gcggatgaag atgcccggac aagggatcga actgtgcgcc ttggtacata 

551 tcgctgtcaa gctgctgttt caactcggct ttatcccaaa gttgataatg 

601 actcgcaccg taatgccgtt gggcgtgttc atgccactgc tgcaactctt 

651 cccaatgctg cggacggacg gcaaccgtgg cataaccgcg ctgccaatcg 

701 caatcgatgg catgtttgcg gacgcgttcg tccaccagtt cgaccgcctg 

751 caaagactgt tgccaaaacc attgcgcctg ctccaagccg acctgttttt 

801 caatttcccc cataccgcag gcgtagtcgc tgataacctg cccgccactc 

851 ctgccggacg cgccgaagcc gatacgtgcg gcttccaaaa cgacggcttc 

901 atgtccgtgt tccgccagcg gcaatgcggt acacaaaccg ctcaaaccgc 

951 cgccgataat gcaggtttcg gctttcagac ggcattggag tttcggataa 

1001 acagtatgcg gattaaccga actaaaataa taagaaggca gatattcttg 

1051 aaaatcaggg cgaatcattg tgtttgcttt atcgggtata ttttcggacg 

1101 gaatgataca gactgtcggg ccatatcgtc caaacagaaa atcggttga 

This corresponds to the amino acid sequence <SEQ ID 882; ORF 237.ng>: 
g237.pep 

1 MRDKVGGNIA LPAPRIFDSN IGKLRKNFKH ILADKLGHTR RIVDKFVILT 

51 AEKQPAVRAE AVIIQNMAW AYCHIVA DKP FCARAQGFGG NNKGADGNRL 



txrivusyy/uyMo 
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101 AFQRPEYRVQ TDISIDGIDH I FTLDAAFGR VNQPTVLIRF DARHRGLLEN 

151 LRAGFGSGAG NTQRVIERMK MPGQGIEL CA LVHIAVKLLF QLGFI PKLIM 

201 TR TVMPLGVF MPLLQLFPML RTDGNRGITA LPIAIDGMFA DAFVHQFDRL 

251 QRLLPKPLRL LQADLFFNFP HTAGWADNL PATPAGRAEA DTCGFQNDGF 

3 01 MSVFRQRQCG TQTAQTAADN AGFGFQTALE FRINSMRINR TKIIRRQIFL 

351 KIRANHCVCF IGYIFGRNDT DCRAISSKQK IG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 883>: 

m23 7 . seq 

1 ATGCGGGACA AGGTTGGCGG TAATGTCGCA CTCCCCGCCC CACGAATATT 

51 CGATTTTGAC ATCGGCAAGC TGCGGAAAAA CTTTAAGCAT ATCTTGGCGG 

101 ACAAGCTCGG TCATaCGCTC AGGATTGTCG ATAAACTCGT TATCCTTACC 

151 GCCGAAAAGC AGTCTGCCGT CCGCGCTGAG GCGGTAATAA TCCAAAATAT 

201 GGCGGTTGTC GCATACTGCC ATATTGTTAC GGATAAGCCC TTTTGCGCGC 

251 GCCCCCAAGG GTTCGGTCGC AATAATAAAG GTGCTGACAG CAATCGCCTT 

301 GCGTTCCAAA GGCCGGAATA TCGGGTTCAA ACCTGCATAA GTATTGACAG 

3 51 CATAGACCAC ATTTTTGCAC TCGACGCTGC CTTCGGGCGT GTAAACCAGC 

401 CAACCGTTTT GATGCGGTTC GATGCACGTC ATCGGGGATT GCTCGAAAAT 

451 CTGCGCACCG GCTTCGGCAG CGGCACGAGC GATGCCCAAA GTGTAAGTGA 

501 GCGGATGCAG GTGTCCGGAT AAGGGGTCGA ATTGTGCCCC TTGGTACATA 

551 TCGCTGTCAA GCTGCTGTTT CAACTCGGCT TTATCCCAAA GTTGATAATG 

601 ACTCGCACCG TAATGCCGTT GGGCGTGTTC ATGCCACTGC TGCAACTCTT 

651 CCCAATGCTG CGGACGGACG GCAACCGTGG CATAACCGCG CTGCCAATCA 

701 CAATCGACGG CATGTTTGCG GACGCGTTCG TCCACCAGTT CGACCGCCTG 

751 CAAAGACTGT TGCCAAAACC ATTGCGCCTG CTCCAAGCCG ACCTGTTTTT 

801 CAATTTCCCC CATACCGCAG nCGTAATCGC TGATAACCTG CCCGCCACTC 

851 CGTCCCGACG CGCCGAAACC GATACGCGCG GCTTCCAACA CAACCGTTTC 

901 ATGTCCCTGC TCCGCCAAGG GCAATGCAGT GCACAAACCA CCCAATCCGC 

951 CGCCGATGAT ACAGGTATCG GTTTTCAGAC GGCATTGAAG TTtCGGATAA 

1001 ACAGTATGAG GATTAACCGA ACTGAAATAA TAAGAAGGCA GATATTCTTG 

1051 AAAATCAGGG CGAATCATTG TGTTTGCTTT ATCAGGTGTA TTTTCGGACG . 

1101 GAATGATACA GGCTGTCGGG CCATATCGTC CAwACAGAAA ATCGGTTGA 

This corresponds to the amino acid sequence <SEQ ED 884; ORF 237>: 

m23 7 . pep 

1 MRDKVGGNVA LPAPRIFDFD IGKLRKNFKH ILADKLGHTL RIVDKLVILT 

51 AEKQSAVRAE AVIIQNMAW AYCHIVT DKP FCARPQGFGR NNKGADSNRL 

101 AFQRPEYRVQ TCISIDSIDH IFALDAAFGR VNQPTVLMRF DARHRGLLEN 

151 LRTGFGSGTS DAQSVSERMQ VSGXGVEL CP LVHIAVKLLF QLGFI PKLIM 

2 01 TR TVMPLGVF MPLLQLFPML RTDGNRGITA LPITIDGMFA DAFVHQFDRL 

251 QRLLPKPLRL LQADLFFNFP HTAXVIADNL PATPSRRAET DTRGFQHNRF 

301 MSLLRQGQCS AQTTQSAADD TGIGFQTALK FRINSMRINR TEI IRRQIFL 

351 KIRANHCVCF IRCIFGRNDT GCRAISSXQK IG* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 237 shows 86. 1 % identity over a 382 aa overlap with a predicted ORF (ORF 237.ng) 
from K gonorrhoeae: 

m237/g237 

m23 7 .pep 
g237 



m237 .pep 
g237 



10 20 30 40 50 60 

MRDKVGGNVALPAPRIFDFD IGKLRKNFKH I LAD KLGHTLR I VDKLV I LTAEKQSAVRAE 

lllllllhllMMIM H I MMMI ! 1 1 1 1 II 1 1 1 1 1 1 1 1 : 1 1 1 1 1| 1 1 Mill 

MRDKVGGNIALPAPRIFDSNIGKLRKNFKHILADKLGHTRRIVDKFVILTAEKQPAVRAE 
10 20 30 40 50 60 

70 80 90 100 110 120 

AV I IQNMAWAYCHI VTDKP FCARPQGFGRNNKGADSNRLAFQRPE YRVQTC I S I DS I DH 

IMilllllllllllhlllllll 1 1 1 1 IIMIhlllllllllMIII 1 1 f I = I f I 

AVIIQNMAWAYCHIVADKPFCARAQGFGGNNKGADGNRLAFQRPEYRVQTDISIDGIDH 
70 80 90 100 110 120 



m237 .pep 



130 140 150 160 170 180 

IFALDAAFGRVNQPTVLMRFDARHRGLLENLRTGFGSGTSDAQSVSERMQVSGXGVELCP 



g237 



m2 37 .pep 
g237 



m23 7 .pep 
g237 



m2 37 .pep 
g237 



m23 7 -pep 

g237 

370 380 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 885>: 

a237 . seq 

1 ATGCGGGACA AGGTTGGCGG TAATGTCGCA CTCCCCGCCC CACGAATATT 

51 CGATTTTGAC ATCGGCAAGC TGCGGAAAAA CTTTAAGCAT ATCTTGGCGG 

101 ACAAGCTCGG TCATACGCGC GGGATTGTCG ATAAACTCGT TATCCTTACC 

151 GCCGAAAAGC AGTCTGCCGT CCGCGCTGAG GCGGTAATAA TCCAAAATAT 

201 GACGGTTGTC GCATACTGCC ATATTGTTGC GGATAAGCCC TTTTGCACGC 

251 GCGCCCAAGG GTTCTGTGGC AATAATAAAG GTGCTGACAG CAATCGCCTT 

301 GCGCTCCAAA GGCTTGAATA TCGGATTCAA ACCGGCATAA GTATTGACGG 

351 CGTACACCAG ATTTTTGCAT TCGACGCTGC CTTCGGGGGT GTAAACCAGC 

401 CAACCGTTTT GATAAGGTTC AATGCGTATC ATGGGAGAAT GCTCAAAAAT 

4 51 CTTCGTACCA GCTTCGGCAG CGGCGCGGGC GATGCCCAAC GTGTAATTGA 

501 GCGGATGGAG ATGCCCGGAC AAGGGATCGA ACTGTGCGCC TTGGTACATA 

551 TCGCTGTCAA GCTGCTGCTT CAGTTCAGTG T TAT CCC AG A GTTGATAATG 

601 AGTTGCACCG TAATATTTTT GGGCGTGCTC ATGCCATTGT TGCAATTCTT 

651 CCCAATGCTG CGAACGGATG GCAACCGTGG CATAACCGCG CTGCCAATCG 

701 CAATCAATGG CATGTTTGCG GACGCGTTCG TCCACCAGTT CGACCGCCTG 

751 CAAAGACTGT TGCCAAAACC ATTGCGCTTG CTCCAAACCG ACCTGTTTTT 

801 CAATTTCCTC CATACCGCAG GCGTAATCGC TGATAACCTG CCCGCCACTC 

851 CGTCCCGACG CGCCGAAACC GATACGCGCG GCTTCCAACA CAACCGTTTC 

901 ATGTCCCTGC TCCGCCAAGG GCAATGCAGT GCACAAACCA CTCAATCCGC 

951 CGCCGATGAT ACAGGTATCG GTTTTCAGAC GGCATTGAAG TTTCGGATAA 

1001 ACAGTATGAG GATTAACCGA ACTGAAATAA TAAGAAGGCA GATATTCTTG 

1051 AAAATCAGGG CGAATCATTG TGTTTGCTTT ATCGGGTATA TTTTCGGACG 

1101 GAATGATACA GGCTGTCGAG CCATATCGTC CAAACAGAAA ATCGGTTGA 

This corresponds to the amino acid sequence <SEQ ID 886; ORF 237.a>: 

a237 .pep 

1 MRDKVGGNVA LPAPRIFDFD IGKLRKNFKH ILADKLGHTR GIVDKLVILT 

51 AEKQSAVRAE AVIIQNMTVV AYCHIVA DKP FCTRAQGFCG NNKGADSNRL 

101 ALQRLEYRIQ TGISIDGVHQ I FAFDAAFGG VNQPTVLIRF NAYHGRMLKN 

151 LRTSFGSGAG DAQRVIERME MPGQGIEL CA LVHIAVKLLL QFSVI PELIM 

201 SCTVIFLGVL MPLL QFFPML RTDGNRGITA LPIAINGMFA DAFVHQFDRL 

251 QRLLPKPLRL LQTDLFFNFL HTAGVIADNL PATPSRRAET DTRGFQHNRF 

301 MSLLRQGQCS AQTTQSAADD TGIGFQTALK FRINSMRINR TEIIRRQIFL 

351 KIRANHCVCF IGYIFGRNDT GCRAISSKQK IG* 
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I I M I I I I I I II Ml MM M I Ml Ml I MMM M M-M | |||:: | |:||| 
IFTLDAAFGRVNQPTVLIRFDARHRGLLENLRAGFGSGAGNTQRVIERMKMPGQGIELCA 
130 140 150 160 170 180 

190 200 210 220 230 240 

LVH I AVKLLFQLGFI PKLIMTRTVMPLGVFMPLLQLFPMLRTDGNRGI TALP I T I DGMFA 

II II I Mi M Mill II MIMM llllllll I! IIMIMMMI II MM I M II Ml 

LVHIAVKLLFQLGFIPKLIMTRTVMPLGVFMPLLQLFPMLRTDGNRGITALPIAIDGMFA 
190 200 210 220 230 240 

250 260 270 280 290 300 

DAFVHQFDRLQRLLPKPLRLLQADLFFNFPHTAXVIADNLPATPSRRAETDTRGFQHNRF 

II 1 1 Ml II I MM I III 1 1! Ml III llllll MM I Mllh 1 1 hi I Mh-- I 

DAFVHQFDRLQRLLPKPLRLLQADLFFNFPHTAGWADNLPATPAGRAEADTCGFQNDGF 
250 260 270 280 290 300 

310 320 330 340 350 360 

MSLLRQGQCSAQTTQSAADDTGIGFQTALKFRINSMRINRTEIIRRQIFLKIRANHCVCF 

IhMI I MM IMMIMM M Ml M Ml I M 1 1 1 M h M 1 1 M M M M I M M I 

MS VFRQRQCGTQTAQTAADNAGFGFQTALE FR INSMR INRTKI I RRQI FLKI RANHCVC F 
310 320 330 340 350 360 

370 380 
I RC I FGRNDTGCRAI S SXQKI GX 

I lllllll MUM Mill 

IGYIFGRNDTDCRAISSKQKIGX 
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m237/a237 85.6% identity in 382 aa overlap 

10 20 30 40 50 60 

m237.pep MRDKVGGNVALPAPRIFDFDIGKLRKNFKHILADKLGHTLRIVDKLVILTAEKQSAVRAE 
I I I I I I I I I I I M I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
a237 MRDKVGGNVALPAPRIFDFDIGKLRKNFKHILADKLGHTRGIVDKLVILTAEKQSAVRAE 

10 20 30 40 50 60 



70 80 90 100 110 120 

m2 3 7 . pep AVI IQNMAVVAYCHI VTDKPFCARPQGFGRNNKGADSNRLAFQRPEYRVQTC I S I DS I DH 

I I I I I I I : I I I I I I I I : I I I I I : I II I I I I I I I I I I I I : I I I I I : ! I III!:: : 
a237 AVIIQNMTWAYCHIVADKPFCTRAQGFCGNNKGADSNRLALQRLEYRIQTGISIDGVHQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

m237 .pep IFALDAAFGRVNQPTVLMRFDARHRGLLENLRTGFGSGTSDAQSVSERMQVSGXGVELCP 

111:11111 1111111:11:1 I : I : I I I I : I I I I : : I I I I | | | : : | I : I I I 
a237 IFAFDAAFGGVNQPTVLIRFNAYHGRMLKNLRTSFGSGAGDAQRVIERMEMPGQGIELCA 

130 140 150 160 170 180 



190 200 210 220 230 240 

m237 .pep LVHIAVKLLFQLGFIPKLIMTRTVMPLGVFMPLLQLFPMLRTDGNRGITALPITIDGMFA 
I I I I I I I I I : I : : I I : I I I : I I : I I I : M I I I : I I 1 I I I I I I I I M II I I : I : M I I 
a237 LVHIAVKLLLQFSVIPELIMSCTVIFLGVLMPLLQFFPMLRTDGNRGITALPIAINGMFA 

190 200 210 220 230 240 



250 260 270 280 290 300 

m237 . pep DAFVHQFDRLQRLLPKPLRLLQADLFFNFPHTAXVIADNLPATPSRRAETDTRGFQHNRF 
I I I I I I I I I I I I I I I I I I II I I : I I I I I I Ml I I ! I I I I I I I I I II I I I I I I I I I I I I 
a237 DAFVHQFDRLQRLLPKPLRLLQTDLFFNFLHTAGVIADNLPATPSRRAETDTRGFQHNRF 

250 260 270 280 290 300 



310 320 330 340 350 360 

m237 .pep MSLLRQGQCSAQTTQSAADDTGIGFQTALKFRINSMRINRTEIIRRQIFLKIRANHCVCF 
I I I I I I I I I I I I I I I I I II I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a237 MSLLRQGQCSAQTTQSAADDTGIGFQTALKFRINSMRINRTEIIRRQIFLKIRANHCVCF 

310 320 330 340 350 360 



370 380 
m237 .pep IRCIFGRNDTGCRAISSXQKIGX 
I I ! I I I I I I I I I I I I I I I I I 
a237 IGYIFGRNDTGCRAISSKQKIGX 

370 380 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 887>: 

g238 . seq 

1 atgaatttgc ctattcaaaa attcatgatg ctgttggcag cggcaatatc 

51 gatgctgcat atccccatta gtcatgcgaa cggtttggat gcccgtttgc 

101 gcgatgatat gcaggcaaaa cactacgaac cgggtggcaa ataccatctg 

151 tttggtaatg ctcgcggcag tgttaaaaat cgggtttgcg ccgtccaaac 

201 atttgatgca actgcggtcg gccccatact gcctattaca cacgaacgga 

251 caggatttga aggtgttatc ggctatgaaa cccatttttc aggacacgga 

301 cacgaagtac acagtccgtt cgataatcat gattcaaaaa gcacttctga 

351 tttcagcggc ggcgtagacg gcggttttac cgtttaccaa cttcatcgga 

401 cagggtcgga aatacatccc gcagacggat atgacgggcc tcaaggcggc 

451 ggttatccgg aaccacaagg ggcaagggat atatacagct accatatcaa 

501 aggaacttca accaaaacaa agataaacac tgttccgcaa gccccttttt 

551 cagaccgctg gctaaaagaa aatgccggtg ccgcttccgg ttttctcagc 

601 cgtgcggatg aagcaggaaa actgatatgg gaaaacgacc ccgataaaaa 

651 ttggcgggct aaccgtatgg atgatattcg cggcatcgtc caaggtgcgg 

701 ttaatccttt tttaacgggt tttcaagggg tagggattgg ggcaattaca 

751 gacagtgcgg taagcccggt cacagataca gccgctcagc agactctaca 

801 aggtattaat gatttaggaa atttaagtcc ggaagcacaa cttgccgccg 
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851 cgagcctatt acaggacagt gcctttgcgg taaaagacgg catcaattcc 

901 gccagacaat gggctgatgc ccatccgaat ataacagcaa cagcccaaac 

951 tgcccttgcc gtagcagagg ccgcaggtac ggtttggcgc ggtaaaaaag 

1001 tagaacttaa cccgaccaaa tgggattggg ttaaaaatac cggctataaa 

1051 aaacctgctg cccgccatat gcagactgta gatggggaga tggcaggggg 

1101 gaatagaccg cctaaatcta taacgtcgga aggaaaagct aatgctgcaa 

1151 cctatcctaa gttggttaat cagctaaatg agcaaaactt aaataacatt 

1201 gcggctcaag atccaagatt gagtctagct attcatgagg gtaaaaaaaa 

1251 ttttccaata ggaactgcaa cttatgaaga ggcagataga ctaggtaaaa 

1301 tttgggttgg tgagggtgca agacaaacta gtggaggcgg atggttaagt 

1351 agagatggca ctcgacaata tcggccacca acagaaaaaa aatcacaatt 

1401 tgcaactaca ggtattcaag caaattttga aacttatact attgattcaa 

1451 atgaaaaaag aaataaaatt aaaaatggac atttaaatat taggtaa 

This corresponds to the amino acid sequence <SEQ ID 888; ORF 238.ng>: 

g23 8 .pep 

1 MNLPIQKFMM LLAAAISMLH IPISHAN GLD ARLRDDMQAK HYEPGGKYHL 

51 FGNARGSVKN RVCAVQTFDA TAVGPILPIT HERTGFEGVI GYETHFSGHG 

101 HEVHSPFDNH DSKSTSDFSG GVDGGFTVYQ LHRTGSEIHP ADGYDGPQGG 

151 GYPEPQGARD IYSYHIKGTS TKTKINTVPQ APFSDRWLKE NAGAASGFLS 

2 01 RADEAGKLIW ENDPDKNWRA NRMDDIRGIV QGAVNPFLTG FQGVGIGAIT 
251 DSAVSPVTDT AAQQTLQGIN DLGNLSPEAQ LAAASLLQDS AFAVKDGINS 

3 01 ARQWADAHPN I TATAQTALA VAEAAGTVWR GKKVELNPTK WDWVKNTGYK 
351 KPAARHMQTV DGEMAGGNRP PKSITSEGKA NAATYPKLVN QLNEQNLNNI 
401 AAQDPRLSLA IHEGKKNFPI GTATYEEADR LGKIWVGEGA RQTSGGGWLS 

.4 51 RDGTRQYRPP TEKKSQFATT GIQANFETYT IDSNEKRNKI KNGHLNIR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 889>: 

m23 8 . seq 

1 ATGAATTTGC CTATTCAAAA ATTCATGATG CTGTTTGCAG CAGCAATATC 

51 GTTGCTGCAA ATCCCCATTA GTCATGCGAA CGGTTTGGAT GCCCGTTTGC 

101 GCGATGATAT GCAGGCAAAA CAcTACGAAC CGGGTGGTAA ATACCATCTG 

151 TTTGGTAATG CTCGCGGCAG TGTTAAAAAG CGGGTTTACG CCGTCCAGAC 

201 ATTTGATGCA ACTGCGGTCA GTCCTGTACT GCCTATTACA CACGAACGGA 

251 CAGGGTTTGA AGGTGTTATC GGTTATGAAA CCCATTTTTC AGGGCACGGA 

301 CATGAAGTAC ACAGTCCGTT CGATCATCAT GATTCAAAAA GCACTTCTGA 

3 51 TTTCAGCGGC GGTGTAGACG GCGGTTTTAC TGTTTACCAA CTTCATCGAA 

4 01 CAGGGTCGGA AATCCATCCG GAGGATGGAT ATGACGGGCC GCAAGGCAGC 
451 GATTATCCGC CCCCCGGAGG AGCAAGGGAT ATATACAGCT ATTATGTCAA 
501 AGGAACTTCA ACAAAAACAA AGACTAATAT TGTCCCTCAA GCCCCATTTT 
551 CAGACCGTTG GCTAAAAGAA AATGCCGGTG CCGCCTCTGG TTTTTTCAGC 
601 CGTGCGGATG AAGCAGGAAA ACTGATATGG GAAAGCGACC CCAATAAAAA 
651 TTGGTGGGCT AACCGTATGG ATGATGTTCG CGGCATCGTC CAAGGTGCGG 
701 TTAATCCTTT TTTAATGGGT TTTCAAGGAG TAGGGATTGG GGCAATTACA 
751 GACAGTGCAG TAAGCCCGGT CACAGATACA GCCGCGCAGC AGACTCTACA 
801 AGGTATTAAT GATTTAGGAA AATTAAGTCC GGAAGCACAA CTTGCTGCCG 
851 CGAGCCTATT ACAGGACAGT GCTTTTGCGG TAAAAGACGG TATCAACTCT 
901 GCCAAACAAT GGGCTGATGC CCATCCAAAT ATAACAGCTA CTGCCCAAAC 
951 TGCCCTTTCC GCAGCAGAGG CCGCAGGTAC GGTTTGGAGA GGTAAAAAAG 

1001 TAGAACTTAA CCCGACTAAA TGGGATTGGG TTAAAAATAC CGGTTATAAA 

1051 AAACCTGCTG CCCGCCATAT GCAGACTTTA GATGGGGAGA TGGCAGGTGG 

1101 GAATAAACCT ATTAAATCTT TACCAAACAG TGCCGCTGAA AAAAGAAAAC 

1151 AAAATTTTGA GAAGTTTAAT AGTAACTGGA GTTCAGCAAG TTTTGATTCA 

12 01 GTGCACAAAA CACTAACTCC CAATGCACCT GGTATTTTAA GTCCTGATAA 

1251 AGTTAAAACT CGATACACTA GTTTAGATGG AAAAATTACA ATTATAAAAG 

1301 ATAACGAAAA CAACTATTTT AGAATCCATG ATAATTCACG AAAACAGTAT 

1351 CTTGATTCAA ATGGTAATGC TGTGAAAACC GGTAATTTAC AAGGTAAGCA 

14 01 AGCAAAAGAT TATTTACAAC AACAAACTCA TATCAGGAAC TTAGACAAAT 

1451 GA 

This corresponds to the amino acid sequence <SEQ ID 890; ORF 238>: 

m23 8 .pep 

1 MNLPIQKFMM LFAAAISLLQ IPISHAN GLD ARLRDDMQAK HYEPGGKYHL 

51 FGNARGSVKK RVYAVQTFDA TAVSPVLPIT HERTGFEGVI GYETHFSGHG 

101 HEVHSPFDHH DSKSTSDFSG GVDGGFTVYQ LHRTGSEIHP EDGYDGPQGS 
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151 DYPPPGGARD IYSYYVKGTS TKTKTNIVPQ APFSDRWLKE NAGAASGFFS 

201 RADEAGKLIW ESDPNKNWWA NRMDDVRGIV QGAVNPFLMG FQGVGIGAIT 

251 DSAVSPVTDT AAQQTLQGIN DLGKLSPEAQ LAAASLLQDS AFAVKDGINS 

301 AKQWADAHPN ITATAQTALS AAEAAGTVWR GKKVELNPTK WDWVKNTGYK 

351 KPAARHMQTL DGEMAGGNKP IKSLPNSAAE KRKQNFEKFN SNWSSASFDS 

401 VHKTLTPNAP GILSPDKVKT RYTSLDGKIT IIKDNENNYF RIHDNSRKQY 

4 51 LDSNGNAVKT GNLQGKQAKD YLQQQTHIRN LDK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 238 shows 86.0% identity over a 401 aa overlap with a predicted ORF (ORF 238.ng) 
from N. gonorrhoeae: 

m238/g238 



10 20 30 40 50 60 

m23 8 . pep MNLPIQKFMMLFAAAISLLQIPISHANGLDARLRDDMQAKHYEPGGKYHLFGNARGSVKK 

M 1 1 1 1 1 Ml hi 1 1 i hhlll I II 1 1 1 1 1 li i 1 1 1 1 1 1 M II II 1 1 1 M 1 1 II li II : 

g23 8 MNLP I QKFMMLLAAAI SMLH I P I SHANGLDARLRDDMQAKHYE PGGK YHLFGNARGS VKN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m23 8 . pep RVYAVQTFDATAVSPVLPITHERTGFEGVIGYETHFSGHGHEVHSPFDHHDSKSTSDFSG 

II 1 1 1 1 1 1 1 1 IhhIM M II I II 1 1 1 1 1 1 II II 1 1 1 1 II I M 1 1 hi II 1 1 1 1 Ml I 

g23 8 RVCAVQTFDATAVGPILPITHERTGFEGVIGYETHFSGHGHEVHSPFDNHDSKSTSDFSG 

70 80 90 100 110 120 



130 140 150 160 1.70 180 

m23 8 . pep GVDGGFTVYQLHRTGSEIHPEDGYDGPQGSDYPPPGGARDIYSYYVKGTSTKTKTNIVPQ 

II III II III II llllll II Mill llh II I I M I M I MM 1 1 1 1 II i I III 

g23 8 GVDGGFTVYQLHRTGSEIHPADGYDGPQGGGYPEPQGARDIYSYHIKGTSTKTKINTVPQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

m2 3 8 . pep APFSDRWLKENAGAASGFFSRADEAGKLIWESDPNKNWWANRMDDVRGIVQGAVNPFLMG 

1 1 II M 1 1 II II 1 1 II 1 1 M 1 1 M 1 1 M I M M I M M II 1 1 1 h 1 1 1 1 1 II 1 1 1 1 1 I 

g23 8 APFSDRWLKENAGAASGFLSRADEAGKLIWENDPDKNWRANRMDDIRGIVQGAVNPFLTG 

190 200 210 220 230 240 



250 260 270 280 290 300 

m238 . pep FQGVGI GAI TDSAVS PVTDTAAQQTLQG INDLGKLS PE AQLAAASLLQDS AFAVKDG INS 

M 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 MMM I II I Ml M II 1 1 1 1 1 1 1 1 Ml 1 1 1 1 MMMM 

g23 8 FQGVGI GAI TDSAVS PVTDTAAQQTLQGINDLGNLSPEAQLAAASLLQDSAFAVKDG INS 

250 260 270 280 290 300 



310 320 330 340 350 360 

m2 3 8 . pep AKQWADAHPNITATAQTALSAAEAAGTVWRGKKVELNPTKWDWVKNTGYKKPAARHMQTL 

I M 1 1 1 1 1 III 1 1 II Ml MM I II I M li IM M II I II 1 1 II 1 1 1 M 1 1 1 1 II I M M 

g2 3 8 ARQWADAHPNITATAQTAIAVAEAAGTVWRGKKVELNPTKWDWVKNTGYKKPAARHMQTV 

310 320 330 340 350 360 

370 380 390 400 410 420 

m23 8 . pep DGEMAGGNKP I KSLPNSAAEKRKQNFEKFNSNWSS AS FDSVHKTLTPNAPG I LSPDKVKT 

IMIMIhl Ih = I - h - : 

g23 8 DGEMAGGNRPPKSI -TSEGKANAATYPKLVNQLNEQNLNNIAAQDPRLSLAIHEGKKNFP 

370 380 390 400 410 

430 440 450 460 470 480 

m23 8.pep RYTSLDGKITI I KDNENNYFRIHDNSRKQYLDSNGNAVKTGNLQGKQAKD YLQQQTHIRN 

g23 8 IGTATYEEADRLGKIWVGEGARQTSGGGWLSRDGTRQYRPPTEKKSQFATTGIQANFETY 
420 430 440 450 460 470 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 89 1>: 

a238 . seq (partial ) 

1 ATGAATTTGC CTATTCAAAA ATTCATGATG CTGTTTGCAG CAGCAATATC 

51 GTTGCTGCAA ATCCCCATTA GTCATGCGAA CGGTTTGGAT GCCCGTTTGC 

101 GCGATGATAT GCAGGCAAAA CACTACGAAC CGGGTGGTAA ATACCATCTG 

151 TTTGGTAATG CTCGCGGCAG TGTTAAAAAT CGGGTTTACG CCGTCCAAAC 

201 ATTTGATGCA ACTGCGGTCG GCCCCATACT GCCTATTACA CACGAACGGA 

251 CAGGATTTGA AGGCATTATC GGTTATGAAA CCCATTTTTC AGGACATGGA 

301 CATGAAGTAC ACAGTCCGTT CGATAATCAT GATTCAAAAA GCACTTCTGA 

351 TTTCAGCGGC GGCGTAGACG GTGGTTTTAC CGTTTACCAA CTTCATCGGA 

401 CAGGGTCGGA AATCCATCCG GAGGATGGAT ATGACGGGCC GCAAGGCAGC 

451 GATTATCCGC CCCCCGGAGG AGCAAGGGAT ATATACAGCT ACTATGTCAA 

501 AGGAACTTCA ACAAAAACAA AGAGTAATAT TGTTCCCCGA GCCCCATTTT 

551 CAGACCGCTG GCTAAAAGAA AATGCCGGTG CCGCCTCTGG TTTTTTCAGC 

601 CGTGCTGATG AAGCAGGAAA ACTGATATGG GAAAGCGACC CCAATAAAAA 

651 TTGGTGGGCT AACCGTATGG ATGATATTCG CGGCATCGTC CAAGGTGCGG 

701 TTAATCCTTT TTTAATGGGT TTTCAAGGAG TAGGGATTGG GGCAATTACA 

7 51 GACAGTGCAG TAAGCCCGGT CACAGATACA GCCGCGCAGC AGACTCTACA 

801 AGGTATTAAT CATTTAGGAA ATTTAAGTCC CGAAGCACAA CTTGCGGCTG 

851 CAACCGCATT ACAAGACAGT GCTTTTGCGG TAAAAGACGG TATCAATTCC 

901 GCCAGACAAT GGGCTGATGC CCATCCGAAT ATAACTGCAA CAGCCCAAAC 

951 TGCCCTTGCC GTAGCAGAGG CCGCAACTAC GGTTTGGGGC GGTAAAAAAG 

1001 TAGAACTTAA CCCGACCAAA TGGGATTGGG TTAAAAATAC CGGCTATAAA 

1051 ACACCTGCTG TTCGCACCAT GCATACTTTG GATGGGGAAA TGGCCGGTGG 

1101 GAATAGACCG CCTAAATCTA TAACGTCCAA CAGCAAAGCA GATGCTTCCA 

1151 CACAA 



This corresponds to the amino acid sequence <SEQ ID 892; ORF 238. a>: 

a238 .pep (partial) 

1 , MNLPIQKFMM LFAAAISLLQ IPISHA NGLD ARLRDDMQAK HYEPGGKYHL 

51 FGNARGSVKN RVYAVQTFDA TAVGPILPIT HERTGFEGII GYETHFSGHG 

101 HEVHSPFDNH DSKSTSDFSG GVDGGFTVYQ LHRTGSEIHP EDGYDGPQGS 

151 DYPPPGGARD IYSYYVKGTS TKTKSNIVPR APFSDRWLKE NAGAASGFFS 

201 RADEAGKLIW ESDPNKNWWA NRMDDIRGIV QGAVNPFLMG FQGVGIGAIT 

251 DSAVSPVTDT AAQQTLQGIN HLGNLSPEAQ LAAATALQDS AFAVKDGINS 

301 ARQWADAHPN ITATAQTALA VAEAATTVWG GKKVELNPTK WDWVKNTGYK 

351 TPAVRTMHTL DGEMAGGNRP PKSITSNSKA DASTQ 

m238/a238 91.9% identity in 385 aa overlap 

10 20 30 40 50 60 

m238.pep MNLPIQKFMMLFAAAISLLQIPISHANGLDARLRDDMQAKHYEPGGKYHLFGNARGSVKK 
I I M I I I I I I I I I I I I ! I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | j | | | , | j | , : 
a238 MNLPIQKFMMLFAAAISLLQIPISHANGLDARLRDDMQAKHYEPGGKYHLFGNARGSVBCN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m238.pep RVYAVQTFDATAVSPVLPITHERTGFEGVIGYETHFSGHGHEVHSPFDHHDSKSTSDFSG 
I M I I M II I I M:|:| | || || | | | |j 1:1 II I I I II || Ml III I I I: I I I | | If |f II 
a238 RVYAVQTFDATAVGPILPITHERTGFEGIIGYETHFSGHGHEVHSPFDNHDSKSTSDFSG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m238.pep GVDGGFTVYQLHRTGSEIHPEDGYDGPQGSDYPPPGGARDIYSYYVKGTSTKTKTNIVPQ 
I IN I IIIMIM IIIIIIIIMI III IIIIIIMII II III IMIIIIMIM:||||: 
a238 GVDGGFTVYQLHRTGSEIHPEDGYDGPQGSDYPPPGGARDIYSYYVKGTSTKTKSNIVPR 

130 140 150 160 170 180 

190 200 210 220 230 240 

m238.pep APFSDRWLKENAGAASGFFSRADEAGKLIWESDPNKNWWANRMDDVRGIVQGAVNPFLMG 

M 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | : | | | | | | | | | | | j| | 

*238 APFSDRWLKENAGAASGFFSRADEAGKLIWESDPNKNWWANRMDDIRGIVQGAVNPFLMG 

190 200 210 220 230 240 



m238 . pep 



250 260 270 280 290 300 

FQGVGIGAITDSAVSPVTDTAAQQTLQGINDLGKLSPEAQLAAASLLQDSAFAVKDGINS 
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a238 



M I I I I I I I I I I I I I I I I I I | | | | M I I I I ! I: I! ' I I Mill: I I Mill II Mill 
FQGVGIGAITDSAVSPVTDTAAQQTLQGINHLGNLSPEAQLAAATALQDSAFAVKDGINS 
250 260 270 280 290 300 



310 320 330 340 350 360 

m238 . pep AKQWADAHPNITATAQTALSAAEAAGTVWRGKKVELNPTKWDWVKNTGYKKPAARHMQTL 
I ■ I I I I II I I t I I I I 1 I I I I I I I M! I I I I I I I I I I I I I I I I I I I I ||:| |:|| 
a2 3 8 ARQWADAHPNITATAQTALAVAEAATTVWGGKKVELNPTKWDWVKNTGYKT PAVRTMHTL 

310 320 330 340 350 360 



370 380 390 400 410 419 

m238 . pep DGEMAGGNKPIKSLP-NSAAEKRKQNFEKFNSNWSSASFDSVHKTLTPNAPGILSPDKVK 

I I I 1 I I I I : I I I : III: I 
a238 DGEMAGGNRPPKS ITSNSKADASTQ 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 893>: 

g239 . seq 

1 atgttccacc ataaaggtat tgcccgaaac cggcggatgg aggttttgtt 
51 tttctgccgc cgccctgatc gcttcgtgat tcgccaaacg cgcctgttgc 
101 agcctcattt gcgcataatc ctgctccaag gcgatttcct gttttttcgc 
151 cttgtccaaa gctgtgaagt tgagcctgta ctggttttgc tgcatcacaa 
201 cggaaaaagc ggaaacgcac accgcaagca gcagaaagaa . attcgatttg 
251 ttcattgccg ttcagacgtt tttctctgtt attattzccgg tatcggaccg 

301 gcagtccgct ccgccacacg caaaactgcg ctcctcgccc tcgggttggc 

351 ggcaatttcc gcttcacccg gctttaatgc cctgcccacg attttcaggg 

4 01 gcggatcggg caaatccgct tctctgaccg ccgcccagct cggcaggggc 
451 tcgtgttgcg aatatttttt gacaaactgc ttcacaatgc ggtcttccaa 
501 cgaatggaaa gcaatgaccg ccaaacgccc gccctctttc agacggcaca 
551 tgacctgcgg caataccgcc cctacttctt caagctcgcg gttaataaag 
601 atgcggattg cctggaaggt gcgcgtcgca ggatcctgcc cccgctcgcg 
651 agtacggacg ttttgtgcca cgatctgcgc cagcttgcgg gttgtatcga 
701 ttggactttc cgcccgttgc gcgacaatgg cgcgcacaat ctggcggcta 
751 aaccgctctt caccataa 

This corresponds to the amino acid sequence <SEQ ID 894; ORF 239.ng>: 

g23 9.pep 

1 MFHHKGIARN RRMEVLFFCR RPDRFVIRQT RLLQPHLRI I LLQGDFLFFR 

51 LVQSCEVEPV LVLLHHNGKS GNAHRKQQKE IRFVHCRSDV FLCYYSGIGP 

101 AVRSATRKTA LLALGLAAI S ASPGFNALPT I FRGGSGKSA SLTAAQLGRG 

151 SCCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 

2 01 MRIAWKVRVA GSCPRSRV RT FCATICASLR WSIGL SARC ATMARTIWRL 

2 51 NRSSP* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 895>: 

m239.seq 

1 ATGCTCCACC ATAAAGGTmy kGCCCGAAAC CGGCkGATGG AGGTTTTGTT 

51 TTTCTGCCGC CGCCCTGATC GCTTCGTGGT TCGCCAAACG CGCCTGTTGC 

101 AGCCTCATTT GCGCATAATC CTGCTCCAAG GCGATTTCCT GTTTTTTCGC 

151 CTTATCCAAA GCTGTGAAAT TGAGCCTGTA CTGGTTTTGC TGCATCACAA 

201 CGGAAAAAGC GGAAACGCAC ACCGCAAGCA GCAGAAGGAA ATTCAATTTG 

251 TTCATTGCCA TTCAGACGTT TTTCTCTGTG ATTGTTCCGG TATCGGACCG 

301 GCAGTCCGCT CCGCCACACG CAAAACCGCA CTTCTCGCCC TCGGATTGGC 

351 GGCAATTTCC GCCTCACCCG GCTTTAATGC CCTGCCCACG ATTTTCAGGG 

401 GCAGCTCGGG CAAATCCGCT TCCCTGaCCG CCGCCCAGCG CGGCAGGGGC 

451 GCGTGTTGCG AATATTTTTT GACAAACTGC TTCACAATGC GATCTTCCAA 

501 CGAATGGAAA GCAATGACCG CCAAACGTCC GCCCTCTTTC AGACGACACA 

5 51 TGACCTGCGG CAATACTGCC CCTACTTCTT CAAGCTCGCG GTTAATAAAG 
601 ATGCGGACCG CCTGGAAGGT GCGCGTCGCA GGATCCTGCC CCCGCTCGCG 
651 AGTACGGACG TTTTGTGCCA CGATCTGCGC CAGCTTGCGG GTTGTATCGA 
701 TTGGACTTTC CGCCCGTTGC GCAACAATGG CGCGCGCAAT cCGGCGGCTa 
751 AACCGCTCTT CACCATAA 

This corresponds to the amino acid sequence <SEQ ID 896; ORF 239>: 



\ 



WO yy/5728U 



FCI7U5yy/U*540 



548 

m239 .pep 

1 MLHHKGXARN RXMEVLFFCR RPDRFWRQT RLLQPHLRI I LLQGDFLFFR 
51 LIQSCEIEPV LVLLHHNGKS GNAHRKQQKE IQFVHCHSDV FLCDCSGIGP 
101 AVRSATRKTA LLALGLAAIS ASPGFNALPT IFRGSSGKSA SLTAAQRGRG 
151 ACCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 
201 MRTAWKVRVA GSCPRSRV RT FCATICASLR WSIGL SARC ATMARAIRRL 
251 NRSSP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 239 shows 93.7% identity over a 255 aa overlap with a predicted ORF (ORF 239.ng) 
from N. gonorrhoeae: 

m239/g239 

10 20 30 40 50 60 

m23 9 .pep 
g239 



m23 9 .pep 
g239 



m2 3 9 . pep 
g239 



m239 .pep 
g239 



m23 9 .pep 

g239 

250 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 897>: 

a239. seq 

1 ATGCTCCACC ATAAAGGTAT TGCCCGAAAC CGGCGGATGG AGGTTTTGTT 

51 TTTCTGCCGC CGCCCTGATC GCTTCGTGGT TCGCCAAACG CGCCTGTTGC 

101 AGCCTCATTT GCGCATAATC CTGCTCCAAG GCGATTTCCT GTTTTTTCGC 

151 CTTATCCAAA GCTGTGAAGT TGAGCCTGTA CTGGTTTTGC TGCATCACAA 

201 CGGAAAAAGC GGAAACGCAC ACCGCAAGCA GCAGAAGGAA ATTCAATTTG 

251 TTCATTGCCA TTCAGACGTT TTTCTCTGTG ATTGTTCCGG TATCGGACCG 

301 GCAGTCCGCT CCGCCACACG CAAAACCGCA CTTCTCGCCC TCGGATTGGC 

351 GGCAATTTCC GCCTCACCCG GCTTTAATGC CCTGCCCGCG ATTTTCAGGG 

4 01 GCGGCTCGGG CAAATCCGCT TCCCTGACCG CCGCCCAGCG CGGCAGGGGC 

4 51 GCGTGTTGCG AATATTTTTT GACAAACTGC TTCACAATGC GGTCTTCCAA 

501 CGAATGGAAA GCAATGACCG CCAAACGTCC GCCCTCTTTC AGACGACACA 

551 TGACCTGCGG CAATACTGCC CCTACTTCTT CAAGCTCGCG GTTAATAAAG 

601 ATGCGGATTG CCTGGAAGGT GCGCGTCGCA GGATCCTGCC CCCGCTCGCG 

651 AGTACGGACG TTTTGTGCCA CGATCTGCGC CAGCTTGCGG GTTGTATCGA 

701 TTGGACTTTC CGCCCGTTGC GCAACAATGG CGCGCGCAAT CTGGCGGCTA 

751 AACCGCTCTT CACCATAA 



MLHHKGXARNRXMEVLFFCRRPDRFWRQTRLLQ PHLRI I LLQGDFLFFRLI QS CE I EPV 

hllll 1 1 1 1 II I M M 1 1 M 1 1 Ml M 1 1 M M I II M Ml Ml II I M I M M II I 

MFHHKGIARNRRMEVLFFCRRPDRFVIRQTRLLQPHLRIILLQGDFLFFRLVQSCEVEPV 
10 20 30 40 50 60 

70 80 90 100 110 120 

LVLLHHNGKSGNAHRKQQKEIQFVHCHSDVFLCDCSGIGPAVRSATRKTALLALGLAAIS 

I M I M M 1 1 1 1 1 1 II I IMM 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 r 

LVLLHHNGKS GNAHRKQQKE I RFVHCRSDVFLCYYSGIGPAVRSATRKTALIiALGLAAIS 
70 80 90 100 110 120 

130 140 150 160 170 180 

ASPGFNALPTIFRGSSGKSASLTAAQRGRGACCEYFLTNCFTMRSSNEWKAMTAKRPPSF 

I M II M 1 1 1 II I Ml I III 1 1 MM 1 1 Ml II M Ml 1 1 M 1 1 M M 1 1 M M M M I 

ASPGFNALPT I FRGGSGKSASLTAAQLGRGSCCEYFLTNCFTMRSSNEWKAMTAKRP PS F 
130 140 150 160 170 180 

190 200 210 220 230 240 

RRHMTCGNTAPTSSSSRLIKMRTAWKVRVAGSCPRSRVRTFCATICASLRWSIGLSARC 

Mill II Ml I llllll Mill I Mill III III III I MM Mil III II II I II III 

RRHMTCGNTAPTSSSSRLIKMRIAWKVRVAGSCPRSRVRTFCATICASLRWSIGLSARC 
190 200 210 220 230 240 

250 

ATMARAI RRLNRS S PX 

M 1 1 M I I M I II II 

ATMART I WRLNRS S PX 
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This corresponds to the amino acid sequence <SEQ ID 898; ORF 239.a>: 

a239.pep 

1 MLHHKGIARN RRMEVLFFCR RPDRFWRQT RLLQPHLRII LLQGDFLFFR 

51 LIQSCEVEPV LVLLHHNGKS GNAHRKQQKE IQFVHCHSDV FLCDCSGIGP 

101 AVRSATRKTA LLALGLAAI S ASPGFNALPA IFRGGS GKSA SLTAAQRGRG 

151 ACCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 

201 MRIAWKVRVA GSCPRSRVRT FCATICASLR VVSIGLSARC ATMARAIWRL 

251 NRSSP* 

m239/a239 97.3% identity in 255 aa overlap 

10 20 30 40 50 60 

m239 . pep MLHHKGXARNRXMEVLFFCRRPDRFWRQTRLLQPHLRIILLQGDFLFFRLIQSCEIEPV 
MINI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : t ! I 
a239 MLHHKGIARNRRMEVLFFCRRPDRFWRQTRLLQPHLRIILLQGDFLFFRLIQSCEVEPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m23 9.pep LVLLHHNGKSGNAHRKQQKEIQFVHCHSDVFLCDCSGIGPAVRSATRKTALLALGLAAIS 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 ii 1 1 1 1 

a23 9 LVLLHHNGKSGNAHRKQQKEIQFVHCHSDVFLCDCSGIGPAVRSATRKTALLALGLAAIS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m23 9.pep ASPGFNALPTIFRGSSGKSASLTAAQRGRGACCEYFLTNCFTMRSSNEWKAMTAKRPPSF 
I I I I I I I I I : I I I I : I I I I I I I I I II I I M I I I I I II I I I I I I I I I I I I I I II I I I I I I I 
a239 ASPGFNALPAIFRGGSGKSASLTAAQRGRGACCEYFLTNCFTMRSSNEWKAMTAKRPPSF 

130 140 150 160 170 180 

190 200 210 220 230 240 

• m23 9 . pep RRHMTCGNTAPTSSSSRLIKMRTAWKVRVAGSCPRSRVRT FCATICASLRWSIGLSARC 

I I I I I I I I I I I I I 11 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I i I I I I I I 
a23 9 RRHMTCGNTAPTSSSSRLIKMRIAWKVRVAGSCPRSRVRT FCATICASLRWSIGLSARC 

190 200 210 220 230 240 

250 

m23 9 . pep ATMARAIRRLNRSSPX 
I I I I I I I I I I I I I I I 
a2 3 9 ATMARAI WRLNRS S PX 

250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 899>: 



g240 . seq 






1 


atgatagaag 


tcatacattt 


51 


ttgtgccgac 


gttggacgat 


101 


gggtaaacat 


gggtatcatc 


151 


ctgcgtattc 


agccgttcgt 


201 


tcgcaatcac 


gaacggtttg 


251 


acggcacggt 


tgcgccgctg 


301 


atcgtaggcg 


ggcgtattgg 


351 


ccaaaaccac 


caccgttccg 


401 


ttcaactctt 


cataggccaa 


451 


gatgtctttg 


ccgttttgcg 


501 


ccacaatatt 


gccctgccgg 


551 


ccatacaggc 


tgtcttcaag 


601 


gtcaatatcg 


gtaaatccga 


651 


aatggcgttt 


taa 



This corresponds to the amino acid sequence <SEQ ID 900; ORF 240.ng>: 

g24 0 .pep 

1 MIEVIHFFGA ETRRQFACAD VGRFLHNAAH IQRGVNMGI I AHGRRSDFIR 

51 LRIQPFVQIG FARIQCLRNH ERFDCRTRFD HIGYGT VAPL FAVCPAGSVG 

101 IVGGRIGQGE DFPRAGIQNH HRSGFCLMVF DRLVQLFIGQ GLNPLIEGKD 

151 DVFAVLRCFI ARGVQAVHNI ALPVPQNNFR AVFAIQAVFK RKFQTFLTFA 
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201 VNIGKSDDVC KQVAHRVMAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 901>: 

m24 0 . seq 

1 ATGATAGAAG TCATACATTT CTTCGGCACC GAAACGCGCA GACAGTTTGC 

51 TTGTGCCGAC GTTGGACGAT TTCTGCATGA TGCCGCGCAC ATCCAAAGAG 

101 GGGTAAACAT GGGTATCGCG CACGGGAGAC GGTCCGATTT TATAAGGCTG 

151 CGTATTCAGC CGTTCGTTCA AATCGGTTTT GCCCGCATCC AATGCCTTCG 

2 01 CAATCACAAA CGGTTTGATT GCCGAACCGG GTTCGATCAT ATCGGTTACG 
251 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 

3 01 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 
351 ArACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 
401 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 
451 GTCTTTGCCG TTTTTCGGGG CTTTAkTGCG CGGGGAGTCC AAGCTGTCCA 
501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 
551 TGCAAGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 
601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 
651 GGCGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 902; ORF 240>: 

m24 0 . pep 

1 MIEVIHFFGT ETRRQFACAD VGRFLHDAAH IQRGVNMGIA HGRRSDFIRL 

51 RIQPFVQIGF ARIQCLRNHK RFDCRTGFDH IGYG TVAPLF AVCPAGPVGI 

101 VGGRIGQGED FPRAGIQXHH RSGFCLMVFD RLVQLFIGQG LNPLIEGKDD 

151 VFAVFRGFXA RGVQAVHNIA LPVPQNDFRA VFAMQAVFKR KFQTFLTFAV 

201 NIGKSDDVCK QVAHRVMAF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 240 shows 94.5% identity over a 220 aa overlap with a predicted ORF (ORF 240.ng) 
from N. gonorrhoeae: 

m240/g240 

10 20 30 40 50 59 

m240 .pep MI EV IHFFGTETRRQFACADVGRFLHD AAHI QRGVNMGI - AHGRRSDFI RLRI QP FVQ I G 

IN M Ml I U II Ml 1 1 Ml II 1 1 1 :|| 1 1 1! 1 1 1 1 1 1 1 1 1 M M I M II I MIM 1 1 

g24 0 MIEVIHFFGAETRRQFACADVGRFLHNAAHIQRGVNMGIIAHGRRSDFIRLRIQPFVQIG 

10 20 30 40 50 60 

60 70 80 90 100 110 119 

m24 0 .pep FARIQCLRNHKRFDCRTGFDHIGYGTVAPLFAVCPAGPVGIVGGRIGQGEDFPRAGIQXH 

MMMMMMMMI MMMMMMMMMI MMMMMMMMMM I 

g24 0 FARIQCLRNHERFDCRTRFDHIGYGTVAPLFAVCPAGSVGIVGGRIGQGEDFPRAGIQNH 

70 80 90 100 110 120 

120 130 140 150 160 170 179 

m24 0 . pep HRSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVFRGFXARGVQAVHNI ALPVPQNDFR 

Ml I M II III 1 1 Ml IMM III 1 1 1 M 1 1 M MM 1 MMMMMMMMMM 

g240 HRSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVLRCFIARGVQAVHNIALPVPQNNFR 

130 140 150 160 170 180 

180 190 200 210 220 

m2 4 0 . pep AVFAMQAVFKRKFQTFLTFAVNIGKSDDVCKQVAHRVMAFX 

MMMMMMMMMM MMMMMMMMMM 

g2 4 0 AVFAI QAVFKRKFQTFLTFAVNI GKSDDVCKQVAHRVMAF 

190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 903>: 

a240.seq 

1 ATGATAGAAG TCATACATTT CTTCGGCACC GAAACGCGCA GACAGTTTGC 

51 TTGTGCCGAC GTTGGACGAT TTCTGCATGA TGCCGCGCAC ATCCAAAGAG 

101 GGGTAAACAT GGGTATCGCG CACGGGAGAC GGTCCGATTT TATAAGGCTG 

151 CGTATTCAGC CGTTCGTTCA AATCGGTTTT GCCCGCATCC AATGCCTTCG 

201 CAATCACAAA CGGTTTGATT GCCGAACCGG GTTCGATCAT ATCGGTTACG 
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251 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 

301 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 

351 AAACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 

401 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 

451 GTCTTTGCCG TTTTTCGGGG CTTTATTGCG CGGGGAGTCC AAGCTGTCCA 

501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 

551 TGCAGGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 

601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 

651 GGCGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 904; ORF 240.a>: 

a240 .pep 

1 MIEVIHFFGT ETRRQFACAD VGRFLHDAAH IQRGVNMGIA HGRRSDFIRL 

51 RIQPFVQIGF ARIQCLRNHK RFDCRTGFDH IGYGT VAPLF AVCPAGPVGI 

101 VGGRIGQGED FPRAGIQNHH RSGFCLMVFD RLVQLFIGQG LNPLIEGKDD 

151 VFAVFRGFIA RGVQAVHNIA LPVPQNDFRA VFAMQAVFKR KFQTFLTFAV 

201 NIGKSDDVCK QVAHRVMAF* 

m240/a240 99. 1 % identity in 2 1 9 aa overlap 

10 20 30 40 50 60 

m240.pep MIEVIHFFGTETRRQFACADVGRFLHDAAHIQRGVNMGIAHGRRSDFIRLRIQPFVQIGF 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I 
a240 MIEVIHFFGTETRRQFACADVGRFLHDAAHIQRGVNMGIAHGRRSDFIRLRIQPFVQIGF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m24 0 . pep ARIQCLRNHKRFDCRTGFDHIGYGTVAPLFAVCPAGPVGIVGGRIGQGEDFPRAGIQXHH 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I M | | M | || 
a24 0 ARIQCLRNHKRFDCRTGFDHIGYGTVAPLFAVCPAGPVG I VGGRIGQGED FPRAGIQNHH 

70 80 90 100 110 120 

130 ' 140 150 160 170 180 

m24 0 . pep RSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVFRGFXARGVQAVHNIALPVPQNDFRA 

I M M I I I II I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I | | | | | | | | | | 

a24 0 RSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVFRGFIARGVQAVHNIALPVPQNDFRA 

130 140 150 160 170 180 

190 200 210 220 

m2 4 0 . pep V FAMQAV FKRK FQT FLT FAVN I G K S D DVCKQVAH RVMAFX 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I 
a2 4 0 V FAMQAV FKRKFQT FLT FAVN IGKS D DVCKQVAH RVMAFX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 905>: 

g241.seq 

1 ATGATAGAAG TCATACATTT CTTCGGCACC GAAACGCGCA GACAGTTTGC 

51 TTGTGCCGAC GTTGGACGAT TTCTGCATGA TGCCGCGCAC ATCCAAAGAG 

101 GGGTAAACAT GGGTATCGCG CACGGGAGAC GGTCCGATTT TATAAGGCTG 

151 CGTATTCAGC CGTTCGTTCA AATCGGTTTT GCCCGCATCC AATGCCTTCG 

201 CAATCACAAA CGGTTTGATT GCCGAACCGG GTTCGATCAT ATCGGTTACG 

251 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 

3 01 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 
351 ArACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 

4 01 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 
451 GTCTTTGCCG TTTTTCGGGG CTTTAkTGCG CGGGGAGTCC AAGCTGTCCA 
501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 
551 TGCAAGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 
601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 
651 GGCGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 906; ORF 241.ng>: 

g241 .pep 

1 MPTRPTRAAN PPTPTTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 
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51 ANRRENSHNA QPTYLLHPSN KNPSETEQTL FRRHQIPPSC RQSVWMTVR 
101 TVDMTVCDFL IGCIAHAFNR SFKADFHACQ RMVAVHHRLA VGNIGYTIDD 
151 NIAGFRIVRF KHHTDLDFNR ERARIFNTDQ LRIMLTERIV GRKRHFDRIA 
201 GILTVQRLFH QRENAWTAV QIRN RFFGFI QKLIVGIIHL IM QRNHGIFC 
251 NSHICPFRNS RLITGAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 907>: 

m241.seq (partial) 

1 . . CGGCAATCAG TGGTGGTGAT GACCGTGCGG GCCGTGGACA TGACCGTGTG 

51 CGATTTCCTC ATCGGATGCA TCGCGCACGC TTTCAACTGT AGCCTTAAAG 

101 CGGATTTTCA TGCCTGCCAA AGGATGGTTG CCGTCCACCA CCGCCTTGCC 

151 GTCGGCAACA TCGGTTACAC GATAGACGAC AACATCGCCG GTTTCAGGAT 

201 CGTCGGCTTC AAACATCATG CCGACTTCGA CTTCAACAGG GAACACGCCC 

251 GCATCTTCGA TACGGACCAA CTCCGGATCC TGCTCGCCGA ACGCATCGTC 

301 GGGCGACAGC GCCACATCGA CCGTATCGCC GGCATCCTTA CCGTGCAACG 

3 51 CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT AACCGCCGTG CAGATACGCA 

4 01 ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA TTGTTGGCAT CATACATCTC 
4 51 ATAATGCAGC GAAACCACGG AATTTTTCAC GATAGCCATA TTTGTCCTTT 
501 CAGGAACAGC AGATTAATTA CAGGCGCATT CTAA 

This corresponds to the amino acid sequence <SEQ ID 908; ORF 24 1>: 

m241.pep (partial) 

1 ..RQSVWMTVR AVDMTVCDFL IGCIAHAFNC SLKADFHACQ RMVAVHHRLA 

51 VGNIGYTIDD NIAGFRIVGF KHHADFDFNR EHARIFDTDQ LRILLAERIV 

101 GRQRHIDRIA GILTVQRLFH QRENAWTAV QIRN RFFGFV QKLIVGIIHL 

151 IMQRNHGIFH DSHICPFRNS RLITGAF* ~ " 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 241 shows 91.5% identity over a 177 aa overlap with a predicted ORF (ORF 241.ng) 
from N. gonorrhoeae: 

m24l/g241 

10 20 30 

m241 .pep RQSVWMTVRAVDMTVCDFLIGCIAHAFNC 

1 1 1 1 !M 1 1 1 HI ! 1 1 M 1 1 M 1 1 1 1: 1 1 

9 241 QPTYLLHPSNKMPSETEQTLFRRHQIPPSCRQSVWMWRTVDMTVCDFLIGCIAHAFNR 

70 80. 90 100 110 120 

40 50 60 70 80 90 

m24 1 . pep S LKAD FHACQRMV AVHHRLAVGN I G YT I DDN I AG FR I VG FKHHAD FD FNREHAR I FDTDQ 

i ^ 1 1 1 r i 1 1 r 1 1 r r 1 1 1 1 1 1 1 1 1 f 1 1 j 1 1 1 1 1 f f 1 1 1 1 iiihhiMihiiihiii 

9 24 1 S FKADFHACQRMV AVHHRLAVGN I G YT I DDN I AG FR I VR FKHHTDLD FNRERAR I FNTDQ 

!30 140 150 160 170 180 

100 110 120 130 140 150 

m241 . pep LRILLAERIVGRQRHIDRIAGILTVQRLFHQRENAWTAVQIRNRFFGFVQKLIVGIIHL 
NI : l : lllll|:||:||||Mllllll|||||||Mllllllllllli:|||||||||| 
9 241 LRIMLTERIVGRKRHFDRIAGILTVQRLFHQRENAWTAVQIRNRFFGFI QKLIVGIIHL 

190 200 210 220 230 240 

160 170 
m241 . pep IMQRNHGIFHDSHICPFRNSRLITGAFX 

Illllllll UIIIIIIIIIIIIIIM 

9 2 4 1 I MQRNHG I FCNSH I CP FRNS RL I TGAFX 

250 260 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 909>: 

a241.seq 

1 ATGCCAACAC GTCCAACTCG CGCCGCAAAG CATCCAACCC CGCCAACCTG 

51 GCTTCAGACG GCATACTGCC CTCGTCCGCC ATATCGTCCG CCGTCCGTGC 

101 AAACGCATAC ACCGCATGAA CCGGCTTCCT CAACCTGCGC GGCAAAATCA 

151 GCGAACCGAC GGGAAAATTT TCATAATGCC CAACCGACAT ACCTTCTCCA 

201 TCCATCAAAC AAAATGCCGT CTGAAATGGA ACAAACCCTT TTCAGACGGC 
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251 ATCAGATACC TCCAAGCTGC CGGCAATCAG TGGTGGTGAT GACCGTGCGG 

301 ACCGTGGACA TGACCGTGTG CGATTTCCTC ATCGGATGCA TCGCGCACAC 

351 TTTCAACCGT AGCCTTAAAG CGGATTTTCA TGCCTGCCAA AGGATGGTTG 

401 CCGTCCACCA CCGCCTTACC GTCGGCAACA TCGGTTACAC GATAGACGAC 

451 AACATCGCCG GTTTCAGGAT CGTCGGCTTC AAACATCATG CCGACTTCGA 

501 CTTCAACAGG GAACACGCCC GCATCTTCAA TACGGACCAA CTCCGGATCC 

551 TGCTCGCCGA ACGCATCGTC GGGCGAAAGC GCCACATCGA CCGTATCGCC 

601 GGCATCCTTA CCGTGCAACG CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT 

651 AACCGCCGTG CAGATACGCA ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA 

701 TTGTTGGCAT CATACATCTC ATAATGCAGC GAAACCACGG AATTCTTCAC 

751 GATAGCCATA TTTGTCCTTT CAGGAACAGC AGATTAATTA CAGGCGCATT 

801 CTAA 



This corresponds to the amino acid sequence <SEQ ID 910; ORF 241.a>: 

a241.pep 

1 MPTRPTRAAK HPTPPTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 

51 ANRRENFHNA QPTYLLHPSN KMPSEMEQTL FRRHQIPPSC RQSVVVMTVR 

101 TVDMTVCDFL IGCIAHTFNR SLKADFHACQ RMVAVHHRLT VGNIGYTIDD 

151 NIAGFRIVGF KHHADFDFNR EHARIFNTDQ LRILLAERIV GRKRHIDRIA 

201 GILTVQRLFH QRENAWTAV QIRNR FFGFV QKLIVGIIHL IMQRNHGILH 

251 DSHICPFRNS RLITGAF* " " 



m241/a241 96.0% identity in 177 aa overlap 

o,i 10 20 30 

mz 4 1 . pep RQSWVMTVRAVDMTVCDFLIGCIAHAFNC 

„ n(nvTTU I I I I I I I I I I : I I I I Nihil 

QPTYLLHPSNKMPSEMEQTLFRRHQIPPSCRQSVWMTVRTVDMTVCDFLIGCIAHTFNR 
70 80 90 100 110 120 



a241 



40 50 60 70 80 90 

m24 1 . pep SLKADFHACQRMVAVHHRLAVGNIGYTIDDNIAGFRIVGFKHHADFDFNREHARIFDTDQ 

NMIIIIIIMIIIIIIhlllllllMIIIIMiiiiiMiiiiiiiii,,, 

SLKADFHACQRMVAVHHRLTVGNIGYTIDDNIAGFRIVGFKHHADFDFNREHARIFNTDQ 
130 140 150 160 170 180 



a241 



100 HO 120 130 140 150 

m241.pep LRILLAERIVGRQRHIDRIAGILTVQRLFHQRENAWTAVQIRNRFFGFVQKLIVGIIHL 

HIIIMIIIIhllllllllllllMllilllllllliiiiiiiH MllliM 

LRI LLAE R I VGRKRH I DR I AG I LT VQRL FHQRENAWTAVQI RNR FFGFVQKL I VG 1 1 H L 
190 200 210 220 230 240 



a241 



160 170 
m2 41.pep IMQRNHGIFHDSHICPFRNSRLITGAFX 
I I I I I I I I : I I I I I I I I I I I I I I I I I | | 
a241 IMQRNHGILHDSHICPFRNSRLITGAFX 

250 260 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEO ID 91 1>* 

g241-l.seq X 

1 ATGATAGAAG TCATACATTT CTTCGGCACC GAAACGCGCA GACAGTTTGC 

51 TTGTGCCGAC GTTGGACGAT TTCTGCATGA TGCCGCGCAC ATCCAAAGAG 

101 GGGTAAACAT GGGTATCGCG CACGGGAGAC GGTCCGATTT TATAAGGCTG 

151 CGTATTCAGC CGTTCGTTCA AATCGGTTTT GCCCGCATCC AATGCCTTCG 

201 CAATCACAAA CGGTTTGATT GCCGAACCGG GTTCGATCAT ATCGGTTACG 

251 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 

301 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 

351 ArACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 

401 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 

4 51 GTCTTTGCCG TTTTTCGGGG CTTTAkTGCG CGGGGAGTCC AAGCTGTCCA 

501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 

551 TGCAAGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 

601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 

651 GGCGTTTTAA 



This corresponds to the amino acid sequence <SEQ ID 912; ORF 241-1 .ng>: 
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g241-l.pep 

1 MPTRPTRAAN PPTPTTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 

51 ANRRENSHNA QPTYLLHPSN KMPSETEQTL FRRHQIPPSC RQSVVVMTVR 

101 TVDMTVCDFL IGCIAHAFNR SFKADFHACQ RMVAVHHRLA VGNIGYTIDD 

151 NIAGFRIVRF KHHTDLDFNR ERARIFNTDQ LRIMLTERIV GRKRHFDRIA 

201 GILTVQRLFH QRENAVVTAV QIRNR FFGFI QKLIVGIIHL IM QRNHGIFC 

251 NSHICPFRNS RLITGAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 913>: 

m241-l . seq 

1 ATGCCAACAC GTCCAACTCG CGCTGCAAAC CCTCCAACCC CGCCAACCTG 

51 GCTTCAGACG GCATACTGCC CTCGTCCGCC ATATCGTCCG CCGTCCGTGC 

101 AAACGCGTAC ACCGCGTGAA CCGGCTTCCT CAACCTGCGC GGCAAAATCA 

151 GCGAACCGAC GGGAAAATTC TCATAATGCC CAACCGACAT ACCTTCTCCA 

201 TCCATCAAAC AAAATGCCGT CTGAAACGGA ACAAACCCTT TTCAGACGGC 

251 ATCAGATACC TCCAAGCTGC CGGCAATCAG TGGTGGTGAT GACCGTGCGG 

301 GCCGTGGACA TGACCGTGTG CGATTTCCTC ATCGGATGCA TCGCGCACGC 

351 TTTCAACTGT AGCCTTAAAG CGGATTTTCA TGCCTGCCAA AGGATGGTTG 

401 CCGTCCACCA CCGCCTTGCC GTCGGCAACA TCGGTTACAC GATAGACGAC 

4 51 AACATCGCCG GTTTCAGGAT CGTCGGCTTC AAACATCATG CCGACTTCGA 

501 CTTCAACAGG GAACACGCCC GCATCTTCGA TACGGACCAA CTCCGGATCC 

551 TGCTCGCCGA ACGCATCGTC GGGCGACAGC GCCACATCGA CCGTATCGCC 

601 GGCATCCTTA CCGTGCAACG CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT 

651 AACCGCCGTG CAGATACGCA ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA 

701 TTGTTGGCAT CATACATCTC ATAATGCAGC GAAACCACGG AATTTTTCAC 

751 GATAGCCATA TTTGTCCTTT CAGGAACAGC AGATTAATTA CAGGCGCATT 

801 CTAA 

This corresponds to the amino acid sequence <SEQ ID 914; ORF 241-1>: 

m241-l.pep 

1 MPTRPTRAAN PPTPPTWLQT AYCPRPPYRP PSVQTRTPRE PASSTCAAKS 

51 ANRRENSHNA QPTYLLHPSN KMPSETEQTL FRRHQIPPSC RQSVVVMTVR 

101 AVDMTVCDFL IGCIAHAFNC SLKADFHACQ RMVAVHHRLA VGNIGYTIDD 

151 NIAGFRIVGF KHHADFDFNR EHARIFDTDQ LRILLAERIV GRQRHIDRIA 

201 GILTVQRLFH QRENAVVTAV QIRNR FFGFV QKLIVGIIHL IM QRNHGIFH 

251 DSHICPFRNS RLITGAF* 

m241-l/g241-l 93.3% identity in 267 aa overlap 

10 20 30 40 50 60 

m241-l.pep MPTRPTRAANPPTPPTWLQTAYCPRPPYRPPSVQTRTPREPASSTCAAKSANRRENSHNA 
I 1 I I I II I I i M I I I I I i I I I I I ! I I I I I I I I I I : I I: I I I I I I I I I I I I ( I I I I I I M 
g241 MPTRPTRAANPPTPTTWLQTAYCPRPPYRPPSVQTHTPHEPASSTCAAKSANRRENSHNA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 4 1-1. pep QPTYLLHPSNKMPSETEQTLFRRHQIPPSCRQSVWMTVRAVDMTVCDFLIGCIAHAFNC 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I : I II I I I I I I I I I I I I I I ! 
g241 QPTYLLHPSNKMPSETEQTLFRRHQIPPSCRQSVVVMTVRTVDMTVCDFLIGCIAHAFNR 

70 80 90 100 110 120 

130 140 150 160 170 180 

m241-l.pep SLKADFHACQRMVAVHHRLAVGNIGYTIDDNIAGFRIVGFKHHADFDFNREHARIFDTDQ 
I : I I I I I I II I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I I I I : I I I 
g241 SFKADFHACQRMVAVHHRLAVGNIGYTIDDNIAGFRIVRFKHHTDLDFNRERARIFNTDQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m241-l.pep LRILLAERIVGRQRHIDRIAGILTVQRLFHQRENAVVTAVQIRNRFFGFVQKLIVGIIHL 
I I I : I : I I I I I I : I I : I I I I I I I I I I I I I I I I I I II I I II I I I i I II I I : I 1 II I I I I I I 
g241 LRIMLTERIVGRKRHFDRIAGILTVQRLFHQRENAVVTAVQIRNRFFGFIQKLIVGIIHL 

190 200 210 220 230 240 

250 260 
m2 4 1 - 1 . pep IMQRNHGI FHDSHICPFRNSRLITGAFX 
I I I I I I II i : I I I I I I I I I I I 1 II II I 
g241 IMQRNHGI FCNSHICPFRNSRLITGAFX 

250 260 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 915>: 

a241-l . seq 

1 ATGCCAACAC GTCCAACTCG CGCCGCAAAG CATCCAACCC CGCCAACCTG 
51 GCTTCAGACG GCATACTGCC CTCGTCCGCC ATATCGTCCG CCGTCCGTGC 
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101 AAACGCATAC ACCGCATGAA CCGGCTTCCT CAACCTGCGC GGCAAAATCA 

151 GCGAACCGAC GGGAAAATTT TCATAATGCC CAACCGACAT ACCTTCTCCA 

201 TCCATCAAAC AAAATGCCGT CTGAAATGGA ACAAACCCTT TTCAGACGGC 

251 ATCAGATACC TCCAAGCTGC CGGCAATCAG TGGTGGTGAT GACCGTGCGG 

301 ACCGTGGACA TGACCGTGTG CGATTTCCTC ATCGGATGCA TCGCGCACAC 

351 TTTCAACCGT AGCCTTAAAG CGGATTTTCA TGCCTGCCAA AGGATGGTTG 

4 01 CCGTCCACCA CCGCCTTACC GTCGGCAACA TCGGTTACAC GATAGACGAC 

451 AACATCGCCG GTTTCAGGAT CGTCGGCTTC AAACATCATG CCGACTTCGA 

501 CTTCAACAGG GAACACGCCC GCATCTTCAA TACGGACCAA CTCCGGATCC 

551 TGCTCGCCGA ACGCATCGTC GGGCGAAAGC GCCACATCGA CCGTATCGCC 

601 GGCATCCTTA CCGTGCAACG CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT 

651 AACCGCCGTG CAGATACGCA ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA 

701 TTGTTGGCAT CATACATCTC ATAATGCAGC GAAACCACGG AATTCTTCAC 

751 GATAGCCATA TTTGTCCTTT CAGGAACAGC AGATTAATTA CAGGCGCATT 

801 CTAA 

This corresponds to the amino acid sequence <SEQ ID 916; ORF 241-l.a>: 

a241-l.pep 

1 MPTRPTRAAK HPTPPTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 

51 ANRRENFHNA QPTYLLHPSN KMPSEMEQTL FRRHQIPPSC RQSVVVMTVR 

101 TVDMTVCDFL IGCIAHTFNR SLKADFHACQ RMVAVHHRLT VGNIGYTIDD 

151 NIAGFRIVGF KHHADFDFNR EHARIFNTDQ LRILLAERIV GRKRHIDRIA 

201 GILTVQRLFH QRENAVVTAV QIRN RFFGFV QKLIVGIIHL IM QRNHGILH 

251 DSHICPFRNS RLITGAF* 

m241-l/a241-l 95.1% identity in 267 aa overlap 

10 20 30 40 50 60 

m2 4 1- 1 . pep MPTRPTRAANPPTPPTWLQTAYCPRPPYRPPSVQTRTPREPASSTCAAKSANRRENSHNA 
I I I I I I I I I : I I I II I I I I I I I I I I I I I I I I I I I : I I : I I I I I I N II I I I I I ! I Ml 
a241 MPTRPTRAAKHPTPPTWLQTAYCPRPPYRPPSVQTHTPHEPASSTCAAKS ANRRENFHNA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m241-l .pep QPTYLLHPSNKMPSETEQTLFRRHQIPPSCRQSVVVMTVRAVDMTVCDFLIGCIAHAFNC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I! I I I II : I I M I I I I I I I I I I I : I I 
a241 QPTYLLHPSNKMPSEMEQTLFRRHQIPPSCRQSVVVMTVRTVDMTVCDFLIGCIAHTFNR 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 4 1 - 1 . pe p SLKADFHACQRMVAVHHRLAVGNIGYTI DDN I AGFRI VGFKHH ADFDFNREHARI FDTDQ 
I I I I I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I : I I I 
a241 SLKADFHACQRMVAVHHRLTVGNIGYTIDDNI AGFRI VGFKHHADFDFNREHARIFNTDQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m241-l . pep LRILLAERIVGRQRHIDRIAGILTVQRLFHQRENAVVTAVQIRNRFFGFVQKLIVGIIHL 
I I I I M I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
a241 LRILLAERIVGRKRHIDRIAGILTVQRLFHQRENAVVTAVQIRNRFFGFVQKLIVGIIHL 

190 200 210 220 230 240 

250 260 
m2 4 1- 1 . pep IMQRNHGI FHDSHICPFRNSRLITGAFX 
I I I I I i I I : I I I I I I I I I I I I I I I I I I 1 
a241 IMQRNHGI LHDSHICPFRNSRLITGAFX 

250 260 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 91 7>: 



g242 . seq 




1 


atgatcggcg 


51 


cgctggcggg 


101 


agcaggaaca 


151 


cttgccgggc 


201 


tttcgtcgcg 


251 


gctttggcga 


301 


gaccaaacac 


351 


cgaggtattc 


401 


gcatccagca 


451 


tttcttccac 



aacttgttgt tttgttcgtg atcgagcact tcaagcaacg 

atcgccccga aagtcgctgc ccaatttgtc gatttcgtcg 

acgggtttct tacgcctgct tttgccatat tctgcaaaat 

atagagccga tataggtacg gcggtgcccg cggatttcgc 

cacgccgccc aaggccatac ggacatattt ccgccccgtt 

tggattcgcc caaagaggtt ttgcccacgc ccggagggcc 

agaatcggac ctttgagctt gtccatacgt ttttggacgg 

caaaatccgt tctttgactt tttccaggcc gtagtggtcg 

ccagtccggc tttggcgatg tctttgctga cgcgggattt 

ggcagtccga gcagggtgtc gatgtagttg cgtacgacgg 
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501 tggattcggc agacatcggc ggcatcattt tgagtttttt cagttcggac 

551 aggcattttt cttccgcttc tttggtcata cccgcctttt tgatgcctgc 

601 ctccaaggca tccagttcgc cgttttcgtc ttcttcgccc aattctttgt 

651 gtatcgcttt aatctgttcg ttcagataat attcgcgttg ggatttttcc 

701 atttggcgtt tgacgcgtcc gcgtatgcgt ttttcggcct gcataatgtc 

751 gagttcggat tccagctttg ccagcaggaa ttccatccgt ttgccgattt 

801 cgggaatctc caaaatctgt tggcgttgcg ccagtttcaa ctgcaaatgc 

851 gctgcgaccg tatcggttag 

This corresponds to the amino acid sequence <SEQ ID 918; ORF 242.ng>: 

g242 .pep 

1 MIGELWLFV IEHFKQRAGG IAPKVAAQFV DFVEQEQRVS YACFCHILQN 

51 LAGHRADIGT AVPADFAFVA HAAQGHTDIF PPRCFGDGFA QRGFAHARRA 

101 DQTQNRTFEL VHTFLDGEVF QNPFFDFFQA VWGIQHQSG FGDVFADAGF 

151 FLPRQSEQGV DWAYDGGFG RHRRHHFEFF QFGQAFFFRF FGHTRLFDAC 

201 LQGIQFAVFV FFAQFFV YR F NLFVQIIFAL GFFHLA FDAS AYAFFGLHNV 

251 EFGFQLCQQE FHPFADFGNL QNLLALRQFQ LQMRCDRIG* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 919>: 

m242 . seq 

1 ATGATCGGCA AACTTGTTGT TTTGTTCGGG ATCGAGCACT TCGAGCAACG 

51 CGCTGGCGGG ATCGCCTCGG AAGTCGTTAC CCAATTTGTC GATTTCGTCG 

101 AGCAGGAACA AGGGGTTTTT CACGCCGGCT TTTGCCATAT TCTGCAAAAT 

151 CTTACCGGGC ATAGAGCCGA TATAGGTGCG GCGGTGTCCC CTGATTTCGC 

201 TTTCGTCGCG CACGCCGCCC AAAGCCATGC GGACATATTT CCGCCCCGTT 

2 51 GCTTTGGCGA TGGATTCGCC CAAAGAGGTT TTGCCCACGC CCGGAGGGCC 

3 01 GACCAGGCAC AGAATCGGGC CTTTGAGTTT GTCCATACGT TTTTGGACGG 

3 51 CGAGGTATTC CAAAATCCGT TCTTTGACTT TTTCCAGGCC GTAGTGGTCG 

4 01 GCATCCAGCA CCAGTCCGGC TTTGGCGATG TCTTTGCTGA CGCGGGATTT 
451 TTTCTTCCAC GGCAGCTCGA GCAAAGTGTC GATGTAGTTG CGTACGACGG 
501 TGGATTCCGC AGACATCGGT GGCATCATTT TGAGCTTTTT CAGTTCGGAC 
551 AGGCATTTTT CTTCCGCTTC TTTGGTCATA CCCGCCTTTT TGATATCTGC 
601 TTCCAAGGCA TCCAGTTCGC CGTTTTCGTC TTCTTCGCCC AGTTCTTTGT 
651 GTATCGCTTT AATCTGTTCG TTCAGATAAT ATTCGCGCTG GGATTTTTCC 
701 ATTTGGCGTT TGACGCGTCC GCGTATGCGT TTTTCGGCCT GCATAATGTC 
751 GAGTTCGGAT TCCAGCTGTG CCAGCAGGAA TTCCATCCGT TTGCCGATTT 
801 CGGGAATTTC CAAAATCTGT TGGCGTTGCG CCAGTTTCAA CTGCAAATGC 
851 GCTGCGACCG TATCGGTTAG 

This corresponds to the amino acid sequence <SEQ ID 920; ORF 242>: 

m242 .pep 

1 MIGKLWLFG IEHFEQRAGG IASEWTQFV DFVEQEQGVF HAGFCHILQN 

51 LTGHRADIGA AVSPDFAFVA HAAQSHADIF PPRCFGDGFA QRGFAHARRA 

101 DQAQNRAFEF VHTFLDGEVF QNPFFDFFQA VWGIQHQSG FGDVFADAGF 

151 FLPRQLEQSV DWAYDGGFR RHRWHHFELF QFGQAFFFRF FGHTRLFDIC 

201 FQGIQFAVFV FFAQFFV YR F NLFVQIIFAL GFFHLA FDAS AYAFFGLHNV 

251 EFGFQLCQQE FHPFADFGNF QNLLALRQFQ LQMRCDRIG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 242 shows 90.3% identity over a 289 aa overlap with a predicted ORF (ORF 242.ng) 
from N. gonorrhoeae: 

m242/g24290 . 3% identity in 289 aa overlap 
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QNPFFDFFQAVVVGIQHQSGFGDVFADAGFFLPRQLEQSVDVVAYDGGFRRHRWHHFELF 
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QNPFFDFFQAVWGIQHQSGFGDVFADAGFFLPRQSEQGVDWAYDGGFGRHRRHHFEFF 
130 140 150 160 170 180 

190 200 210 220 230 240 

QFGQAFFFRFFGHTRLFDICFQGIQFAVFVFFAQFFVYRFNLFVQI I FALGFFHLAFDAS 

II MINIM MINIM Mill II MMM II IMM II M MMM I M M I IMM 

QFGQAFFFRFFGHTRLFDACLQGIQFAVFVFFAQFFVYRFNLFVQI I FALGFFHLAFDAS 
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250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 92 1>: 

a242.seq 

1 ATGATCGGCG AACTTGTTGT TTTGCTCGGG ATCAAGCACT TCGAGCAACG 

51 CGCTGGCGGG ATCGCCCCGG AAGTCGCTAN CCAATTTGTC GATTTCGTCG 

101 AGCAGGAACA ATGGGTTTTT TACGCCGGCT TTTGCCATAT TCTGCAAAAT 

151 CTTACCGGGC ATGGAGCCGA TATAGGTGCG GCGGTGTCCC CGGATTTCGC 

201 TTTCGTCGCG CACGCCGCCC AAAGCCATGC GGACATATTT CCGCCCCGTT 

251 GCTTTGGCGA TGGATTCGCC CAAAGAGGTT TTGCCCACGC CTGGAGGGCC 

301 GACCAGGCAC AGAATCGGGC CTTTGAGTTT GTCCATACGT TTTTGGACGG 

351 CGAGGTATTC CAAAATCCGT TCTTTGACTT TTTCCAGGCC GTAGTGGTCG 

401 GTATCCAGCA CCAATCCGGC TTTGGCGATG TCTTTGCTGA CGCGGGATTT 

4 51 TTTCTTCCAC GGCAGTTCGA GCAGGGTGTC GATGTAGTTG CGTACGACGG 

501 TGGATTCGGC AGACATCGGC GG CAT CAT TT TGAGCTTTTT CAGTTCGGAC 

551 AGGCATTTTT CTTCCGCTTC TTTGGTCATA CCCGCCTTTT TGATATCTGC 

601 TTCCAAGGCA TCCAGTTCGC CGTTTTCGTC TTCTTCGCCC AGTTCTTTGT 

651 GTATCGCTTT AATCTGTTCG TTCAGATAAT ATTCGCGCTG GGATTTTTCC 

7 01 ATTTGGCGTT TGACGCGTCC GCGTATGCGT TTTTCGGCCT GCATAATGTC 

751 GAGTTCGGAT TCCAGCTGTG CCAGCAGGAA TTCCATCCGT TTGCCGATTT 

801 CGGGAATTTC CAAAATCTGT TGGCGTTGCG CCAGTTTCAA CTGCAAATGC 

851 GCTGCGACCG TATCGGTTAG 

This corresponds to the amino acid sequence <SEQ ID 922; ORF 242.a>: 

a242.pep 

1 MIGELVVLLG IKHFEQRAGG IAPEVAXQFV DFVEQEQWVF YAGFCHILQN 

51 LTGHGADIGA AVSPDFAFVA HAAQSHADIF PPRCFGDGFA QRGFAHAWRA 

101 DQAQNRAFEF VHTFLDGEVF QNPFFDFFQA VVVGIQHQSG FGDVFADAGF 

151 FLPRQFEQGV DWAYDGGFG RHRRHHFELF QFGQAFFFRF FGHT RLFDIC 

201 FQGIQFAVFV F FAQFFVY RF NLFVQIIFAL GFFHL AFDAS AYAFFGLHNV 

251 E FGFQLCQQE FHPFADFGNF QNLLALRQFQ LQMRCDRIG* 

m242/a242 95.2% identity in 289 aa overlap 
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a242 QNPFFDFFQAVWGIQHQSGFGDVFADAGFFLPRQFEQGVDWAYDGGFGRHRRHHFELF 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 923>: 

g243.seq 

1 ATGGTaatcg tctGGTTGCc cgAGTTaccg CCGATGCCGG CGACGATGGG 

51 CATCAGCGCG GCGAGTGCGA CGATTTTTTC gatactgcCT TCAAACGCGC 

101 CGATGACGCG GCTGGCGAGG AAGGCGGTGC AGAGGTTGAC GGCGAGCCAC 

151 ATCCAGCGGT TTTTGACGGA ATCCAAGACG GGGGCGAACA GGTCTTCCTC 

201 TTCCTGCAAA CCTGCCATGT TCAACATATC CGCTTCGGAT TCTTCGCGGA 

2 51 TCACGTCCAC CATCTCGTCG ATGGTAATCc tgCCGATGAG CTTTTTGTTT 

301 TCATCAACGA CGGGCGCGGT AACCAAGTCG TAG 

This corresponds to the amino acid sequence <SEQ ID 924; ORF 243. ng>: 

g243.pep 

1 MVIVWLPELP PMPATMGISA ASA TIFSILP SNAPMTRLAR KAVQRLTASH 

51 IQRFLTESKT GANRSSSSCK PAMFNISASD SSR ITSTISS MVILPMSFLF 

101 SSTTGAVTKS * ' 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 925>: 

m243.seq 

1 ATGGTAATCG TCTGGTTGCC CGAGTTACCG CCTATGCCGG CGACGATGGG 

51 CATCAGCGCG GyGAGTGCGA CGATTTTTTC GATGCTGCCT TCAAACGCGC 

101 CGATAACACG GyTGGCGAGG AAGGCGGTGC AGAGGTTGAC GGCGAGCCAC 

151 ATCCAGyGGT TTTTCACCGA ATCCCACACG GGGGCGAAyA GGTCTTCCTC 

2 01 TTCCTGCAAA CCCGCCATAT TCAGCATATC CGCTTCCGAT TCTTCGCGGA 

251 TCACGTCCAC CATCTCGTCG ATGGTAATCC TGCCGATGAG CTTTTTGTTT 

301 TCATCGACGA CGGGCGCGGT . AACCAAGTCG TAG 

This corresponds to the amino acid sequence <SEQ ID 926; ORF 243>: 

m243.pep 

1 MVIVWLPELP PMPATMGISA XSA TIFSMLP SNAPITRLAR KAVQRLTASH 

51 IQXFFTESHT GANRSSSSCK PAIFSISASD SSR ITSTISS MVILPMSFLF 

101 SSTTGAVTKS * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 243 shows 92.7% identity over a 1 10 aa overlap with a predicted ORF (ORF 243.ng) 
from TV. gonorrhoeae: 

m243/g243 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 927>: 

a243 . seq 

1 ATGGTAATCG TCTGGTTGCC CGAGTTACCG CCTATGCCGG CGACGATGGG 

51 CATCAGCGCG GCGAGTGCGA CGATTTTTTC GATGCTGCCT TCAAACGCGC 

101 CGATAACACG GCTGGCGAGG AAGGCGGTGC AGAGGTTGAC GGCGAGCCAC 

151 ATCCAGCGGT TTTTGACGGA ATCCAAGACG GGGGCGAATA AGTCTTCCTC 

201 TTCTTGCAAA CCCGCCATAT TCAACATATC CGCTTCGGAT TCTTCGCGGA 

251 TCACGTCCAC CATTTCGTCA ACGGTCACCC TGCCGATGAG CTTTTTGTTT 

301 TCATCGACGA CGGGCGCGGT AACCAAGTCA TAG 

This corresponds to the amino acid sequence <SEQ ID 928; ORF 243.a>: 

a24 3 . pep 

1 MVIVWLPELP PMPATMGISA ASA TIFSMLP SNAPITRLAR KAVQRLTASH 

51 IQRFLTESKT GANKSSSSCK PAIFNISASD SSRITSTISS TVTLPMSFLF 

101 SSTTGAVTKS * 

m243/a243 92.7% identity in 1 10 aa overlap 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 929>: 

g244 . seq 

1 atgccgcctg aagcccggcc ggcgggttca gacggcattg ccgctttact 

51 tcgatcggtt tatacgcaaa acgcgcttca ggaaataaat cagattattc 

101 cccagacgcc ttcaggcttc cttccgtgcc accgtaacca tagccgggcg 

151 caacacacgg tcggacaggg tataaccctt cttcatcaca ccaaccacgg 

201 tattgggttc ctgctcactg gccaccgcct gcatcgcctg atggatattc 

251 ggatcgagct tatcgcccgc tttaggattg atttccttga tttgcgtggc 

301 atcaaacgcc ttctgcaact cattcaaagt catctgcaca cccattttca 

351 gcgcatcgaa attaccgctc tgatccaaaa gcgccatttc cagataatcc 

401 ttgaccggca acatttccac ggcaaacttc tgtccggcga acttgtgcgt 

451 atcggcaatt tcctgctggt ggcggcggcg caggttttgc tcgtttgcca 

501 aagcgcgcag ttgttcgtct ttcaactgcg cttccagctc ggcaatccgc 

551 gcctgcaaat cctcataagc cggctcggcg gcagcctgtt cctgtacacc 

601 gtccgcattt cctactgtct cgacggtttc caccgcctcc acattttcaa 

651 ccgcttcttc actgttttgc tgctgtgtct gttcgctcat atcgtatccc 

701 tcaaaacaaa ttggaaatca aaatccggtt attacccgag caagataagg 

751 acattttcaa gaaacttcaa gcaaaggcag gaaatttcac atccgccgcc 

801 gaatacccta ccgcaaaaac catataaacg gtaa 

This corresponds to the amino acid sequence <SEQ ID 930; ORF 244.ng>: 

g244 .pep 

1 MPPEARPAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LPCHRNHSRA 

51 QHTVGQGITL LHHTNHGIGF LLTGHRLHRL MDIRIELIAR FRIDFLDLRG 

101 IKRLLQLIQS HLHTHFQRIE ITALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IGNFLLVAAA QVLLV CQSAQ LFVFQLRFQL GNPRLQILIS RLGGSLFLYT 

201 VRISYCLDGF HRLHIFNR FF TVLLLCLFAH IVSLK TNWKS KSGYYPSKIR 

251 TFSRNFKQRQ EISHPPPNTL PQKPYKR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 93 1>: 

m244 . seq 
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1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGCGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACA CCCACCACGG 

201 TATTCGGCTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGGTTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

401 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

451 ATCCGCAATT TyCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCTG CTCGTCTTTC AACTGCGTTT CCAGCTCGGC AATCCGCGCC 

551 TGCAAATCCT CATAAGCCGG CTCTGCGGCA GCCTGTTCCT GCACACCGTC 

601 CGCATTTCCT ACTGTTTCGA CGGTTTCCAC CGCCTCCACA TTTTCAACCG 

651 CTTCTTCACT GTTTTGCTGC TGTGTCTGTT CGCTCATATC GTATCCCTTA 

701 AAACAAATTG GAAATCAAAA TCCAGTTATT ACCCGCGCAA GATAAGGACA 

751 TTTTCAAGAA ACTTCAAkGA AAAkCAGAGA ATTTCAAATT CATTTTCAAA 

801 TCCCCTACCG AAAAAATAAT ATAGACGGTA A 

This corresponds to the amino acid sequence <SEQ ID 932; ORF 244>: 

m244 .pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLRHRNHSRA 

51 QHAVGQRITL LHHTHHGIRL LFACHRLHRL MDIRIELIAR FRVDFLDLRS 

101 IKCFLQLVQS HLHAHFQRIE IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 I RNFLLVAAA QVLLV CQSAL LVFQLRFQLG NPRLQILISR LCGSLFLHTV 

201 RISYCFDGFH RLHIFN RFFT VLLLCLFAHI VSLK TNWKSK SSYYPRKIRT 

251 FSRNFXQXQR ISNSFSNPLP KKXYRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 244 shows 86.3% identity over a 277 aa overlap with a predicted ORF (ORF 244.ng) 
from N. gonorrhoeae: 

M244/G244 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 933>: 

a24 4 . seq 

1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGTGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACG CCCACCACGG 

201 TATTGGGTTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGATTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

4 01 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

4 51 ATCCGCAATT TCCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCAG CTGCTCGTCT TTCAACTGCG CTTCCAGCTC GGCAATCCGC 

551 GCCTGCAAAT CCTCATAAGC CGGCTCTGCG GCAGCCTGTT CCTGCACACC 

601 GTCCGCATTT CCTACTGTCT CGACGGTTTC CACCGCCTCC ACATTTTCAA 

651 CCGCTTCTTC ACTGTTTTGC TGCTGTGTCT GTTCGCTCAT ATCGTATCCC 

701 TTAAAACAAA TTGGAAATCA AAATCCAGTT ATTACCCGCG CAAGATAAGG 

751 ACATTTTCAA GAAACTTCAA GCAAAGGCAG AGAATTTCAA ATTCATTTTC 

801 AAATCCCCTA CCGAAAAAAT AATATAGACG GTAA 

This corresponds to the amino acid sequence <SEQ ID 934; ORF 244.a>: 

a244 .pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLCHRNHSRA 

51 QHAVGQRITL LHHAHHGIGF LFACHRLHRL MDIRIELIAR FRIDFLDLRS 

101 IKCFLQLVQS HLHAHFQRIE IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IRNFLL VAAA QVLLVCQSAQ LLV FQLRFQL GNPRLQILIS RLCGSLFLHT 

201 VRISYCLDGF HRLHIFNR FF TVLLLCLFAH IVSLK TNWKS KSSYYPRKIR 

251 TFSRNFKQRQ RISNSFSNPL PKK*YRR* 

m244/a244 96.8% identity in 277 aa overlap 
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m24 4 .pep GNPRLQI LI SRLCGSLFLHTVRISYCFDGFHRLH I FNRFFTVLLLCLFAH IVSLKTNWKS 

I i I I I I M I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I 
a244 GNPRLQI LI SRLCGSLFLHTVRISYCLDGFHRLH I FNRFFTVLLLCLFAH IVSLKTNWKS 

•190 200 210 220 230 240 

240 250 260 270 

m244 .pep KSSYYPRKIRTFSRNFXQXQRISNSFSNPLPKKXYRRX 

I I I I I I I I I i I I M I I I I I I I I I I I I I I I I I I ! I I I 
a24 4 KSSYYPRKIRTFSRNFKQRQRISNSFSNPLPKKXYRRX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 93 5>: 

g244-l.seq 

1 atgccgcctg aagcccggcc ggcgggttca gacggcattg ccgctttact 
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51 tcgatcggtt tatacgcaaa acgcgcttca ggaaataaat cagattattc 

101 cccagacgcc ttcaggcttc cttccgtgcc accgtaacca tagccgggcg 

151 caacacacgg tcggacaggg tataaccctt cttcatcaca ccaaccacgg 

201 tattgggttc ctgctcactg gccaccgcct gcatcgcctg atggatattc 

251 ggatcgagct tatcgcccgc tttaggattg atttccttga tttgcgtggc 

301 atcaaacgcc ttctgcaact cattcaaagt catctgcaca cccattttca 

351 gcgcatcgaa attaccgctc tgatccaaaa gcgccatttc cagataatcc 

4 01 ttgaccggca acatttccac ggcaaacttc tgtccggcga acttgtgcgt 

451 atcggcaatt tcctgctggt ggcggcggcg caggttttgc tcgtttgcca 

501 aagcgcgcag ttgttcgtct ttcaactgcg cttccagctc ggcaatccgc 

551 gcctgcaaat cctcataagc cggctcggcg gcagcctgtt cctgtacacc 

601 gtccgcattt cctactgtct cgacggtttc caccgcctcc acattttcaa 

651 ccgcttcttc actgttttgc tgctgtgtct gttcgctcat atcgtatccc 

701 tcaaaacaaa ttggaaatca aaatccggtt attacccgag caagataagg 

751 acattttcaa gaaacttcaa gcaaaggcag gaaatttcac atccgccgcc 

801 gaatacccta ccgcaaaaac catataaacg gtaa 

This corresponds to the amino acid sequence <SEQ ID 936; ORF 244-1. ng>: 

g244-l.pep 

1 MPPEARPAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LPCHRNHSRA 

51 QHTVGQGITL LHHTNHGIGF LLTGHRLHRL MDIRIELIAR FRIDFLDLRG 

101 I KRLLQL I QS HLHTHFQRIE ITALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IGNFLLVAAA QVLLV CQSAQ LFVFQLRFQL GNPRLQILIS RLGGSLFLYT 

201 VRISYCLDGF HRLHIFNR FF TVLLLCLFAH IVSLK TNWKS KSGYYPSKIR 

251 TFSRNFKQRQ EISHPPPNTL PQKPYKR* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 93 7>: 

m244-l . seq 

1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGCGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACA CCCACCACGG 

201 TATTCGGCTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGGTTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

4 01 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

4 51 ATCCGCAATT TyCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCTG CTCGTCTTTC AACTGCGTTT CCAGCTCGGC AATCCGCGCC 

551 TGCAAATCCT CATAAGCCGG CTCTGCGGCA GCCTGTTCCT GCACACCGTC 

601 CGCATTTCCT ACTGTTTCGA CGGTTTCCAC CGCCTCCACA TTTTCAACCG 

6 51 CTTCTTCACT GTTTTGCTGC TGTGTCTGTT CGCTCATATC GTATCCCTTA 

701 AAACAAATTG GAAATCAAAA TCCAGTTATT ACCCGCGCAA GATAAGGACA 

751 TTTTCAAGAA ACTTCAAkCA AAAkCAGAGA ATTTCAAATT CATTTTCAAA 

801 TCCCCTACCG AAAAAATAA 



This corresponds to the amino acid sequence <SEQ ID 938; ORF 244- 1>: 

m244-l.pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLRHRNHSRA 

51 QHAVGQRITL LHHTHHGIRL LFACHRLHRL MDIRIELIAR FRVDFLDLRS 

101 I KCFLQL VQS HLHAHFQRIE IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IRNFLLVAAA QVLLV CQSAL LVFQLRFQLG NPRLQILISR LCGSLFLHTV 

201 RISYCFDGFH RLHIFNR FFT VLLLCLFAHI VSLK TNWKSK SSYYPRKIRT 

251 FSRNFXQXQR ISNSFSNPLP KK* 



m244-l/G244-l 86.3% identity in 277 aa overlap 

10 20 30 40 50 60 

m244-l.pep MPSEARQAGSDGIAALLRSVYTQNALQEINQIIPQTPSGFLLRHRNHSRAQHAVGQRITL 

li III llllllllllllllllllllllllllllllllll III III II III 

g244-l MPPEARPAGSDGIAALLRSVYTQNALQEINQIIPQTPSGFLPCHRNHSRAQHTVGQGITL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m244 -1 . pep LHHTHHGIRLLFACHRLHRLMDIRIELIARFRVDFLDLRSIKCFLQLVQSHLHAHFQRIE 

Illhlll :|«: II II! I lllllllllll Ml I llhl I M I I'-l II I hi I I III 
g244-l LHHTNHGIGFLLTGHRLHRLMDIRIELIARFRIDFLDLRGIKRLLQLIQSHLHTHFQRIE 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m244-l .pep IAALIQKRHFQIILDRQHFHGKLLSGELVRIRNFLLVAAAQVLLVCQSAALLVFQLRFQL 

I'H till Nil INI Mill IIIIIIIIMI IIIIIIIIIIIIIIIM hill II II I 

g244-l ITALIQKRHFQIILDRQHFHGKLLSGELVRIGNFLLVAAAQVLLVCQSAQLFVFQLRFQL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m244 -1 . pep GNPRLQILISRLCGSLFLHTVRISYCFDGFHRLHIFNRFFTVLLLCLFAHIVSLKTNWKS 

III Mil Mill llllhllllllhllllMIIIIIIMIIIIIIMMIIIIIMII 

g244-l GNPRLQILISRLGGSLFLYTVRISYCLDGFHRLHIFNRFFTVLLLCLFAHIVSLKTNWKS 

190 200 210 220 230 240 



250 260 270 

m244 -1 . pep KSSYYPRKIRTFSRNFXQXQRISNSFSNPLPKKX 

Ihlll lllllllll I |>lh I Ihl 
g244-l KSGYYPSKIRTFSRNFKQRQEISHPPPNTLPQKPYKRX 

250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 939>: 

a244-l . seq 

1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGTGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACG CCCACCACGG 

201 TATTGGGTTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGATTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

401 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

4 51 ATCCGCAATT TCCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCAG CTGCTCGTCT TTCAACTGCG CTTCCAGCTC GGCAATCCGC 

551 GCCTGCAAAT CCTCATAAGC CGGCTCTGCG GCAGGCTGTT CCTGCACACC 

601 GTCCGCATTT CCTACTGTCT CGACGGTTTC CACCGCCTCC ACATTTTCAA 

651 CCGCTTCTTC ACTGTTTTGC TGCTGTGTCT GTTCGCTCAT ATCGTATCCC 

701 TTAAAACAAA TTGGAAATCA AAATCCAGTT ATTACCCGCG CAAGATAAGG 

751 ACATTTTCAA GAAACTTCAA GCAAAGGCAG AGAATTTCAA ATTCATTTTC 

801 AAATCCCCTA CCGAAAAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 940; ORF 244-1. a>: 

a244-l .pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLCHRNHSRA 

51 QHAVGQRITL LHHAHHGIGF LFACHRLHRL MDIRIELIAR FRIDFLDLRS 

101 IKCFLQLVQS HLHAHFQRIE IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IRNFLL VAAA QVLLVCQSAQ LLV FQLRFQL GNPRLQILIS RLCGSLFLHT 

201 VRISYCLDGF HRLHIFN RFF TVLLLCLFAH IVSLK TNWKS KSSYYPRKIR 

251 TFSRNFKQRQ RISNSFSNPL PKK* 



m244-l/a244-l 96.8% identity in 274 aa overlap 

10 20 30 40 50 60 

m2 4 4-1. pep MPSEARQAGSDGIAALLRSVYTQNALQEINQIIPQTPSGFLLRHRNHSRAQHAVGQRITL 
I I I I I I I ! I I I II I I I I I I I I I I I I I I I I ! I I I I I i I I I I II I I I I I I I I I I I I I I I I I 
a24 4-l MPSEARQAGS DGIAALLRSVYTQNALQEINQI I PQTPSGFLLCHRNHSRAQHAVGQRITL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m24 4-l.pep LHHTHHGIRLLFACHRLHRLMDIRIELIARFRVDFLDLRSIKCFLQLVQSHLHAHFQRIE 
111:1111 : I I I I I I i I I I I i > M I I I I I I I : I 1 I I I M I I I I I I I I I I I I t I M I I I I 
a2 44-l LHHAHHGIGFLFACHRLHRLMDI RI ELI ARFRI DFLDLRS IKCFLQLVQSHLHAHFQRI E 

70 80 90 . 100 110 120 



130 140 150 160 170 179 

m2 4 4 - 1 . pep IAALIQKRHFQI I LDRQHFHGKLLSGELVRIRNFLLVAAAQVLLVCQS A-LLVFQLRFQL 

I M M I M I M I M I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I 1 
a244-l IAALIQKRHFQIILDRQHFHGKLLSGELVRIRNFLLVAAAQVLLVCQSAQLLVFQLRFQL 

130 140 150 160 170 180 

180 190 200 210 220 230 239 

m2 44-1. pep GNPRLQILISRLCGSLFLHTVRISYCFDGFHRLHIFNRFFTVLLLCLFAHIVSLKTNWKS 
I I I I I I I I I I I I N I I I I I I I II I I I : I I I I I I I I I I I I I I I I | M | I | | | | | | | | | | I | 
a24 4-l GNPRLQILISRLCGSLFLHTVRISYCLDGFHRLHIFNRFFTVLLLCLFAHIVSLKTNWKS 
190 200 210 220 230 240 
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240 250 260 270 

m244-l .pep KSSYYPRKIRTFSRNFXQXQRISNSFSNPLPKKX 
I I i I It I I I I I I I 1 f I I I I I I I I I I I I I I | M 
a24 4-l KSSYYPRKIRTFSRNFKQEQRISNSFSNPLPKKX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 941>: 

g246.seq 

1 atgtacgggc ggaacggtag tactcaagcg gccgttgcct tcgttttcga 

51 ccagacacag cgtgcccgtt tcggcaacgg cgaagtttac gccgctcaag 

101 ccgacatcgg cagtgctgta aatatcgcgc agggctttgc gggcgaatcc 

151 ggtcagttgg tccacgtcgt ctgtaagcgg tgtgccgagg ttttggtgga 

201 acagttcgct gacctgttct ttggttttat ggattgcggg catcacgata 

251 tgggtcggtt tttcgcctgc catttggacg ataaactcgc ccaagtcgct 

301 ttccaccgcc ttaatgcctt ttgcttcaag ataatggttc agctcgattt 

351 cttcgctgac catggatttg cctttgacca tcagcttgcc gtttttggct 

401 gtgatgatgt cgtggataat ttggcaggct tcggcagggg tttccgccca 

451 gtgtactttc acgcccaact tagtcaggtt ttcttccaac tgctccagca 

5 01 gcgcgggtaa 

This corresponds to the amino acid sequence <SEQ ID 942; ORF 246.ng>: 

g246.pep 

1 MYGRNGSTQA AVAFVFDQTQ RARFGNGEVY AAQA DIGSAV NIAQGFAGES 

51 GQLVHWCKR CAEVLVEQFA DLFFGFMDCG HHDMGRFFAC HLDDKLAQVA 

101 FHRLNAFCFK IMVQLDFFAD HGFAFDHQLA VFGCDDWDN LAGFGRGFRP 

151 VYFHAQLSQV FFQLLQQRG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 943>: 

m246.seq (partial) 

1 ATGCACGGGC GGTACGGTGG TACTCAAGCG ACCGTTgCTT CGTTTTCCAC 

51 CAGACACAGC GTACCTGTTT CAGCAACGGC AAAGTTTACG CCACTCAAAC 

101 CGACATCGGC AGTGCTGTAA ATATCGCGCA GTGCTTTACG GGCGAAGCCG 

151 GTCAGTTGGT CTACATCGTC TGTCAGCGGC GTACCGAGGT TTTGGTGGAA 

201 CAGTTCGCTA ACCTGTTCTT TGGTTTTGTG GATAGCAGGC ATCACGATAT 

251 GGGTCGGTTT TTCGCCTGCC ATTTGGACGA TGAACTCGCC CAAGTCGCTT 

301 TCTACCGCTT TAATGCyTTT TGCTTCAAGA TAATGrTTCA GCTCGATTTC 

351 CTCGCTGACC ATCGATTTGC CTTTGACCAT CAGCTTGCCG TTTTTGGCTG 

401 TGATGATGTC GTGGATAATT TGGCAGGCTT CGGTCGGGGT TTCTGCCCG. . 

This corresponds to the amino acid sequence <SEQ ID 944; ORF 246>: 

m246.pep (partial) 

1 MHGRYGGTQA TVAFVFHQTQ RTCFSNGKVY ATQT DIGSAV NIAQCFTGEA 

51 GQLVYIVCQR RTEVLVEQFA NLFFGFVDSR HHDMGRFFAC HLDDELAQVA 

101 FYRFNAFCFK IMXQLDFLAD HRFAFDHQLA VFGCDDWDN LAGFGRGFCP. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 246 shows 80.0% identity over a 150 aa overlap with a predicted ORF (ORF 246.ng) 
from N. gonorrhoeae: 

m246/g246 

10 20 30 40 50 60 

m246 . pep MHGRYGGTQATVAFVFHQTQRTCFSNGKVYATQTDIGSAVNIAQCFTGEAGQLVYIVCQR 

1*11 MINIMI 1111= hlhllhhllllllllll 
9246 MYGRNGSTQAAVAFVFDQTQRARFGNGEVYAAQADIGSAVNI AQGFAGES GQLVHWCKR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m24 6 . pep RTEVLVEQFANLFFGFVDSRHHDMGRFFACHLDDELAQVAFYRFNAFCFKIMXQLDFLAD 
Hllllllhllllhl llllllll||||||:||||||:|:|||MIII MlhM 
9246 CAEVLVEQFADLFFGFMDCGHHDMGRFFACHLDDKLAQVAFHRLNAFCFKIMVQLDFFAD 
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70 80 90 100 110 120 

130 140 150 

m2 4 6 . pep HRFAFDHQLAVFGCDDWDNLAGFGRGFCP 
I I I I I I I I I II I I I I I I I I I I I I I I I I I 
9 2 4 6 HGFAFDHQLAVFGCDDWDNLAGFGRGFRPVYFHAQLSQVFFQLLQQRGX 

130 140 150 160 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 945>: 

a246.seq (partial) 

1 ATGCACGGGC GGAACGGTGG TACTCAAGCG ACCGTTGCCT TCGTTTTCCA 
51 ' CCAGACACAG CGTACCTGTT TCAGCAACGG CGAAGTTCAC GCCACTCAAA 

101 CCGACATCGG CAGTGCTGTA AATATCGCGC AGTGCTTTAC GGGCGAAGCC 

151 GGTCAGTTGG TCTACGTCGT CCGTTAACGG TGTGCCGAGG TTTTGGTGGA 

201 ACAGTTCGCT AACCTGTTCT TTGGTTTTAT GGATTGCGGG CATCACGATA 

251 TGGGTCGGTT TTTCACCTGC CATTTGGACG ATGAACTCGC CCAAGTCGCT 

301 TTCCACCGCT TTAATGCCTT TTGCTTCAAG ATAATGGTTC AGCTCGATTT 

351 CCTCGCTGAC CATCGATTTG CCTTTGACCA TCAGCTTGCC GTTTTTGGCT 

4 01 GTGATGATGT CGTGGATGAT TTCGCAGGCT TCGGCCGGTG TTTCCGCCCA 

4 51 GTGTACTTTT ACGCCCAACT TGGTCAGGTT TTCTTCCAGC TGCTCCAGCA 

501 G 

This corresponds to the amino acid sequence <SEQ ID 946; ORF 246.a>: 

a246 .pep (partial) 

1 MHGRNGGTQA TVAFVFHQTQ RTCFSNGEVH ATQTDIGSAV NIAQCFTGEA 

51 GQLVYWR*R CAEVLVEQFA NLFFGFMDCG HHDMGRFFTC HLDDELAQVA 

101 FHRFNAFCFK IMVQLDFLAD HRFAFDHQLA VFGCDDVVDD FAG FGRCFRP 

151 VYFYAQLGQV FFQLLQQ 

m246/a246 88.0% identity in 150 aa overlap 

10 20 30 40 50 60 

m24 6 . pep MHGRYGGTQAT VAFV FHQTQRTC FS NGKV YATQT DIG S AVN I AQC FTGE AGQL V Y I VCQR 

I II i I I I I I I I I I I I I II I I I II I I I : I : I I I I I I I I I I If I I I I I I I I I I M I : I ! 
a24 6 MHGRNGGTQATVAFVFHQTQRTCFSNGEVHATQTDIGSAVNIAQCFTGEAGQLVYVVRXR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 4 6 . pep RTEVLVEQFANLFFGFVDSRHHDMGRFFACHLDDELAQVAFYRFNAFCFKIMXQLDFLAD 
: I I I I I I I I I I I I I I : I I I I I f II I : I I I I I I I 1 ! I I I : I I I I I ! I I I I I I I I I I I 
a24 6 CAEVLVEQFANLFFGFMDCGHHDMGRFFTCHLDDELAQVAFHRFNAFCFKIMVQLDFLAD 

70 80 90 100 110 120 

130 140 150 

m2 4 6 . pep HRFAFDHQLAVFGCDDWDNLAGFGRGFCP 
I I I I I II 1 I I I I I I I I I I I: : I I I I I 1 ! 
a2 4 6 HRFAFDHQLAVFGCDDVVDDFAGFGRCFRPVYFYAQLGQVFFQLLQQ 

130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 947>: 

g247.seq 

1 atgaaacgta aaatgctaaa cgtaccaaag ggcggttatg atggtatgaa 

51 gggttttacc attgttgaat ttctggttgc gggcctgctc agtataattg 

101 tcctgatagc ggtcgtatcg agttacttta catcccggaa attaaatgat 

151 gtggcaaacg agcgtcttgc cattcaacag gatttgcgga atgcggcaac 

201 attaattgtc cgcgatgcaa gaatggcggg gagcttcggt tgtttcaata 

251 tgtccgagca tactaaagac gatattgttg attcaagtaa tcaaactcaa 

301 tctaaccttg caaaacccgg tgccaaacaa gaaaatcccc ttttttcctt 

351 aaaaaggagc ggcatggata aacaactgat tcccgttgct gaatccatag 

401 atattaaata tccgggtttt atccagcgcc ttaacgcatt ggttttccaa 

451 tacggtatcg atgatcttga tgcgagtgct gagactgttg tagtcagcag 

501 ctgttccaaa atagcaaaac cgggtaagaa aatatctacc ttgcaagaag 

551 caaagagtgc attacagatt actaatgatg ataaacaaaa tggaaatatc 
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601 acccgtcaga aacatgtggt caatgcctat gcggtcggca ggtttggcaa 

651 taatgaggaa agtttgttcc gcttccaatt ggatgataag ggcaagtggg 

701 gtaatcctca gttgctcgtg aaaaaggtta aacgtatgga tgtgcggtat 

751 atttatgttt ccggttgtcc tgaagatgaa gatgccggca aagaggaaaa 

801 attcagatat acgaataaat tcgacaaatc caaaaatgct gttacgcctg 

851 ccggggtgga ggttttattg gatagcggcc ttaatgccaa gattgccgct 

901 tcttcagaca atagtattta tgcttaccgt atcaatgcga caatacgcgg 

951 gggaaatgta tgcgcaaaca gaacactttg a 



This corresponds to the amino acid sequence <SEQ ID 948; ORF 247.ng>: 

g247.pep 

1 MKRKMIiNVPK GGYDGMKGFT IVEFLVAGLL SIIVLIAWS SYFTSRKLND 



51 VANERLAIQQ DLRNAATLIV RDARMAGSFG CFNMSEHTKD DIVDSSNQTQ 

101 SNLAKPGAKQ ENPLFSLKRS GMDKQLIPVA ESIDIKYPGF IQRLNALVFQ 

151 YGIDDLDASA ETVWSSCSK IAKPGKKIST LQEAKSALQI TNDDKQNGNI 

2 01 TRQKHWNAY AVGRFGNNEE SLFRFQLDDK GKWGNPQLLV KKVKRMDVRY 
251 IYVSGCPEDE DAGKEEKFRY TNKFDKSKNA VTPAGVEVLL DSGLNAKIAA 

3 01 SSDNSIYAYR INATIRGGNV CANRTL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 949>: 
m247.seq (partial) 

1 ATsAGACGTA AAATGCTAAA CGTwsyArAA GGCAGTTATG ATGGTATGAA 



51 AGGTTTTACC ATTATTGAAT TTTTGGTTGC GGGCCTGCTC AGTATGATTG 

101 TCCTGATGGC GGTCGGATCG AGTTACTTCA CATCCCGGAA ATTAAATGAT 

151 GCGGCAAACG AGCGTCTTGC CGCGCAACAG GATTTGCGGA ATGCGGCAAC 

201 ATTGATTGTC CGCGATGCGA GAATGGCAGG CGGCTTCGGT TGTTTCAATA 

251 TGTCCGAGCA TCCTGCAACT GATGTTATTC CCGATACGAC GCAACAAAAT 

3 01 TCTCCTTTTT CCTTAAAAAG GAACGGTATA GATAAACTTA TT C C CAT AG C 
351 GGAATCTTCA AATATCAATT ATCAGAATTT TTTCCAGGTT GGTAGCGCAT 

4 01 TGATTTTTCA ATACGGAATC GATGATGTTA ATGCAAGCAC CGCGACTACC 
451 GTCGTCAGCA GCTGTGCCGC AATATCGAAA CCGGGCAAGC AAATCCCTAC 
501 TTTAGAAGAT GCAAAAAAAG AATTGAAGAT TCCGGATCAG GATAAGGAGC 
551 AAAATGGCAA TATAGCGCGT CAAAGGCATG TGGTCAATGC CTATGCGGTC 
601 GGCAGGATTG CCGATGAGGA AAGTTTGTTC CGCTTCCAAT TGGATGATAA 
651 GGGCAAGTGG GGTAATCCTC AGTTGC. . . 



This corresponds to the amino acid sequence <SEQ ID 950; ORF 247>: 
m247 .pep (partial) 

1 XRRKMLNVXX GSYDGMKGFT IIEFLVAGLL SMIVLMAVGS SYFTSRKLND 



51 AANERLAAQQ DLRNAATLIV RDARMAGGFG CFNMSEHPAT DVIPDTTQQN 

101 SPFSLKRNGI DKLIPIAESS NINYQNFFQV GSALIFQYGI DDVNASTATT 

151 WSSCAAISK PGKQIPTLED AKKELKIPDQ DKEQNGNIAR QRHWNAYAV 

201 GRIADEESLF RFQLDDKGKW GNPQL 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 247 shows 69.3% identity over a 238 aa overlap with a predicted ORF (ORF 247.ng) 
from N. gonorrhoeae: 
m247/g247 



m24 7 . pep 



10 20 30 40 50 60 

XRRKMLNVXXGSYDGMKGFTI I EFLVAGLLSMIVLMAVGS SYFTSRKLND AANERLAAQQ 



g247 




10 20 30 40 50 60 



m24 7 .pep 



70 80 90 100 
DLRNAATLI VRDARMAGGFGCFNMSEHPATDVI PDTTQQNSPFSLKRN 



g247 




70 80 90 100 110 120 



m24 7 .pep 



110 120 130 140 150 160 

GIDK-LIPIAESSNINYQNFFQVGSALIFQYGIDDVNASTATTWSSCAAISKPGKQIPT 
1 = 1 I llhlll :|»| :|:| : | | : | | | | | | | s = | | : | : | | | | | : |: | || |: | | 



rirr/usyy/uyMo 
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g24 7 GMDKQL I PVAES I DI KYPGF I QRLNTUjVFQYGI DDLDAS AETVWSSCS KI AKPGKKI ST 

130 140 150 160 170 180 

170 180 190 200 210 220 

m247 . pep LEDAKKELKI PDQDKEQNGNI ARQRHVVNAYAVGRIAD- EESLFRFQLDDKGKWGNPQL 

|::|h hi ::|| I I I I I - I I ' I I II I I I I I h - I I I I I I I I I I I I I I I I I I I I 
. g24 7 LQEAKSALQITNDDK-QNGNITRQKHWNAYAVGRFGNNEESLFRFQLDDKGKWGNPQLL 

190 200 210 220 230 

g24 7 VKKVKRMDVRYIYVSGCPEDEDAGKEEKFRYTNKFDKSKNAVTPAGVEVLLDSGLNAKIA 
240 250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 95 1>: 

a247 . seq 

1 ATGAGACGTA AAATGCTAAA CGTACCAAAA GGCAATTATG ATGGTATGAA 

51 GGGTTTTACC ATTATTGAAT TTTTGGTTGC GGGCATGCTC AGTATGATTG 

101 TCCTGATGGC GGTCGGATCG AGTTACTTCA CATCCCGGAA ATTAAATGAT 

151 GCGGCAAACG AGCGTCTTTC CGCGCAACAG GATTTGCGGA ATGCGGCAAC 

201 ATTGATTGTC CGCGATGCAA GAATGGCAGG GGGCTTCGGT TGTTTCAATA 

251 TGTCCGAGCA TACTAAAAAT GAT AT TAT TG TTGATCCAAG TAAGCAAACT 

301 CAACATGTCC CTGTAAAACC CGGTGCCAAA CAAGAAAATC CCCTTTTTTC 

351 TTTAGAGTGG GCTAATACTA ATAATACTAA TAATAATACA GCTAAATTGA 

4 01 TTCCTATTGC TGAATCCACA GATATTAAAT ATCCGGGTTT TGCCCAGGCT 

4 51 CGTCCGGCAT TGATTTTCCA ATACGGCATC GATGATCTTG ATGCGAGTGC 

501 TGAGACTGTT GTAGTCAGCA GCTGTTCCAA AATAGCAAAA CCGGGTAAGA 

551 AAATATCTAC CTTGCAAGAA GCAAAGAGTG CATTACAGAT TACTAATGAT 

601 GATAAACAAA ATGGAAATAT CACCCGTCAA AGGCATGTGG TCAATGCCTA 

651 TGCGGTCGGC AGGATTGCCG GTGAGGAAGG TTTGTTCCGC TTCCAATTGG 

701 ATGATAAGGG CAAGTGGGGT AATCCTCAGT TGCTCGTGAA AAAGATTAGA 

751 CAT AT G AAAG TGCGGTATAT CTATGTTTCC GACTGTCCTG AAGATGACGA 

801 TGCCGGCAAA GAGGAAAAAT TCAAATATAC GGGTACATTC GACAGCTCCA 

851 CAAATGCTGT TACGCCCGCC GGGGTGGAGG TTTTATTGAG TANCGGTACT 

901 GATACCAAGA TTGCCGCTTC TTCAGACAAT CATATTTATG CTTACCGTAT 

951 CGATGCGACA ATACGCGGGG GAAATGTATG CGCAAACAGA ACACTTTGA 

This corresponds to the amino acid sequence <SEQ ID 952; ORF 247.a>: 

a247 .pep 

1 MRRKMLNVPK GNYDGMKGFT IIEFLVAGML SMIVLMA VGS SYFTSRKLND 

51 AANERLSAQQ DLRNAATLIV RDARMAGGFG CFNMSEHTKN DIIVDPSKQT 

101 QHVPVKPGAK QENPLFSLEW ANTNNTNNNT AKLIPIAEST DIKYPGFAQA 

151 RPALIFQYGI DDLDASAETV WSSCSKIAK PGKKISTLQE AKSALQITND 

201 DKQNGNITRQ RHWNAYAVG RIAGEEGLFR FQLDDKGKWG NPQLLVKKIR 

251 HMKVRYIYVS DCPEDDDAGK EEKFKYTGTF DSSTNAVT PA GVEVLLSXGT 

301 DTKIAASSDN HIYAYRIDAT IRGGNVCANR TL* 

m247/a247 70.9% identity in 244 aa overlap 

10 20 30 40 50 60 

m24 7 .pep XRRKMLNVXXGSYDGMKGFTIIEFLVAGLLSMIVLMAVGSSYFTSRKLNDAANERLAAQQ 
I I I I I I I I : I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I ! I I I II I I I I I I : I I I 
a247 MRRKMLNVPKGNYDGMKGFTIIEFLVAGMLSMIVLMAVGSSYFTSRKLNDAANERLSAQQ 

10 20 30 40 50 60 

70 80 90 100 

m247 .pep D LRN AAT L I VR D ARMAGG FG C FNM S E H PAT DV I PDTTQQNSPFSLK- 

I I I I I I I I II I I 1 I I I I I I I I I I I I I I : I : I I : I : I III: 

a247 DLRNAATLIVRDARMAGGFGCFNMSEHTKNDIIVDPSKQTQHVPVKPGAKQENPLFSLEW 

70 80 90 100 110 120 

110 120 130 140 150 160 

m247 .pep RNGIDKLIPIAESSNINYQNFFQVGSALIFQYGIDDVNASTATTVVSSCAAISK 

I : II I I I I I I : : I : I : I I : I I I I I I I I II : : I I : I : I I I I I : I : I 
a24 7 ANTNNTNNNTAKLIPIAESTDIKYPGFAQARPALIFQYGIDDLDASAETWVSSCSKIAK 

130 140 150 160 170 180 



rci/t>s*y/iw»o 
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170 180 190 200 210 220 

m24 7 . pep PGKQIPTLEDAKKELKIPDQDKEQNGNIARQRHWNAYAVGRIADEESLFRFQLDDKGKW 

I I I : I I I :: I I : hi : : I I I I I I I : I I I I I I I I I I I M I I H : I I I I I I I I I I I I 
a24 7 PGKKISTLQEAKSALQITNDDK-QNGNITRQRHVVNAYAVGRIAGEEGLFRFQLDDKGKW 

190 200 210 220 230 



m2 47.pep GNPQL 
Mill 

a 24 7 GNPQLLVKKIRHMKVRYIYVSDCPEDDDAGKEEKFKYTGTFDSSTNAVTPAGVEVLLSXG 
240 250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 953>: 

g247-l.seq (partial) 

1 CCCGGTGCCA AACAAGAAAA TCCCCTTTTT TCCTTAAAAA GGAGCGGCAT 

51 GGATAAACAA CTGATTCCCG TTGCTGAATC CATAGATATT AAATATCCGG 

101 GTTTTATCCA GCGCCTTAAC GCATTGGTTT TCCAATACGG T AT C GAT GAT 

151 CTTGATGCGA GTGCTGAGAC TGTTGTAGTC AGCAGCTGTT CCAAAATAGC 

201 AAAACCGGGT AAGAAAATAT CTACCTTGCA AGAAGCAAAG AGTGCATTAC 

251 AGATTACTAA TGATGATAAA CAAAATGGAA ATATCACCCG TCAGAAACAT 

301 GTGGTCAATG CCTATGCGGT CGGCAGGTTT GGCAATAATG AGGAAAGTTT 

351 GTTCCGCTTC CAATTGGATG ATAAGGGCAA GTGGGGTAAT CCTCAGTTGC 

4 01 TCGTGAAAAA GGTTAAACGT ATGGATGTGC GGTATATTTA TGTTTCCGGT 

4 51 TGTCCTGAAG ATGAAGATGC CGGCAAAGAG GAAAAATTCA GATATACGAA 

501 TAAATTCGAC AAATCCAAAA ATGCTGTTAC GCCTGCCGGG GTGGAGGTTT 

551 TATTGGATAG CGGCCTTAAT GCCAAGATTG CCGCTTCTTC AGACAATAGT 

601 ATTTATGCTT ACCGTATCAA TGCGACAATA CGCGGGGGAA ATGTATGCGC 

651 AAACAGAACA CTTTGA 

This corresponds to the amino acid sequence <SEQ ID 954; ORF 247-1 .ng>: 

g247-l.pep (partial) 

1 PGAKQENPLF SLKRSGMDKQ LIPVAESIDI KYPGFIQRLN ALVFQYGIDD 

51 LDASAETVVV SSCSKIAKPG KKISTLQEAK SALQITNDDK QNGNITRQKH 

101 VVNAYAVGRF GNNEESLFRF QLDDKGKWGN PQLLVKKVKR MDVRYIYVSG 

151 CPEDEDAGKE EKFRYTNKFD KSKNAVTPAG VEVLLDSGLN AKIAASSDNS 

201 IYAYRINATI RGGNVCANRT L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 955>: 

m247-l . seq 

1 ATGAGACGTA AAATGCTAAA CGTACCAAAA GGCAGTTATG ATGGTATGAA 

51 AGGTTTTACC ATTATTGAAT TTTTGGTTGC GGGCCTGCTC AGTATGATTG 

101 TCCTGATGGC GGTCGGATCG AGTTACTTCA CATCCCGGAA ATTAAATGAT 

151 GCGGCAAACG AGCGTCTTGC CGCGCAACAG GATTTGCGGA ATGCGGCAAC 

201 ATTGATTGTC CGCGATGCGA GAATGGCAGG CGGCTTCGGT TGTTTCAATA 

251 TGTCCGAGCA TCCTGCAACT GATGTTATTC CCGATACGAC GCAACAAAAT 

301 TCTCCTTTTT CCTTAAAAAG GAACGGTATA GATAAACTTA TTCCCATAGC 

351 GGAATCTTCA AATATCAATT ATCAGAATTT TTTCCAGGTT GGTAGCGCAT 

401 TGATTTTTCA ATACGGAATC GATGATGTTA ATGCAAGCAC CGCGACTACC 

451 GTCGTCAGCA GCTGTGCCGC AATATCGAAA CCGGGCAAGC AAATCCCTAC 

501 TTTAGAAGAT GCAAAAAAAG AATTGAAGAT TCCGGATCAG GATAAGGAGC 

551 AAAATGGCAA TATAGCGCGT CAAAGGCATG TGGTCAATGC CTATGCGGTC 

601 GGCAGGATTG CCGATGAGGA AGGTTTGTTC CGCTTCCAAT TGGATGATAA 

651 GGGCAAGTGG GGTAATCCTC AGTTGCTCGT GAAAAAGGTT AGACATATGA 

701 AAGTGCGGTA TATCTATGTT TCCGGCTGTC CTGAAGATGA CGATGCCGGC 

7 51 AAAGAGGAAA CATTCAAATA TACGGATAAA TTCGACAGCG CCCAAAATGC 

801 TGTTACGCCC GCCGGGGTGG AGGTTTTATT GAGTAGCGGT ACTGATACCA 

851 AGATTGCCGC TTCTTCAGAC AATCATATTT ATGCTTACCG TATCGATGCG 

901 ACAATACGCG GGGGAAATGT ATGCGCAAAC AGAACACTTT GA 

This corresponds to the amino acid sequence <SEQ ID 956; ORF 247-l>: 

m247-l .pep 

1 MRRKMLNVPK GSYDGMKGFT IIEFLVAGLL SMIVLMAVGS SYFTSRKLND 

51 AANERLAAQQ DLRNAATLIV RDARMAGGFG CFNMSEHPAT DVIPDTTQQN 

101 SPFSLKRNGI DKLIPIAESS NINYQNFFQV GSALIFQYGI DDVNASTATT 

151 VVSSCAAISK PGKQIPTLED AKKELKIPDQ DKEQNGNIAR QRHVVNAYAV 

201 GRIADEEGLF RFQLDDKGKW GNPQLLVKKV RHMKVRYIYV SGCPEDDDAG 

251 KEETFKYTDK FDSAQNAVTP AGVEVLLSSG TDTKIAASSD NHIYAYRIDA 

301 TIRGGNVCAN RTL* 
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m247-l / g247-l 72.1% identity in 222 aa overlap 

70 80 90 100 110 120 

m247-l .pep NAATLIVRDARMAGGFGCFNMSEHPATDVIPDTTQQNSPFSLKRNGIDK-LIPIAESSNI 

I : I : I I I I I I : I : I I : I 

g247-l PGAKQENPLFSLKRSGMDKQLIPVAESIDI 

10 20 30 

130 140 150 160 170 180 

m247-l.pep NYQNFFQVGSALIFQYGIDDVNASTATTVVSSCAAISKPGKQIPTLEDAKKELKIPDQDK 
: I = ■ = i |:|||||: |:||||:| ||::||: |:| ::f | 

g24 7-l KYPGFIQRLNALVFQYGIDDLDASAETVVVSSCSKIAKPGKKISTLQEAKSALQITNDDK 
40 50 60 70 80 90 

190 200 210 220 230 240 

m247-l . pep EQNGNIARQRHVVNAYAVGRIAD-EEGLFRFQLDDKGKWGNPQLLVKKVRHMKVRYIYVS 

I I I I I :| I : II I I III I II::: I I : II I I I I I I I I I I I I I I I ::| I I j II I 1 

g247-l -QNGNITRQKHVVNAYAVGRFGNNEESLFRFQLDDKGKWGNPQLLVKKVKRMDVRYIYVS 
100 110 ■ 120 130 140 

250 260 270 280 290 300 

m247-l.pep GCPEDDDAGKEETFKYTDKFDSAQNAVTPAGVEVLLSSGTDTKI AAS SDNH I YAYRI DAT 
11111:11111! CI l:IM:::|| IIIIIM I I 1:11 = = I I I I t I I J lllllhll 
g247-l GCPEDEDAGKEEKFRYTNKFDKSKNAVTPAGVEVLLDSGLNAKIAASSDNSIYAYRINAT 
150 160 170 180 190 200 

310 

m2 47-1. pep I RGGN VCANRTLX 
I I I I I I I I I I I I I 
g2 4 7 - 1 I RGGNVCANRTLX 

210 220 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 957>: 

a247-l.seq (partial) 

1 AATAATACAG CTAAATTGAT TCCTATTGCT GAATCCACAG ATATTAAATA 

51 TCCGGGTTTT GCCCAGGCTC GTCCGGCATT GATTTTCCAA TACGGCATCG 

101 ATGATCTTGA TGCGAGTGCT GAGACTGTTG TAGTCAGCAG CTGTTCCAAA 

151 ATAGCAAAAC CGGGTAAGAA AATATCTACC TTGCAAGAAG CAAAGAGTGC 

201 ATTACAGATT ACTAATGATG ATAAACAAAA TGGAAATATC ACCCGTCAAA 

2 51 GGCATGTGGT CAATGCCTAT GCGGTCGGCA GGATTGCCGG TGAGGAAGGT 

301 TTGTTCCGCT TCCAATTGGA TGATAAGGGC AAGTGGGGTA ATCCTCAGTT 

351 GCTCGTGAAA AAGATTAGAC ATATGAAAGT GCGGTATATC TATGTTTCCG 

401 ACTGTCCTGA AGATGACGAT GCCGGCAAAG AGGAAAAATT CAAATATACG 

451 GGTACATTCG ACAGCTCCAC AAATGCTGTT ACGCCCGCCG GGGTGGAGGT 

501 TTTATTGAGT AGCGGTACTG ATACCAAGAT TGCCGCTTCT TCAGACAATC 

551 ATATTTATGC TTACCGTATC GATGCGACAA TACGCGGGGG AAATGTATGC 

601 GCAAACAGAA CACTTTGA 

This corresponds to the amino acid sequence <SEQ ID 958; ORF 247-1. a>: 

a247-l.pep (partial) . . 

1 NNTAKLIPIA ESTDIKYPGF AQARPALIFQ YGIDDLDASA ETVVVSSCSK 
51 IAKPGKKIST LQEAKSALQI TNDDKQNGNI TRQRHVVNAY AVGRIAGEEG 
101 LFRFQLDDKG KWGNPQLLVK KIRHMKVRYI YVSDCPEDDD AGKEEKFKYT 
151 GTFDSSTNAV TPAGVEVLLS SGTDTKIAAS SDNHIYAYRI DATIRGGNVC 
201 ANRTL* 

m247-l / a247-l 80.6% identity in 206 aa overlap 

10 20 30 

a247-l .pep NNTAKLIPIAESTDIKYPGFAQARPALIFQ 

I: I I I I I I I I :: I : I : I I : Mill 
m247~l GFGCFNMSEHPATDVIPDTTQQNSPFSLKRNGIDKLIPIAESSNINYQNFFQVGSALIFQ 
80 90 100 110 120 130 

40 50 60 70 80 89 

a247-l.pep YGIDDLDASAETVVVSSCSKIAKPGKKISTLQEAKSALQITNDDK-QNGNITRQRHVVNA 

IIIM::||: MIIMI: |:||||:| l|::||: |:| ::|| I I I I I : I I I I I I I I 
m2 4 7 - 1 YGI DDVNAST ATT VVSSCAAI SKPGKQI PTLEDAKKELKI PDQDKEQNGN I ARQRH VVNA 

140 150 160 170 180 190 

90 100 110 120 130 140 149 

a247-l.pep YAVGRIAGEEGLFRFQLDDKGKWGNPQLLVKKIRHMKVRYIYVSDCPEDDDAGKEEKFKY 
IIMIM Hill Ml Mil II I 1111:1 I I II I I I I I I I I I I I I III 
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m247-l YAVGRIADEEGLFRFQLDDKGKWGNPQLLVKKVRHMKVRYIYVSGCPEDDDAGKEETFKY 
200 210 220 230 240 250 

150 160 170 180 190 200 

a247-l.pep TGTFDSSTNAVTPAGVEVLLSSGTDTKIAASSDNHIYAYRIDATIRGGNVCANRTLX 

I HI: I I I I I I I I I I i I I I I I I I ! I 1 I I I I I I I I I I I i I I I I i I I I I I | 

m247-l TDKFDSAQNAVTPAGVEVLLSSGTDTKIAASSDNHIYAYRIDATIRGGNVCANRTLX 
260 270 280 290 300 310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 959>: 

g24 8.seq 

1 atgcgcaaac agaacacttt gacaggaatc ccgacttctg acggacagag 

51 ggggtccgca ctgtttatcg tgctgatggt gatgatagtc gtggcctttt 

101 tggttgtaac tgccgcccag tcctacaata ccgaacagag gatcagtgcc 

151 aacgaatcag acaggaaatt ggctttgtct ttagccgagg cggctttgcg 

201 ggagggcgaa tttcaggttt tggatttgga atatgctgcg gacagtaagg 

251 ttacgtttag cgaaaactgt gaaaaaggtc tgtgtaccgc agtgaatgtg 

3 01 cggacaaata ataatggtag tgaagaggct tttggcaata tcgtggtgca 

351 aggcaagccc gccgttgagg cggtgaaacg ttcttgccct gcaaagtctg 

401 gcaaaaattc taccgacctg tgcattgaca ataaagggat ggaatataat 

451 aaaggcgcgg caggcgtcag caaaatgccg cgctatatta tcgaatattt 

501 aggcgtgaag aacggacaaa atgtttatcg ggttactgcc aaggcttggg 

551 gtaagaatgc caataccgtg gtcgtccttc aatcttatgt aggcaataat 

601 gatgagcaat aa 

This corresponds to the amino acid sequence <SEQ ID 960; ORF 248. ng>: 

g24 8.pep 

1 MRKQNTLTGI PTSDGQRGS A LFIVLMVMIV VAFLW TAAQ SYNTEQRISA 

51 NESDRKLALS LAEAALREGE FQVLDLEYAA DSKVTFSENC EKGLCTAVNV 

101 RTNNNGSEEA FGNIWQGKP AVEAVKRSCP AKSGKNSTDL CIDNKGMEYN 

151 KGAAGVS KMP RYIIEYLGVK NGQNVYRVTA KAWGKNANTV WLQSYVGNN 

201 DEQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 96 1>: 

m248.seq (partial) 

1 . . GGGTTTGCAC TGTTAATCGT GCTGATGGTG ATrATCGTCG TGGCT.TywT 

51 gGwTGTAACT GCCGCGCAGT CTTACAATAC cGAGCAGCGk ATCAGTkCCA 

101 ACGAATCAGA CAGGAAATTG GCTwTGTCTT TGGCCGAGkC GkCTwTGCGG 

151 GAAGGCGAAC TTCAGGTTTT GGATTTGGAA TATGATACGG ACAGTAAGGT 

201 TACATTTAGC GAAAACTGTG GAAAAGGTCT GTsTGCCGCA GTGAATGTGC 

251 GGACAAATAA TGATAATGAA GAGGCTTTTG ACAATATCGT GGTGCAAGGC 

301 AAGCCCACCG TTGAGGCGGT GAAGCGTTCT TGCCCTGCAA ATTCTACCGA 

351 CCTGTGCATT GACAAGAAAG GGwTGGAATA TAAGAAAGGC ACGAGAAGCG 

401 TCAc.AAAAT GCCACGTTAT ATTATCGAAT ATTTGGGCGT GwAGAACGGA 

451 GAAAATGTTT ATCGGGTTAC TGCCAAGGCT TGGGGtAAGA ATGCCAATAC 

501 CGTGGTCGTC CTTCAATCTT ATGTAAGCAA TAATGATGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 962; ORF 248>: 

m248.pep (partial) 

1 . . GF ALLIVLMV XIWAFXXV T AAQSYNTEQR ISXNESDRKL AXSLAEXXXR 

51 EGELQVLDLE YDTDSKVTFS ENCGKGLXAA VNVRTNNDNE EAFDNIWQG 

101 KPTVEAVKRS CPANSTDLCI DKKGXEYKKG TRSVTKMPRY IIEYLGVXNG 

151 ENVYRVTAKA WGKNANTVW LQSYVSNNDE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 248 shows 81.1% identity over a 185 aa overlap with a predicted ORF (ORF 248.ng) 
from N. gonorrhoeae: 

m248/g248 

10 20 30 40 

m24 8 .pep GFALLIVIJWXIVVAFXXVTAAQSYNTEQRISXNESDRKLAXS 

I Ihlllll Mill llllllllllllll llllllll | 



WO 
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9248 MRKQNTLTGI PTSDGQRGSALFI VLMVMI WAFLWTAAQS YNTEQRI SANESDRKLALS 

10 20 30 40 50 60 

50 60 70 80 90 100 

m2 4 8 . pep LAEXXXREGELQVLDLE YDTDS KVTFS ENCGKGLXAAVNVRTNND - NEE AFDNI WQGKP 

III MINIUM! MMIIIIIII III :||||||||: Mill MINIM 

g2 4 8 LAEAALREGEFQVLDLEYAADSKVTFSENCEKGLCTAVNVRTNNNGSEEAFGNI WQGKP 

70 80 90 100 110 120 



HO 120 130 140 150 

m2 4 8 . pep TVEAVKRSCPA NSTDLCIDKKGXEYKKGTRSVTKMPRYI IEYLGVXNGENVYRVTA 

: II 1 1 II 1 1 II MINIM Ihlh MMIMMIIIMI IhlllllM 

924 8 AVEAVKRSCPAKSGKNSTDLCIDNKGMEYNKGAAGVSKMPRYI I E YLGVKNGQNVYRVTA 

130 140 150 160 170 180 



160 170 180 

m2 4 8 . pep KAWGKNANTVWLQS YVSNNDEX 

I I I I I I! I I I I I I I I I I = i I M 
g24 8 KAWGKNANTVWLQS YVGNNDEQX 

190 200 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 963>: 

a2 48.seq 

1 ATGCGCAAAC AGAACACTTT GACGGGAATC CCGACTTCTG AC GG AC AG AG 

51 GGGGTTTGCA CTGTTTATCG TGCTGATGGT GATGATCGTC GTGGCTTTTT 

101 TGGTTGTAAC TGCCGCGCAG TCTTACAATA CCGAGCAGCG GATCAGTGCC 

151 AACGAATCAG ACAGGAAATT GGCTTTGTCT TTGGCCGAGG CGGCTTT.GCG 

201 GG AAGGCGAA CTTCAGGTTT TGGATTTGGA ATATGATACG GACAGTAAGG 

251 TTACATTTAG CGAAAACTGT GGAAAAGGTC TGTGTACCGC AGTGAATGTG 

301 CGGACAAATA ATGATAATGA AGAGGCTTTT GACAATATCG TGGTGCAAGG 

351 CAAGCCCACC GTTGAGGCGG TGAAGCGTTC TTGCACTGCA AAATCTACAG 

4 01 GCCTGTGCAT TGACAATAAA GGGATGGAAT ATAAGAAAGG CACGCAAAGC 

4 51 GTCAGCAAAA TGCCACGTTA TATTATCGAA TATTTGGGCG TGAAGAACGG 

501 AGAAAATGTT TATCGGGTTA CTGCCAAGGC TTGGGGTAAG AATGCCAATA 

551 CCGTGGTCGT CCTTCAATCT TATGTAAGCA ATAATGATGA GTAA 

This corresponds to the amino acid sequence <SEQ ID 964; ORF 248. a>: 

a248 .pep 

1 MRKQNTLTGI PTSDGQRGF A LFIVLMVMIV VAFLW TAAQ S YNTEQRI SA 

51 NESDRKLALS LAEAALREGE LQVLDLEYDT DSKVTFSENC GKGLCTAVNV 

101 RTNN DNEEAF DNIVVQGKPT VEAVKRSCTA KSTGLCIDNK GMEYKKGTQS 

151 VSKMPRYIIE YLGVKNGENV YRVTAKAWGK NANTWVLQS YVSNNDE* 

m248/a248 89.4% identity in 180 aa overlap 

10 20 30 40 

m24 8.pep GFALLIVLMVXIWAFXXVTAAQSYNTEQRISXNESDRKLAXS 

Mlhlllll Mill I I I I I I I I I I I I I I I I I I | I I i i 
a24 8 MRKQNTLTGIPTSDGQRGFALFIVLMVMIWAFLWTAAQSYNTEQRISANESDRKLALS 

10 20 30 40 50 60 

50 60 70 80 90 100 

m24 8 . pep LAEXXXREGELQVLDLE YDTDSKVTFSENCGKGLXAAVNVRTNNDNEEAFDNIVVQGKPT 

Ml I I I I I I I I I I ! i M I I I I i I I I I I I I I I : I I I I I I I I I I I | | | | | | | | | | | | | 
a2 4 8 LAEAALREGELQVLDLEYDTDSKVTFSENCGKGLCTAVNVRTNNDNEEAFDNI WQGKPT 

70 80 90 100 110 120 



HO 120 130 140 150 160 

m24 8.pep VEAVKRSCPANSTDLCIDKKGXEYKKGTRSVTKMPRYIIEYLGVXNGENVYRVTAKAWGK 
Mill Ml 1:11 Nihil I I I I I I : II : I I I If i M I I I I I I I I I 1 I I I I I I I ! I 
a24 8 VEAVKRSCTAKSTGLCIDNKGMEYKKGTQSVSKMPRYIIEYLGVKNGENVYRVTAKAWGK 

130 140 150 160 170 180 



m24 8 .pep 



170 180 
NANTWVLQS YVSNNDEX 
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1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

a2 4 8 NANTVWLQSYVSNNDEX 

190 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 965>: 

m248-l . seq 

1 ATGCGCAAAC 

51 GGGGTTTGCA 

101 TGGTTGTAAC 

151 AACGAATCAG 

201 GGAAGGCGAA 

251 TTACATTTAG 

301 CGGACAAATA 

351 CAAGCCCACC 

401 ACCTGTGCAT 

451 GTCAGCAAAA 

501 AGAAAATGTT 

551 CCGTGGTCGT 

This corresponds to the amino acid sequence <SEQ ID 966; ORF 248- 1>: 

m248-l.pep 

1 MRKQNTLTGI PTSDGQRGF A LFIVLMVMIV VAFLVV TAAQ SYNTEQRISA 
51 NESDRKLALS LAEAALREGE LQVLDLEYDT DSKVTFSENC GKGLCAAVNV 
101 RTNNDNEEAF DNI VVQGKPT VEAVKRSCPA NSTDLCIDKK GMEYKKGTRS 
151 VSKMPRYIIE YLGVKNGENV YRVTAKAWGK NANTVVVLQS YVSNNDE* 

m248-l/g248 89.1% identity in 202 aa overlap 

10 20 30 40 50 60 

m248-l . pep MRKQNTLTGIPTSDGQRGFALFIVLMVMIVVAFLWTAAQSYNTEQRISANESDRKLALS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I II I I I I I 
g248 MRKQNTLTGI PTSDGQRGSALFIVLMVMIVVAFLVVTAAQSYNTEQRISANESDRKLALS 

10 20 30 40 50 60 

70 80 90 100 110 119 

m248-l.pep LAEAALREGELQVLDLEYDTDSKVTFSENCGKGLCAAVNVRTNND-NEEAFDNIWQGKP 
I I I I I I I I I I : I I I I I I I : I I I I I I I I I I I I I I : I I I I I I I I : : I I I I I I I I I I I I 
g248 LAEAALREGEFQVLDLEYAADSKVTFSENCEKGLCTAVNVRTNNNGSEEAFGNIVVQGKP 

70 80 90 100 110 120 

120 130 140 150 160 170 

T VEAVKRSC PA NSTDLCIDKKGMEYKKGTRSVSKMPRYIIEYLGVKNGENVYRVTA 

: i I I I I I I I I I I I I I I I I I : I I I I I : I I : : I I I I I I I I I I I I I I I I I : I I I I I I I 
AVEAVKRSCPAKSGKNSTDLCIDNKGMEYNKGAAGVSKMPRYIIEYLGVKNGQNVYRVTA 
130 140 150 160 170 180 

180 190 
KAWGKNANTVVVLQSYVSNNDEX 
I I I i I I I I I I I II I I II : I I I I 
KAWGKNANTVVVLQSYVGNNDEQX 
190 200 

97.0% identity in 197 aa overlap 

10 20 30 40 50 60 

MRKQNTLTGIPTSDGQRGFALFIVLMVMIVVAFLVVTAAQSYNTEQRISANESDRKLALS 
I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I ! I I I I I 
MRKQNTLTGI PTSDGQRGFALFIVLMVMIVVAFLVVTAAQSYNTEQRISANESDRKLALS 
10 20 30 40 50 60 

70 80 90 100 110 120 

LAEAALREGELQVLDLEYDTDSKVTFSENCGKGLCAAVNVRTNNDNEEAFDNI VVQGKPT 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I i I I : I II I I I I I I I I I I I I I I I I I I I I I 
LAEAALREGELQVLDLEYDTDSKVTFSENCGKGLCTAVNVRTNNDNEEAFDN I VVQGKPT 
70 80 90 100 110 120 

130 140 150 160 170 180 

VEAVKRSC PANSTDLC I DKKGMEYKKGTRSVSKMPRYI I EYLGVKNGENVYRVTAKAWGK 

I I I I I I I I 1:11 I I I I :l I I I I I I I I : I I I I I I I I II I I I I I 1 I II ! I I I I I I I I I I I 
VEAVKRSCTAKSTGLCIDNKGMEYKKGTQSVSKMPRYIIEYLGVKNGENVYRVTAKAWGK 
130 140 150 160 170 180 



AGAACACTTT GACGGGAATC 
CTGTTTATCG TGCTGATGGT 
TGCCGCGCAG TCTTACAATA 
ACAGGAAATT GGCTTTGTCT 
CTTCAGGTTT TGGATTTGGA 
CGAAAACTGT GGAAAAGGTC 
ATGATAATGA AGAGGCTTTT 
GTTGAGGCGG TGAAGCGTTC 
TGACAAGAAA GGGATGGAAT 
TGCCACGTTA TATTATCGAA 
TATCGGGTTA CTGCCAAGGC 
CCTTCAATCT TATGTAAGCA 



CCGACTTCTG ACGGACAGAG 
GATGATCGTC GTGGCTTTTT 
CCGAGCAGCG GATCAGTGCC 
TTGGCCGAGG CGGCTTTGCG 
ATATGATACG GACAGTAAGG 
TGTGTGCCGC AGTGAATGTG 
GACAATATCG TGGTGCAAGG 
TTGCCCTGCA AATTCTACCG 
ATAAGAAAGG CACGAGAAGC 
TATTTGGGCG TGAAGAACGG 
TTGGGGTAAG AATGCCAATA 
ATAATGATGA GTAA 



m248-l .pep 
g248 

m248-l.pep 
g248 

m248-l/a248 

m248-l.pep 
a248 

m248-l.pep 
a248 

m248-l .pep 
a248 
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190 

m24 8-l .pep NANTVVVLQSYVSNNDEX 
I I I I I ! I I I ! I I I I I I I I 
a 2 4 8 NANTVVVLQSYVSNNDEX 
190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 967>: 

g249 . seq 

1 atgaagaata atgattgctt gcgcctgaaa aatccccagt ccggtatggc 

51 gttgatagaa gtcttggtcg ctatgctcgt tctgaccatc ggtattttgg 

101 cattgctgtc cgtacagttg cggacagtcg cttccgtcag ggaggcggaa 

151 acgcaaacca tcgtcagcca aatcacgcaa aacctgatgg aaggaatgtt 

2 01 gatgaatccg accattgatt tggacagcaa caagaaaaac tatagtcttt 
251 acatgggaaa acagacacta tcagctgtgg atggtgagtt tatgcttgat 

3 01 gccgagaaaa gtaaggcgca gttggcagag gaacaattga agagatttag 
351 tcatgagctg aaaaatgcct tgccggatgc ggtagctatt cattacgccg 
401 tctgcaagga ttcgtcgggt gacgcgccga cattgtccga cagcggtgct 
451 ttttcttcaa attgcgacaa taaggcaaac ggggatactt tgattaaagt 
501 attgtgggta aatgattcgg caggggattc ggatatttcc cgtacgaatc 
551 ttgaagtgag cggcgacaat atcgtatata cctatcaggc aagggtcgga 
601 ggtcgtgaat ga 

This corresponds to the amino acid sequence <SEQ ID 968; ORF 249.ng>: 

g24 9.pep 

1 MKNNDCLRLK NPQSGMALIE V LVAMLVLTI GILALLSV QL RTVASVREAE 

51 TQTIVSQITQ NLMEGMLMNP TIDLDSNKKN YSLYMGKQTL SAVDGEFMLD 

101 AEKSKAQLAE EQLKRFSHEL KNALPDAVAI HYAVCKDSSG DAPTLSDSGA 

151 FSSNCDNKAN GDTLIKVLWV NDSAGDSDIS RTNLEVSGDN IVYTYQARVG 

201 GRE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 969>: 



m24 9 . seq 












1 


ATGAAGAATA 


ATGATTGCTT 


CCGCCTGAAA 


GATTCCCAGT 


CCGGTATGGC 


51 


GCTGATAGAA 


GTCTTGGTTG 


CTATGCTCGT 


TCTGACCATC 


GGTATTTTGG 


101 


CACTATTGTC 


TGTACAGTTG 


CGGACAGTCN 


NNNNNNNNNN 


NNNNNNNNNN 


151 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNTTGATGG 


AGGGAATGTT 


201 


GATGAATCCG 


ACCATTGATT 


CGGACAGCAA 


CAAGAAAAAC 


TATAATCTTT 


251 


ACATGGGAAA 


CCATACACTA 


TCAGCTGTGG 


ATGGCGATTT 


TGCGATTGAT 


301 


GCCATGAAAA 


CTAAGGGGCA 


ATTGGCAGAG 


GCACAATTGA 


AGAGATTTAG 


351 


TTATGAGCTG 


AAAAATGCCT 


TGCCGGATGC 


GGCAGCCATC 


CATTACGCCG 


401 


TCTGCAAGGA 


TTCGTCGGGT 


AACGCGCCGA 


CATTGTCCGG 


CAATGCTTTT 


451 


TCTTCAAATT 


GCGACAATAA 


GGCAAACGGG 


GATACTTTAA 


TTAAAGTATT 


501 


GTGGGTAAAT 


GATTCGGCAG 


GGGATTCGGA 


TATTTCCCGT 


ACGAATCTTG 


551 


AGGTGAGCGG 


CGACAATATC 


GTATATACTT 


ATCAGGCAAG 


GGTCGGAGGT 


601 


CGGGAATGA 











This corresponds to the amino acid sequence <SEQ ID 970; ORF 249>: 

m24 9 .pep 

1 MKNNDCFRLK DSQSGMALIE V LVAMLVLTI GILALLSV QL RTVXXXXXXX 
51 XXXXXXXXXX XLMEGMLMNP TIDSDSNKKN YNLYMGNHTL SAVDGDFAID 
101 AMKTKGQLAE AQLKRFSYEL KNALPDAAAI HYAVCKDSSG NAPTLSGNAF 
151 SSNCDNKANG DTLIKVLWVN DSAGDSDISR TNLEVSGDNI VYTYQARVGG 
201 RE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 249 shows 8 1 .3% identity over a 203 aa overlap with a predicted ORF (ORF 249.ng) 
from N. gonorrhoeae: 

m249/g249 

10 20 30 40 50 60 

m24 9 . pep MKNNDCFRLKDSQSGMALIEVLVAMLVLTIGILALLSVQLRTVXXXXXXXXXXXXXXXXX 
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g249 MKNNDCLRLK1JPQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQ 

10 20 30 40 50 60 



70 80 90 100 110 120 

m24 9 . pep XLMEGMLMNPTIDSDSNKKNYNLYMGNHTLSAVDGDFAIDAMKTKGQLAEAQLKRFSYEL 

Illlllllllll I I I I II hi I II -I I I I I I hi :|| hhllll Illllhll 
g24 9 NLMEGMLMNPTIDLDSNKKNYSLYMGKQTLSAVDGEFMLDAEKSKAQLAEEQLKRFSHEL 

70 80 90 100 110 120 



130 140 150 160 170 179 

m249 . pep KNALPDAAAIHYAVCKDSSGNAPTLSGN-AFSSNCDNKANGDTLIKVLWVNDSAGDSDIS 

1 1 1 h I h 1 1 1 II 1 1 1 1 1 1 hlil M : li 1 1 i i I I I 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

g249 KNALPDAVAIHYAVCKDSSGDAPTLSDSGAFSSNCDNKANGDTLIKVLWVNDSAGDSDIS 

130 140 150 160 170 180 



180 190 200 

m2 4 9 . pep RTNLEVSGDNIVYTYQARVGGREX 

Ml III 1 1 II I II I MM III III 

g24 9 RTNLEVSGDNIVYTYQARVGGREX 

190 200 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 97 1>: 

a249.seq 

1 ATGAAGAATA ATGATTGCTT CCGCCTGAAA AACCCCCAGT CCGGTATGGC 

51 GCTGATAGAA GTCTTGGTCG CTATGCTCGT TCTGACCATC GGTATTTTGG 

101 CACTATTGTC TGTTCAGTTG CGGACAGTCG CTTCCGTCAG GGAGGCAGAG 

151 ACGCAAACCA TCGTCAGTCA AATCACGCAA AACCTGATGG AAGGAATGTT 

201 GATGAATCCG ACCATTGATT CGGACAGCAA CAAGAAAAAC TATAATCTTT 

251 ACATGGGAAA CCATCATGCA CTATCAGTTG TGGATGGCGA TTTTCAGGTT 

301 GATGCCATAA AAACTAAGAC GCAGTTGGCA GAGGCACAAT TGAAGAGATT 

351 TAGTTATGAG CTGAAAAATG CCTTGCCGGA TGCGGCAGCC ATCCATTACG 

401 CCGTCTGCAA GGATTCGTCG GGTGTTGCGC CGACATTGTC CGCCGGCAGT 

451 ACTTTTTCTT CAAATTGCGA TGGTAGTGCA AATGGGGATA CTTTGATTAA 

501 AGTATTGTGG GTAAATGATT CGGCAGGGGA TTCGGATATC GCCCGTACGA 

551 ATCTTGAGAC GAACGGCAAC AATATCGTAT ATACCTATCA GGCAAGGGTC 

601 GGAGGTCGGG AATGA 

This corresponds to the amino acid sequence <SEQ ID 972; ORF 249.a>: 

a24 9.pep 

1 MKNNDCFRLK NPQSGMALIE V LVAMLVLT I GILALLSV QL RTVASVREAE 

51 TQTIVSQITQ NLMEGMLMNP TIDSDSNKKN YNLYMGNHHA LSVVDGDFQV 

101 DAIKTKTQLA EAQLKRFSYE LKNALPDAAA IHYAVCKDSS GVAPTLSAGS 

151 TFSSNCDGSA NGDTLIKVLW VNDSAGDSDI ARTNLETNGN NIVYTYQARV 

201 GGRE* 

m249/a249 8 1 .9% identity in 204 aa overlap 

10 20 30 40 50 60 

m24 9 . pep MKNNDCFRLKDSQSGMALIEVLVAMLVLTIGILALLSVQLRTVXXXXXXXXXXXXXXXXX 

I I I I I I I I I I : I I I I I I I I II I I I I I I I I II I I I I I I I I I I I : : : 

a24 9 MKNNDCFRLKNPQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQ 

10 20 30 40 50 60 



70 80 90 100 110 119 

m24 9.pep XLMEGMLMNPTIDSDSNKKNYNLYMGNH-TLSAVDGDFAIDAMKTKGQLAEAQLKRFSYE 
I I I I I I I I I I I I I I I I I I I I I I I II I I : I I : II I I I : I I : I I I I I I I I I I I I I I I I 
a24 9 NLMEGMLMNPTIDSDSNKKNYNLYMGNHHALSWDGDFQVDAIKTKTQLAEAQLKRFSYE 

70 80 90 100 110 120 



120 130 140 150 160 170 

m24 9.pep LKNALPDAAAIHYAVCKDSSGNAPTLS-GNAFSSNCDNKANGDTLIKVLWVNDSAGDSDI 
I I I I I I I I I I 1 I I I I I I I I I I I I 1 I I I ::! I I I I I :: I I I I I I I I I I I I I I I i I I I I I 
a24 9 LKNALPDAAAIHYAVCKDSSGVAPTLSAGSTFSSNCDGSANGDTLIKVLWVNDSAGDSDI 

130 140 150 160 170 180 
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180 190 200 

m2 4 9 . pep SRTNLEVSGDNIVYTYQARVGGREX 
: I I I I I : : I : I I I I I I I I I I I I I I I 
a2 4 9 ARTNLETNGNNI VYTYQARVGGREX 

190 200 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 973>: 

m249-l . seq 

1 ATGAAGAATA ATGATTGCTT CCGCCTGAAA GATTCCCAGT CCGGTATGGC 

51 GCTGATAGAA GTCTTGGTTG CTATGCTCGT TCTGACCATC GGTATTTTGG 

101 CACTATTGTC TGTACAGTTG CGGACAGTCG CTTCCGTCAG GGAGGCGGAG 

151 ACACAAACCA TCGTCAGCCA AATCACGCAA AACCTGATGG AGGGAATGTT 

201 GATGAATCCG ACCATTGATT CGGACAGCAA CAAGAAAAAC TATAATCTTT 

251 ACATGGGAAA CCATACACTA TCAGCTGTGG ATGGCGATTT TGCGATTGAT 

301 GCCATGAAAA CTAAGGGGCA ATTGGCAGAG GCACAATTGA AGAGATTTAG 

351 TTATGAGCTG AAAAATGCCT TGCCGGATGC GGCAGCCATC CATTACGCCG 

401 TCTGCAAGGA TTCGTCGGGT AACGCGCCGA CATTGTCCGG CAATGCTTTT 

451 TCTTCAAATT GCGACAATAA GGCAAACGGG GATACTTTAA TTAAAGTATT 

501 GTGGGTAAAT GATTCGGCAG GGGATTCGGA TATTTCCCGT ACGAATCTTG 

551 AGGTGAGCGG CG ACAATATC GTATATACTT ATCAGGCAAG GGTCGGAGGT 

601 CGGGAATGA 

This corresponds to the amino acid sequence <SEQ ID 974; ORF 249-l>: 

m249-l .pep 

1 MKNNDCFRLK DSQSGMALIE V LVAMLVLTI GILALLSV QL RTVASVREAE 
51 TQTIVSQITQ NLMEGMLMNP TIDSDSNKKN YNLYMGNHTL SAVDGDFAID 
101 AMKTKGQLAE AQLKRFSYEL KNALPDAAAI HYAVCKDSSG NAPTLSGNAF 
151 SSNCDNKANG DTLIKVLWVN DSAGDSDISR TNLEVSGDNI VYTYQARVGG 
201 RE* 

m249-l/g249 90.1% identity in 203 aa overlap 

10 20 30 40 50 60 

MKNNDCFRLKDSQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQ 
i I I I I I : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I ! I I I I 1 I I 
MKNNDCLRLKNPQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQ 
10 20 30 40 50 60 

70 80 90 100 110 120 

NLMEGMLMNPTIDSDSNKKNYNLYMGNHTLSAVDGDFAIDAMKTKGQLAEAQLKRFSYEL 
I I I I I I I I I I I I I I I I I I M : I I I I I I I II I I : I : I I I :| : I I I I I I I I I I : I I 
NLMEGMLMNPTIDLDSNKKNYSLYMGKQTLSAVDGEFMLDAEKSKAQLAEEQLKRFSHEL 
70 80 90 100 110 120 

130 140 150 160 170 179 

KNALPDAAAIHYAVCKDSSGNAPTLSGN-AFSSNCDNKANGDTLIKVLWVNDSAGDSDIS 
I I I I I i I : I I I II ! I I I I I I : I I I I I : I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
KNALPDAVAIHYAVCKDSSGDAPTLSDSGAFSSNCDNKANGDTLIKVLWVNDSAGDSDIS 
130 140 150 160 170 180 

180 190 200 

m2 4 9-1 .pep RTNLEVSGDN I VYTYQARVGGREX 
, I I I I I I II I I I I I I I I I I I I I I I I 
g249 RTNLEVSGDNI VYTYQARVGGREX 

190 200 

a249/ L36117 

gi 1 643582 (L36117) prepilin leader sequence requires cleavage to be active (Pseudomonas 
aeruginosa] 

>gi 1 1161222 (L48934) involved in type 4 fimbrial biogenesis; contains pre-pilin like leader 
sequence [Pseudomonas aeruginosa] 

>gi 1 1246299 (L76605) reference L36117, L48934 [Pseudomonas aeruginosa] Length = 185 
Score = 50.4 bits (118) , Expect = 9e-06 

Identities = 45/183 (24%), Positives « 84/183 (45%), Gaps - 26/183 (14%) 

Query: 13 QSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQNLMEGMLMNPTI 72 

QSG ++IEVLVA+L+++IG+L ++++Q +T+ ++ + + + NL+E M + P 
Sbjct: 12 QSGFSMIEVLVALLLISIGVLGMIAMQGKTIQYTADSVERNKAAMLGSNLLESMRASPKA 71 



m249-l.pep 
g249 

m249-l.pep 
g249 

m2 4 9-1. pep 
g249 



Query: 7 3 DSDSNKKNYNLYMGNHHALSVVDGDFQVDAIKTKTQLAEA QLKRFSYELKNALPDAA 12 9 
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D + M G A + T L +A +L +4- ++KN LP A 

Sbjct: 72 LYDVKDQ MATQSDFFKAKGSAFPTAPSSCTPLPDAIKDRLGCWAEQVKNELPGAG 126 

Query: 130 AI— HYAVCKDSSGVAPTLSAGSTFSSNCDGSANGDTL-IKVLWVNDSAGDSDIARTNL 185 

+ Y +C+ S +CDG G L I++ W + A ++ 

Sbjct: 127 DLLKSDYYICRSSK PGDCDG-- KGSMLEIRLAWRGKQGACVNAADSSA 172 

Query: 186 ETN 188 
+T+ 

Sbjct: 173 DTS 175 

m249-l/a249 90.7% identity in 204 aa overlap 

10 20 30 40 50 60 

m249-l.pep MKNNDCFRLKDSQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQ 
Mllllllll: IIIMIIIItllllllllllliillllllMlllitllllllltllll 
a2 4 9 MKNNDCFRLKNPQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTI VSQITQ 

10 20 30 40 50 60 

70 80 90 100 110 119 

m24 9-l.pep NLMEGMLMNPTIDSDSNKKNYNLYMGNH-TLSAVDGDFAIDAMKTKGQLAEAQLKRFSYE 
I I I I I I I I I I I I I I I I I I I I I I I I I | | | :||:|tlll :tl:IM I I I i I t I I I I I I I 
a24 9 NLMEGMLMNPTIDSDSNKKNYNLYMGNHHALSVVDGDFQVDAIKTKTQLAEAQLKRFSYE 

70 80 90 100 110 120 

120 130 140 150 160 170 

m249-l.pep LKNALPDAAAIHYAVCKDSSGNAPTLS-GNAFSSNCDNKANGDTLIKVLWVNDSAGDSDI 
IIIIIIIMIIIIMMIIII Mil! I I I I I i I I I ! I f| | | | | | | 

a249 LKNALPDAAAIHYAVCKDSSGVAPTLSAGSTFSSNCDGSANGDTLIKVLWVNDSAGDSDI 
130 140 150 160 170 180 

180 190 200 

m2 4 9 - 1 . pep SRTNLEVSGDNI V YT YQARVGGREX 
: I I I I I :: I : I I I 1 i I I I I I I I I I I 
a249 ARTNLETNGNNIVYT YQARVGGREX 

190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 975>: 

g250.seq 

1 atgacgcaca cagcctctcc acgtgatgaa ttcatacgcg gcataaaaga 

51 aagttcgccc atgctgattg ggcttttgcc ttgggcattg atactcggta 

101 tgcagggcgg gcaaaaaggt atgggccggc tggaaatgct gctgatgacg 

151 gggatgaact ttgccggcgg ctccgaattt gccacggtca acctgtgggc 

201 ggaacctctg ccgatactgc ttatcgccac cataaccttt atgattaatt 

251 cgcggcatat cctgatgggg ggcggcgctt gccacgcaca tgaaagaaat 

301 accgctgaaa aaagccgcgc ccgcgctgtt ttttatgtgt ga 

This corresponds to the amino acid sequence <SEQ ID 976; ORF 250.ng>: 

g250.pep 

1 MTHTASPRDE FIRGIKE SSP MLIGLLPWAL ILGM QGGQKG MGRLEMLLMT 

51 GMNFAGGSEF ATVNLWAEPL PILLI ATITF MINSRHILMG GG ACHAHERN 

101 TAEKSRARAV FYV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 977>: 

m250 . seq 

1 ATGCACACCT TCCCCGCATA ACGAATTTAT ACGCGGCATC AAAGAAAGTT 

51 CGCCTATGCT GATTGGGCTG CTGCCTTGGG CATTAATACT CGGTATGCAG 

101 GGCGGACAAA AAGGCATGAG CTGGCTGGAA ATGTTGTTGA TGACCAGTAT 

151 GAACTTCGCC GGCGGCTCCG AGTTTGCCAC GGTCAACCTG TGGGCsGAAC 

201 CTCTGCCGAT ACTGCTTATC GCCACCGTAA CCTTTATGAT TAATTCTCGG 

251 CATATCCTGA T . GGGGGCGG CGCTTGCCCC GCACCTGAAA GGAaTACCGC 

301 TGAAAAAAGC CGTGCCCGCA CTGTTTTTTA TGTGTGA 

This corresponds to the amino acid sequence <SEQ ID 978; ORF 250>: 

m2 50.pep 

1 MHTPSPHNEF IRGIKE SSPM LIGLLPWALI LGM OGGOKGM SWLEMLLMTS 

51 MNFAGGSEFA TVNLWAEPLP ILLI ATVTFM INSRHILMGG G ACPAPERNT 

101 AEKSRARTVF YV* 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 250 shows 91.0% identity over a 1 1 1 aa overlap with a predicted ORF (ORF 250.ng) 
from N. gonorrhoeae: 

m250/g250 

10 20 30 40 50 59 

m250 .pep MHTPSPHNEFIRGIKESSPMLIGLLPWALILGMQGGQKGMSWLEMLLMTSMNFAGGSEF 

II M-MIIIIIMIIIIMIIMIIIilllllMlh Mllllhlllllllll 

9 250 MTHT AS PRDE F I RGI KE S S PML I GLL PW AL I LGMQGGQKGMGRLEMLLMTGMNFAGGS E F 

10 20 30 40 50 60 

60 70 80 90 100 110 

m250 . pep ATVNLWAEPLPILLIATVTFMINSRHILMGGGACPAPERNTAEKSRARTVFYVX 

IIMIIIIIIIIIIIIhllMIIIIIIIIIIII I llllllllllhllll 

9250 ATVNLWAE PL P I LL I AT I TFM I NS RH I LMGGGACHAHERNTAE KS RARAVF YV 

70 80 90 100 110 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 979>: 

a250 . seq 

1 AT G AC AC AC A TAAGCTCGCC CCGTAACGAA TTTATACGCG GCATCAAAGA 

51 AAGTTCGCCC ATGCTGATCG GGCTTTTGCC TTGGGCATTA ATACTCGGTA 

101 TGCAGGGTGG ACAAAAAGGC ATGAGCTGGC TGGAAATGTT GTTGATGACC 

151 GGTATGAACT TCGCCGGCGG CTCCGAGTTT GCCACGGTCA ACCTGTGGGC 

201 GGAACCTCTG CCGATACTGC TTATCGCCAC CGTAACCTTT ATGATTAATT 

251 CTCGGCATAT CCTGATGGGG G . CGGCACTT GCCCCGCACC TGAAAGAAAT 

301 ACCGCTGAAA AAAGCCGTGC CCGCACTGTT TTTTATGTGT GA 

This corresponds to the amino acid sequence <SEQ ID 980; ORF 250.a>: 

a250 .pep 

1 MTHISSPRNE FIRGIKE SSP MLIGLLPWAL ILGM QGGQKG MSWLEMLLMT 

51 GMNFAGGSEF ATVNL WAE PL PILLIATVTF MI NSRHILMG XGTCPAPERN 

101 TAEKSRARTV FYV* 

m250/a250 94.6% identity in 1 1 1 aa overlap 

10 20 30 40 50 

59 

m250 . pep 

MHTPSPHNEFIRGIKESSPMLIGLLPWALILGMQGGQKGMSWLEMLLMTSMNFAGGSEF 
I 

M : I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | : | | | | | | | | | 
a250 

MTHISSPRNEFIRGIKESSPMLIGLLPWALILGMQGGQKGMSWLEMLLMTGMNFAGGSEF 

10 - 20 30 40 50 

60 

60 70 80 90 100 110 

m250 . pep ATVNLWAEPLPILLIATVTFMINSRHILMGGGACPAPERNTAEKSRARTVFYVX 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I i : I I | || | | | | | | | | | | | | M | | 
a250 ATVNLWAEPLPILLIATVTFMINSRHILMGXGTCPAPERNTAEKSRARTVFYVX 

70 80 90 100 110 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 98 1>: 

g251.seq 

1 atgcctgacc caatagggat tcttttcgct gccgtcgggg ttgatttttt 

51 tgccgttgtt ttgagggggc gttttcaacg aataggcgcg gttggcatgt 

101 tgataataat aatcctgatg gcggaggtcg gaaccaaaac ggtcgtaacc 
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151 gaggttgacg ctcaggttgt ggcggatttt ggcggtatcg aaggattttt 

201 tgaatgccgc ctgcaagagc ctgtggcttt ccccgtaaat cacgcggtcg 

251 gatttgtagt aggaagacgg cttgtcggca ctcgggcggc aatatttgtc 

301 cgaaccgtcg gcggaacagt gcgtctgctg aaaatgattg tccaaaccga 

351 tgccctgccg gtcgtaagag aggcgggcat aatccgccca agtgtcttta 

401 tcggcattgg tatagacata ttccaaaccg tagcggcttt tggtgtgcgt 

451 ctcgtcgtaa aacacgcccg taccgtattc cgcgcccacc tccgcaccgt 

501 tttcaccgtt ggtaatcagc ccgctgtatt tgcggccgcc cgcgtatttg 

551 ccgtagcctc ttatcgatcc gtatttttta ttttcatcaa aaaccgcctt 

601 ggtcaggaat gccggaaccg tcatatcgcg cgtgtcgaaa gtttgctgcg 

651 tgcgttcgag tatgccgccg atgtagtgcc gtttgttttc aaaacgaaaa 

701 cccgggcgga acagccacga ccggctttcg tatga 



This corresponds to the amino acid sequence <SEQ ID 982; ORF 251.ng>: 



1 MPDPIGILFA AVGVDFFAW LRGRFQRIGA VGMLIIIILM A EVGTKTWT 

51 EVDAQWADF GGIEGFFECR LQEPVAFPVN HAVGFWGRR LVGTRAAIFV 

101 RTVGGTVRLL KMIVQTDALP WREAGIIRP S VFIGIGIDI FQTVAAFG VR 

151 LWKHARTVF RAHLRTVFTV GNQPAVFAAA RVFAVASYRS VFFIFIKNRL 

201 GQECRNRHIA RVESLLRAFE YAADWPFVF KTKTRAEQPR PAFV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 983>: 



1 ATGCGTGCTG CGGTAGTCGT AGCGCAAGCC CGCGCCGACA TCCGCCCACC 

51 TGCCCAAACG GACATTGTCC CGAACTGCCG CGTAATAGCT TTTACCGTTG 

101 ATGCTGCGCG GCGTGCAGTC CGTATAAGTA TTGTTGCCCA AGCGGCAGAT 

151 TTGCCCCGTA ACGACATTTC CCCTGCCTAT GGTGACCCAA TAGGGGCTGG 

201 TTTCACTGCC GTTGGGGCTG ATTTTTTTGC CGTTGTTTTG AGGGGGCGTG 

251 TTCGACGAAT AGGCGCGGTT GGCATGTTGA TAATAATAAT CCTGATGGCG 

301 GAGATTAGAG CCAAAGCGGT CAAACCCGAG ATTCACGCTC AGGTTGTGGC 

351 GGATTTTGGC GGTATCGAAG GATTTTTTGA ATGCCGCCTG CAAGAGCCTG 

4 01 TGGCTTTCCC CGTAAATCAC GCGATCGGAT TTGTAATAGG AAAACGGCTT 

451 GTCGGCACTC GGGCGGCAAT ATTTGTCCGA ACCGTCGGCA GAACAGTGCG 

501 TCTGCTGAAA ATGATTATCC AAACCGATGC CCTGCCGGTC GTAAGAGAGG 

551 CGGGCATAAT CCGCCCAAGT GTCTTTATCG GCATTGGTAT AGACATATTC 

601 CAAAC CGTAG CGGCTTTTGG TGTGCGTCTC GTCGTAAAAC ACGCCCGTAC 

651 CGTATTCCGC GCCCACCAGC GCACCGTTTT CGCCGTTGGT • AAACAGTCCG 

701 CCGTATTTGT GGTTGCCCGC GTATTTGCCG TTACCGGGCA AAGAACCCGC 

751 CTGTTTTTTA TTTGCATCAA AAACCGCCTT GGTCAGGAAT GCCGGAACCG 

801 TCATATCGCG CGTGTCGAAA GTTTGTTGCG TGTGTTCGAG TATGCCGCCG 

851 ATGTAGTGCC GCTTATTCTC AAAACGAAAA CCCGGGCGGA ACAGCCACGA 

901 CCGGCTTTCG TATGA 



This corresponds to the amino acid sequence <SEQ ID 984; ORF 25 1>: 



1 MRAAVWAQA RADIRPPAQT DIVPNCRVIA FTVDAARRAV RISIVAQAAD 
51 LPRNDISPAY GDP I GAG FT A VGADFFAWL RG RVRRIGAV GMLIIIILMA 
101 EIRAKAVKPE IRAQ WAD FG GIEGFFECRL QEPVAFPVNH AIGFVIGKRL 
151 VGTRAAI FVR TVGRTVRLLK MIIQTDALPV VREAGIIRPS VFIGIGIDIF 
201 QTVAAFG VRL WKHARTVFR AHQRTVFAVG KQSAVFWAR VFAVTGQRTR 
251 LFFICIKNRL GQECRNRHIA RVESLLRVFE YAADWPLIL KTKTRAEQPR 
3 01 PAFV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 25 1 shows 85.2% identity over a 243 aa overlap with a predicted ORF (ORF 25 1 .ng) 
from N. gonorrhoeae: 



g251.pep 



m251 . seq 



m251 .pep 



m25l/g251 



m251.pep 



40 50 60 70 80 90 

TVDAARRAVRI S I VAQAADLPRND I SPAYGDP I GAGFTAVGADFFAWLRGRVRRI GAVG 



g251 




10 20 30 
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100 110 120 130 140 150 

tn2 5 1 . pep ML 1 1 1 1 LMAE I RAKAVKPE I HAQ WAD FGG I EG F FE CRLQE P VAF PVNHAI GFV I GKRLV 

I | || I I I I I I = : I : I I : I I I I I I II I I I I I I I I I I I I I I I I I I I I h I I h h II I 
g251 MLIII ILMAEVGTKTWTEVDAQWADFGGIEGFFECRLQEPVAFPVNHAVGFWGRRLV 

40 50 60 70 80 90 



160 170 180 190 200 210 

m251.pep GTRAAI FVRTVGRTVRLLKMI IQTDALPWREAGI IRPSVFIGIGIDI FQTVAAFGVRLV 

I I I I I I II I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
g251 GTRAAI FVRTVGGTVRLLKMIVQTDALPWREAGI IRPSVFIGIGIDI FQTV71AFGVRLV 

100 110 120 130 140 150 



220 230 240 250 260 270 

m2 5 1 . pep VKHARTVFRAHQRTVFAVGKQSAVFWARVFAVTGQRTRLFFI CI KNRLGQECRNRHI AR 

I, Mlhl Mlhlhl ll|::|IIIM- I :||| I I II I II I I I I I I I I I 
g251 VKHARTVFRAHLRTVFTVGNQPAVFAAARVFAVASYRS-VFFIFI KNRLGQECRNRHI AR 

160 170 180 190 200 210 



280 290 300 

m251 . pep VESLLRVFEYAADWPLILKTKTRAEQPRPAFVX 

I I I I I I : I I I I I I I I h : : | I I I I I I I I I I I I I I 
g251 VESLLRAFEYAADWPFVFKTKTRAEQPRPAFVX 
220 230 240 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 985>: 

a251 . seq 

1 ATGCGTGCTG CGGTAGTCGT AGCGCAACCC CGCGCCGACA TCCGCCCACC 

51 TGCCCAAACG GACATTGTCC CGAACTGCCG CGTAATAGCT TTTGCCGTTG 

101 ATGCTGCGCG GCGTGCAGTC CGTATAAGTA TTGTTGCCCA AGCGGCAGAT 

151 TTGCCCCGTA ACCACATTTC CCCTGCCTAT GCTGACCCAA TAGGGTTGGT 

201 CCTTGCCGCC GTTGGGGTTG GCGGTTTTAG GGGGCGTTTT CGACGAATAG 

251 GCGCGGTTGG CATGTTGATA ATAATAATCC TGATGGCGGA GATTAGAGTC 

301 AAAGCGGTCA AAACCGAGAT TCACGCTCAG GTTGTGGCGG ATTTTGGCGG 

351 TATCGAAGGA TTTTTTGAAT GCCGCCTGCA AGAGCCTGTG GCTTTCCCCG 

401 TAAATCACGC GGTCGGATTT GTAGTAGGAA AACGGCTTGT CGGCACTCGG 

4 51 GCGGCAATAT TTGTCCGAAC CGTCGGCAGA ACAGTGCGTC TGCTGAAAAT 

501 GATTGTCCAA ACCGATGCCC TGCCGGTCGT AAGAGAGGCG GGCATAATCC 

551 ACCCAAGTGT CTTTATCGGC ATTGGTATAG ACATATTCCA AACCGTAGCG 

601 GCTTTTGGTG TGCGTCTCGT CGTAAAACAC GCCCGTACCG TATTCCGCGC 

651 CCACCAGCGC ACCGTTTTCG CCGTTGGTAA ACAGACCGCC GTATTTGTGG 

701 TCGCCCGCGT ATTTGCCGTT GCCTCTTATC GGTCCGTATT TTCTATTTTC 

751 ATCAAAAACC GCCTTGGTCA GGAATGCCGG AACCGTCATA TCGCGCGTGT 

801 CGAAAGTTTG TTGCGTGTGT TCGAGTATGC CGCCGATGTA GTGCCGTTTG 

851 TTTTCAAAAC GAAAACCCGG GCGGAACAGC CACGATCGGC TTTCGTATGA 

This corresponds to the amino acid sequence <SEQ ID 986; ORF 251.a>: 

a251 . pep 

1 MRAAWVAQP RADIRPPAQT DIVPNCRVIA FAVDAARRAV RISIVAQAAD 

51 LPRNHISPAY ADPIGLVLAA VGVGGFRGRF R RIGAVGMLI IIILMAEI RV 

101 KAVKTEIHAQ WADFGGIEG FFECRLQEPV AFPVNHAVGF VVGKRLVGTR 

151 AAIFVRTVGR TVRLLKMIVQ TDALPWREA GIIHPS VFIG IGIDIFQTVA 

201 AFGVRLWKH ARTVFRAHQR TVFAVGKQTA VFWARVFAV ASYRSVF SIF 

251 IKNRLGQECR NRHIARVESL LRVFEYAADV VPFVFKTKTR AEQPRSAFV* 

m25 l/a25 1 88.5% identity in 304 aa overlap 

10 20 30 40 50 60 

m251.pep MRAAWVAQARADIRPPAQTDIVPNCRVIAFTVDAARRAVRISIVAQAADLPRNDISPAY 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I Mill 
a251 MRAAVVVAQPRADIRPPAQTDIVPNCRVIAFAVDAARRAVRISIVAQAADLPRNHISPAY 

10 20 30 40 50 60 



70 80 90 100 110 120 

m251 . pep G DP I GAG FT AVG ADFFAWLRGRVRRI GAVGML 1 1 1 1 LMAE I RAKAVKPE I HAQ WAD FG 

: I II I : : I I I : I Ml II I I I I I I I M I I I I I I II : I I I I II I I I I II I I I 
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a2 5 1 ADPIGLVLAAVGVGGF RGRFRRIGAVGMLI 1 1 1 LMAE I RVKAVKTE I HAQWADFG 

70 80 90 100 110 

130 140 150 160 170 180 

m251 . pep GIEGFFECRLQEPVAFPVNHAIGFVIGKRLVGTRAAIFVRTVGRTVRLLKMIIQTDALPV 
I I I I I II I II I I I I I I I I I I I : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I : | I | | | | | 
a251 GIEGFFECRLQE PVAFPVNHAVG FWGKRLVGTRAAIFVRTVGRTVRLLPCMIVQTDALPV 

120 130 140 150 160 170 

190 200 210 220 230 240 

m251 . pep VRE AG IIRPSVFIGIGIDI FQT VAAFG VRL WKHART V FRAHQRT V FAVGKQ S AV FWAR 

I I I I I I I : I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I II I I I : I I M I 1 I 
a251 VREAGI IHPSVFIGIGI DI FQTVAAFGVRLWKHARTVFRAHQRTVFAVGKQTAVFVVAR 

180 190 200 210 220 230 

250 260 270 280 290 300 

m251 . pep VFAVTGQRTRLFFICIKNRLGQECRNRHIARVESLLRVFEYAADWPLILKTKTRAEQPR 
MM:: | : : | | I M I I I I I I I I I I I I i II I I I I I I I I I I II I I ::: I I I II I I I II 
a251 VFAVASYRS-VFSIFIKNRLGQECRNRHIARVESLLRVFEYAADVVPFVFKTKTRAEQPR 
240 * 250 260 270 280 290 



m251.pep PAFVX 
MM 

a251 SAFVX 
300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 987>: 

g253 . seq 

1 atgatcgaca gggaccgtat gttgcgggac acgttggaac gtgtgcgtgc 

51 ggggtcgttc tggttatggg tggtggtggc atcgatgatg tttaccgccg 

101 gattttcagg cacttatctt ctgatggaca atcaggggct gaatttcttt 

151 ttagttttgg cgggagtgtt gggcatgaat acgctgatgc tggcagtatg 

201 gttggcaacg ttgttcctgc gcgtgaaagt gggacggttt ttcagcagtc 

251 cggcgacgtg gtttcggggc aaaggccctg taaatcaggc ggtgttgcgg 

301 ctgtatgcgg accagtggcg gcaaccttcg gtacgatgga aaataggcgc 

351 aacggcgcac agcttgtggc tctgcacgct gctcggaatg ctggtgtcgg 

401 tattgctgct gcttttggtg cggcaatata cgttcaactg ggaaagcacg 

451 ctgttgagca atgccgcttc ggtacgcgcg gtggaaatgt tggcatggct 

501 gccgtcgaaa ctcggtttcc ctgtccccga tgcgcgggcg gtcatcgaag 

551 gtcgtctgaa cggcaatatt gccgatgcgc gggcttggtc ggggctgctg 

601 gtcggcagta tcgtctgcta cggcatcctg ccgcgcctct tggcttgggt 

651 agtgtgtaaa atccttttga aaacaagcga aaacggattg gatttggaaa 

701 aaacctatta tcaggcggtc atccgccgct ggcagaacaa aatcaccgat 

751 gcggatacgc gtcgggaaac cgtgtccgcc gtttcgccga aaatcgtctt 

801 gaacgatgcg ccgaaatggg cgctcatgct ggagaccgag tggcaggacg 

851 gccaatggtt cgagggcagg ctggcgcagg aatggctgga taagggcgtt 

901 gccgccaatc gggaacaggt tgccgcgctg gagacagagc tgaagcagaa 

951 accggcgcaa ctgcttatcg gcgtacgcgc ccaaactgtg ccggaccggg 

1001 gcgtgctgcg gcagattgtg cggctttcgg aagcggcgca gggcggcgcg 

1051 gtggtgcagc ttttggcgga acaggggctt tcagacgacc tttcggaaaa 

1101 gctggaacat tggcgtaacg cgctgaccga atgcggcgcg gcgtggcttg 

1151 agcctgacag ggtggcgcag gaaggccgtt tgaaagacca ataa 

This corresponds to the amino acid sequence <SEQ ID 988; ORF 253. ng>: 

g253.pep 

1 MIDRDRMLRD TLERVRAGS F WL WW AS MM FTAGFS GTYL LMDNQGLNFF 

51 LVIiAGVLGMN TLMLAW IAT LFLRVKVGRF FSSPATWFRG KGPVNQAVLR 

101 LYADQWRQPS VRWKIGATAH SLW LCTLLGM LVSVLLLLLV RQYTFNWEST 

151 LLSNAASVRA VEMLAWLPSK LGFPVPDARA VIEGRLNGNI ADARAWSGLL 

201 VGSIVCYGIL PRLLA WWCK ILLKTSENGL DLEKTYYQAV IRRWQNKITD 

251 ADTRRETVSA VSPK1VLNDA PKWALMLETE WQDGQWFEGR LAQEWLDKGV 
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3 01 AANREQVAAL ETELKQKPAQ LLIGVRAQTV PDRGVLRQIV RLSEAAQGGA 

3 51 WQLLAEQGL SDDLSEKLEH WRNALTECGA AWLEPDRVAQ EGRLKDQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 989>: 

m253.seq 

1 ATGATTGACA GGAACCGTAT GCTGCGGGAG ACGTTGGAAC GTGTGCGTGC 

51 GGGGTCGTTC TGGTTGTGGG TGGTGGCGGC GACGTTTGCA TTTTTTACCG 

101 GTTTTTCAGT CACTTATCTT CTAATGGACA ATCAGGGTCT GAATTTCTTT 

151 TTGGTTTTGG CGGGCGTGTT GGGCATGAAT ACGCTGATGC TGGCAGTATG 

2 01 GTTGGCAATG TTGTTCCTGC GTGTGAAAGT GGGGCGTTTT TTCAGCAGTC 

251 CGGCGACGTG GTTTCGGGGC AAAGACCCTG TAAATCAGGC GGTGTTGCGG 

301 CTGTATGCGG ACGAGTGGCG GCAACCTTCG GTACGTTGGA AAATAGGCGC 

351 AACGTCGCAC AGCCTGTGGC TCTGCACGCT GCTCGGAATG CTGGTGTCGG 

4 01 TATTGTTGCT GCTTTTGGTG CGGCAATATA CGTTCAACTG GGAAAGCACG 
451 CTGTTGAGCA ATGCCGCTTC GGTACGCGCG GTGGAAATGT TGGCATGGCT 
501 GCCGTCGAAA CTCGGTTTCC CTGTCCCCGA TGCGCGGGCG GTCATCGAAG 
551 GCCGTCTGAA CGGCAATATT GCCGATGCGC GGGCTTGGTC GGGGCTGCTG 
601 GTCGGCAGTA TCGCCTGCTA CGGCATCCTG CCGCGCCTGC TGGCTTGGGT 
651 AGTGTGTAAA ATCCTTTTGA AAACAAGCGA AAACGGATTG GATTTGGAAA 
701 AGCCCTATTA TCAGGCGGTC ATCCGCCGCT GGCAGAACAA AATCACCGAT 
751 GCGGATACGC GTCGGGAAAC CGTGTCCGCC GTTTCACCGA AAATCATCTT 
801 GAACGATGCG CCGAAATGGG CGGTCATGCT GGAGACCGAG TGGCAGGACG 
851 GCGAATGGTT CGAGGGCAGG CTGGCGCAGG AATGGCTGGA TAAGGGCGTT 
901 GCCACCAATC GGGAACAGGT TGCCGCGCTG GAGACAGAGC TGAAGCAGAA 
951 ACCGGCGCAA CTGCTTATCG GCGTGCGCGC CCAAACTGTG CCGGACCGCG 

1001 GCGTGTTGCG GCAGATTGTC CGACTCTCGG AAGCGGCGCA GGGCGGCGCG 

1051 GTGGTGCAGC TTTTGGCGGA ACAGGGGCTT TCAGACGACC TTTCGGAAAA 

1101 GCTGGAACAT TGGCGTAACG CGCTGGCCGA ATGCGGCGCG GCGTGGCTTG 

1151 AGCCTGACAG GGCGGCGCAG GAAGGGCGTT TGAAAGACCA ATAA 

This corresponds to the amino acid sequence <SEQ ID 990; ORF 253>: 

m253.pep 

1 MIDRNRMLRE TLERVRAGS F WLWWAATFA FFTGFS VTYL LMDNQGLNFF 

51 LVLAGVLGMN TLMLAVW LAM LFLRVKVGRF FSSPATWFRG KDPVNQAVLR 

101 LYADEWRQPS VRWKIGATSH SLW LCTLLGM LVSVLLLLLV RQYTFNWEST 

151 LLSNAASVRA VEMLAWLPSK LGFPVPDARA VIEGRLNGNI ADARAWSGLL 

201 VGSIACYGIL PRLLAW WCK ILLKTSENGL DLEKPYYQAV IRRWQNKITD 

251 ADTRRE TVS A VSPKIILNDA PKWAVMLETE WQDGEWFEGR LAQEWLDKGV 

3 01 ATNREQVAAL ETELKQKPAQ LLIGVRAQTV PDRGVLRQIV RLSEAAQGGA 

3 51 WQLLAEQGL SDDLSEKLEH WRNALAE CG A AWLEPDRAAQ EGRLKDQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 253 shows 94.7% identity over a 397 aa overlap with a predicted ORF (ORF 253.ng) 
from N. gonorrhoeae: 

m253/g253 

10 20 30 40 50 60 

MIDRNRMLRETLERVRAGSFWLWVVAATFAFFTGFSVTYLLMDNQGLNFFLVLAGVLGMN 

IMhlllhlMMIIIIIIIIIhh: I :||| IIIIIIIIIIIMMIIIIMII 

MIDRDRMLRDTLERVRAGSFWLWVVVASMMFTAGFSGTYLLMDNQGLNFFLVLAGVLGMN 
10 20 30 40 50 60 

70 80 90 100 110 120 

TLMLAVWLAMLFLRVKVGRFFSSPATWFRGKDPVNQAVLRLYADEWRQ PS VRWKIGATSH 

Mlllllll 1 1 M I i i 1 1 1 1 1 1 1 1 1 1 1 il I ; 1 1 1 M 1 1 1 1 1 1 •■! II 1 1 1 1 1 1 1 M : 

TLMLAVWLATLFLRVKVGRFFSSPATWFRGKGPVNQAVLRLYADQWRQPSVRWKIGATAH 
70 80 90 100 110 120 

130 140 150 160 170 180 

SLWLCTLLGMLVSVLLLLLVRQYTFNWESTLLSNAASVRAVEMLAWLPSKLGFPVPDARA 

I M I M I II ;l I M II I IMIMIMIIII III M II IIMIII 'MlilM II IMI , 

SLWLCTLLGMLVSVLLLLLVRQYTFNWESTLLSNAASVRAVEMLAWLPSKLGFPVPDARA 
130 140 150 160 170 180 
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m253 .pep 
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m253 -pep 
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190 200 210 220 230 240 

m253 . pep VI EGRLNGNI ADARAWSGLLVGS I AC YG I LPRLLAWWCKI LLKTSENGLDLEKPYYQAV 

MMIM IMMIMM I MIMIMIM MIIIMI M II Ml M MIMM M I'll 

9253 VI EGRLNGNI ADARAWSGLLVGS I VCYGILPRLLAWWCKILLKTSENGLDLEKTYYQAV 

190 200 210 220 230 240 

250 260 270 280 290 300 

m2 53 . pep I RRWQNKITDADTRRE TVS AVSPKI I LNDAPKWAVMLETEWQDGEWFEGRLAQEWLDKGV 

M M I M 1 1 M M 1 1 1 1 M 1 1 1 1 1 h 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 h 1 1 1 I 1 1 I i 1 1 1 1 1 1 1 

9 253 IRRWQNKITDADTRRETVSAVSPKIVLNDAPKWALMLETEWQDGQWFEGRLAQEWLDKGV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m253 . pep ATNREQVAALETELKQKPAQLLIGVRAQTVPDRGVLRQIVRLSEAAQGGAWQLLAEQGL 

I : M 1 1 1 1 1 1 M 1 1 1 II I M I M 1 1 1 M 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 ( 

g253 AANREQVAALETELKQKPAQLLIGVRAQTVPDRGVLRQIVRLSEAAQGGAWQLLAEQGL 

310 320 330 340 350 360 

370 380 390 

m253 .pep SDDLSEKLEHWRNALAECGAAWLEPDRAAQEGRLKDQX 

Ml I II I IIIIMIhllll Mill Ihllll Mill 

92 53 SDDLSEKLEHWRNALTECGAAWLEPDRVAQEGRLKDQX 

370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 991: 



a253.seq 






1 


ATGATCGACA 


GGAACCGTAT 


51 


GGGGTCGTTC 


TGGTTGTGGG 


101 


GTTTTTCAGT 


TACTTATCTT 


151 


TTGGTTTTGG 


CGGGCGTGTT 


201 


GTTGGCAATG 


TTGTTCCTGC 


251 


CGGCGACGTG 


GTTTCGGGGC 


301 


CTGTATGCGG 


ACGAGTGGCG 


351 


' AAC GTCGCAC 


AGCCTGTGGC 


401 


TATTGTTGCT 


GCTTTTGGTG 


451 


CTGTTGGGCG 


ATTCGTCTTC 


501 


GCCTGCGAAA 


CTGGGTTTTC 


551 


GTCGTCTGAA 


CGGCAATATT 


601 


GTCGGCAGTA 


TCGCCTGCTA 


651 


GGTATGCAAA 


ATCCTTTTGA 


701 


AGCCCTATTA 


TCAGGCGGTC 


751 


GCGGATACGC 


GTCGGGAAAC 


801 


GAACGATGCG 


CCGAAATGGG 


851 


GCGAATGGTT 


CGAGGGCAGG 


901 


GCCGCCAATC 


GGGAACAGGT 


951 


ACCGGCGCAA 


CTGCTTATCG 


1001 


GCGTGTTGCG 


GCAGATCGTC 


1051 


GTGGTGCAGC 


TTTTGGCGGA 


1101 


GCTGGAACAT 


TGGCGTAACG 


1151 


AACCCGACAG 


AGCGGCGCAG 


1201 


TGA 





GCTGCGGGAG 
TGGCGGCGGC 
CTAATGGACA 
GGGCATGAAT 
GCGTGAAAGT 
AAAGACCCTG 
GCAACCTTCG 
TCTGCACGCT 
CGGCAATATA 
GGTACGGCTG 
CCGTGCCTGA 
GCCGATGCGC 
CGGCATCCTG 
AAACAAGCGA 
ATCCGCCGCT 
CGTGTCCGCC 
CGGTCATGCT 
CTGGCGCAGG 
TGCCGCGCTG 
GCGTGCGCGC 
CGACTTTCGG 
ACAGGGGCTT 
CGCTGACCGA 
GAAGGCCGTC 



ACGTTGGAAC 
GACGTTTGCG 
ATCAGGGTCT 
ACGCTGATGC 
GGGGCGTTTT 
TCAATCAGGC 
GTACGTTGGA 
GCTCGGAATG 
CGTTCAACTG 
GTGGAAATGT 
TGCGCGGGCG 
GGGCTTGGTC 
CCGCGCCTCT 
AAACGGCTTG 
GGCAGAACAA 
GTTTCGCCGA 
GGAGACCGAA 
AATGGCTGGA 
GAGACAGAGC 
CCAAACTGTG 
AAGCGGCGCA 
TCAGACGACC 
ATGCGGCGCG 
TGAAAACCAA 



GTGTGCGTGC 
TTTTTTACCG 
GAATTTCTTT 
TGGCAGTATG 
TTCAGCAGTC 
GGTGTTGCGG 
AAATAGGCGC 
CTGGTGTCGG 
GGAAAGCACG 
TGGCATGGCT 
GTCATCGAAG 
GGGGCTGCTG 
TGGCTTGGGC 
GATTTGGAAA 
AATCACCGAT 
AAATCGTCTT 
TGGCAGGACG 
TAAGGGCGTT 
TGAAGCAGAA 
CCCGACCGCG 
GGGCGGCGCG 
TTTCGGAAAA 
GCGTGGCTGG 
CGACCGCACT 



This corresponds to the amino acid sequence <SEQ ID 992; ORF 253. a>: 

a253 . pep 

1 MIDRNRMLRE TLERVRAGSF WLWVAAATFA FFTGFSVTYL LMDNQGLNFF 

51 LVLAGVLGMN TLMLAV WLAM LFLRVKVGRF FSSPATWFRG KDPVNQAVLR 

101 LYADEWRQPS VRWKIGATSH SLW LCTLLGM LVSVLLLLLV RQYTFNWEST 

151 LLGDSSSVRL VEMLAWLPAK LGFPVPDARA VIEGRLNGNI ADARAWSGLL 

201 VGSIACYGIL PRLLA WAVCK ILLKTSENGL DLEKPYYQAV IRRWQNKITD 

251 ADTRRETVSA VSPKIVLNDA PKWAVMLETE WQDGEWFEGR LAQEWLDKGV 

301 AANREQVAAL ETELKQKPAQ LLIGVRAQTV PDRGVLRQIV RLSEAAQGGA 

351 WQLLAEQGL SDDLSEKLEH WRNALTECGA AWLEPDRAAQ EGRLKTNDRT 

401 * 
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m253/a253 97.2% identity in 395 aa overlap 

10 20 30 40 50 60 

m253 . pep MIDRNRMLRETLERVRAGSFWLWWAATFAFFTGFSVTYLLMDNQGLNFFLVLAGVLGMN 
I I I M I I I I I I I I I I I I t I I I I I I : I I I I I I II I I I I I | I I i | I I I t | | I | | | | | I | | | | 
a253 MIDRNRMLRETLERVRAGSFWLWVAAATFAFFTGFSVTYLLMDNQGLNFFLVLAGVLGMN 

10 20 30 40 50 60 

70 80 90 100 110 120 

rr\253.pep TLMLAVWLAMLFLRVKVGRFFSSPATWFRGKDPVNQAVLRLYADEWRQPSVRWKIGATSH 
I I I M I I I I H I I I I I I I I I I ! I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I | I I I | | 
a253 TLMLAVWLAMLFLRVKVGRFFSSPATWFRGKDPVNQAVLRLYADEWRQPSVRWKIGATSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m253 . pep SLWLCTLLGMLVSVLLLLLVRQYTFNWESTLLSNAASVRAVEMLAWLPSKLGFPVPDARA 
M I I I I I I I I I I I I I I I I I I I I f I I M I I i I I :::: I I j I I I I I ] I I : i I I I I I I I I I I 
a253 SLWLCTLLGMLVSVLLLLLVRQYTFNWESTLLGDSSSVRLVEMLAWLPAKLGFPVPDARA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m253 . pep VIEGRLNGNIADARAWSGLLVGSIACYGILPRLLAWVVCKILLKTSENGLDLEKPYYQAV 
M I I I ! I I I I I I I I I I I I I I | | I | ! | | | | | | j | || | : | | | | | | | | || | | | | | | | | | | | || 
a253 VIEGRLNGNIADARAWSGLLVGSIACYGILPRLLAWAVCKILLKTSENGLDLEKPYYQAV 

190 200 210 220 230 240 

250 260 270 280 290 300 

m253 . pep IRRWQNKITDADTRRETVSAVSPKIILNDAPKWAVMLETEWQDGEWFEGRLAQEWLDKGV 
I I i I I I I I I I I I I I I I I I I I I I I I I: I I I I I I I I I I I I I I I I I i I I I I I I I I M I I I I I I 
a253 IRRWQNKITDADTRRETVSAVSPKIVLNDAPKWAVMLETEWQDGEWFEGRLAQEWLDKGV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m253 . pep ATNREQVAALETELKQKPAQLLIGVRAQTVPDRGVLRQIVRLSEAAQGGAVVQLLAEQGL 

I: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I i I I I I I I 

a253 AANREQVAALETELKQKPAQLLIGVRAQTVPDRGVLRQIVRLSEAAQGGAVVQLLAEQGL 

310 320 330 340 350 360 

370 380 390 

m253 . pep SDDLSEKLEHWRNALAECGAAWLEPDRAAQEGRLKDQX 

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
a253 SDDLSEKLEHWRNALTECGAAWLEPDRAAQEGRLKTNDRTX 

370 380 390 400 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 993>: 

agccatttga gcggtttgat 

gctgaaaacc ataggacacg 

tttacggcat cagccttctt 

ggaattgcag ccggaaaact 

catgatttat gtgctgattg 

ctttgagaaa cgggccgggc 

gcggctgcag gaatcgcaca 

acgtctgctg tctattgcga 

cggtaatgaa atccctgaca 

ctggcggcag gcggtatgct 

cgatgaaaaa atccgacacg 

gcggcagcat aacccaattt 

This corresponds to the amino acid sequence <SEQ ID 994; ORF 254.ng>: 
g254.pep 

1 MYAGERFNTY SHLSGLILAA AGLMLMLLKT IGHGDGYRIF SVSVYGISLL 



g254 . seq 








1 


atgtatgcag 


gcgaacgctt 


caatacttac 


51 


tctggcggcg 


gcaggtttga 


tgctgatgct 


101 


gggacggata 


ccgtatcttc 


agcgtatcgg 


151 


ctgctctatt 


tgagttcctc 


gctgtaccac 


201 


gaaaagcatt 


ttgaaaaaaa 


ccgaccactg 


251 


ccggaagcta 


cacaccgttt 


gcactggttt 


301 


tggacggtat 


tttcactgtc 


ctggctgctg 


351 


agaactcacc 


atcggacgga 


aaagcgaaaa 


401 


tttatatcgt 


aatgggctgg 


atggtcttgg 


451 


gcctcactcc 


cgccggcagg 


actggcttgg 


501 


gtacagcgtc 


ggcatttact 


ggtttgtaaa 


551 


ggcacggaat 


ctggcatctg 


ttcgtattgg 


601 


gtcagcgtgt 


acggttatgt 


aatctga 
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51 LLYLS SSLYH GIAAGKLKSI LKKTDHCMIY VLIAGSYTPF ALVSLRNGPG 

101 WTVFSLS WLL AAAGIAQELT IGRKS EKRLL SIAIYIVMGW MVLAVMKSLT 

151 ASLPPAGLAW LAAGGMLYSV GIYWFVNDEK IRHGHGIWHL FVLGGSITQF 

201 VSVYGYVI* 



m254 . seq 


(partial) 










1 


. . GTATCGGTTT 


ACGGCATCAG 


CCTTCTTCTG 


CTCTATTTGA 


GTTCCTGGCT 


51 


GTACCACGGA 


ATTGCAGCCG 


GAAAACTGAA 


AAGCATTTTG 


AAAAAAACCG 


101 


ACCACTGCAT 


GATTTATGTG 


CTGATTGCCG 


GAAGCTACAC 


ACCGTTTGCA 


151 


CTGGTTTCTT 


TGAGAAACGG 


GCCGGGCTGG 


ACGGTATTTT 


CACTGTCCTG 


201 


GCTGCTGGCG 


GCTGCAGGAA 


TCGCACAAGA 


ACTCACCATC 


GGACGGAAAA 


251 


GCGAAAAACG 


TCTGCTGTCT 


ATTGTGATTT 


ATGTCGTCAT 


GGGTTGGATG 


301 


GTCTTGGCGG 


TAATGAAATC 


CCTGACAGCC 


TCACTCCCGT 


CGGCAGGACT 


351 


GGCTTGGCTG 


GCGGCAGGCG 


GTATGCTGTA 


CAGTGTCGGC 


ATTTACTGGT 


401 


TTGTAAACGA 


TGAAAAAATC 


CGACACGGGC 


ACGGAATCTG 


GCATCTGTTC 


451 


GTATTGGGCG 


GCAGCATCAC 


CCAATTTGTC 


AGCGTGTACG 


GTTACGTAAT 


501 


CTGA 











This corresponds to the amino acid sequence <SEQ ID 996; ORF 254>: 

m254.pep (partial) 

1 - . VSVYGISLLL LYLS SWLYHG IAAGKLKSIL KKTDHCMIYV LIAGSYTPFA 

51 LVSLRNGPGW TVFSLSW LLA AAGIAQELTI GRKS EKRLLS IVIYWMGWM 

101 VLAVMKSLTA SLPSAGLAWL AAGGMLYSVG IYWFVNDEKI RHGHGIWHLF 

151 VLGGSITQFV SVYGYVI* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 254 shows 97.6% identity over a 167 aa overlap with a predicted ORF (ORF 254.ng) 
from N. gonorrhoeae: 

m254/g254 



10 20 30 

m254 .pep VS VYGI SLLLLYLSS WLYHGI AAGKLKS I L 

I Mill II I II M II Illlllllllllll 

g254 HLSGLILAAAGLMLMLLKTIGHGDGYRIFSVSVYGISLLLLYLSSSLYHG IAAGKLKSIL 

20 30 40 50 60 70 

40 50 60 70 80 90 

m254 . pep KKTDHCMI YVLI AGS YTPFALVSLRNGPGWTVFSLSWLLAAAGIAQELTI GRKSEKRLLS 

1 1 M 1 1 M 1 1 1 1 1 II II 1 1 Ml 1 1 1 II 1 1 1 1 1 i 1 1 M 1 1 1 1 1 ! II i I II il 1 1 1 II 1 1 M 

g2 54 KKTDHCMI YVL I AGS YTPFALVS LRNGPGWTVFS LS WLLAAAG I AQELTI GRKS E KRLLS 

80 90 100 110 120 130 

100 110 120 130 140 150 

m2 54 . pep IVI YWMGWMVLAVMKSLTASLPSAGLAWLAAGGMLYSVGI YWFVNDEKI RHGHGIWHLF 

hlhlllllllMIIIMIIII 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 II 1 1 II 1 1 II I II 1 1 

g254 IAIYIVMGWMV^^KSLTASLPPAGLAWLAAGGMLYSVGIYWFTODEKIRHGHGIWHLF 
140 150 160 170 180 190 



160 

m2 5 4 . pep VLGGS ITQFVSVYGYVIX 

I II I I I I I I I I I I I I I I I 
g254 VLGGS ITQFVSVYGYVIX 

200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 997>: 

a254 . seq 

1 AT G TAT AC AG GCGAACGCTT CAATACTTAC AGCCATTTGA GCGGTTTGAT 

51 TCTGGCGGCG GCAGGTTTGG CGCTGATGCT GCTGAAAACC ATAGGACACG 

101 GGGACGGCTA CCGTATCTTC AGCGTATCGG TTTACGGCAT CAGCCTTCTT 

151 CTGCTCTATT TGAGTTCCTC GCTGTACCAC GGAATTGCAG CCGGAAAACT 

201 GAAAAGCATT TTGAAAAAAA CCGACCACTG CATGATTTAT GTGCTGATTG 

251 CCGGAAGCTA CACACCGTTT GCACTGGTTT CTTTGAGAAA CGGGCCGGGC 
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301 TGGACGGTAT TTTCACTGTC CTGGCTGCTG GCGGCTGCAG GAATCGCACA 

351 AGAACTCACC ATTGGACGGA AAAGCGAAAA ACGACTGCTG TCTATTGCGA 

4 01 TTTATATCGT AATGGGCTGG ATGGTCTTGG CGGTAATGAA ATCCCTGACA 

4 51 GCCTCACTCC CGCCGGCAGG ACTGGCTTGG CTGGCGGCAG GCGGTATGCT 

501 GTACAGCGTC GGCATTTACT GGTTTGTAAA CGATGAAAAA ATCCGACACG 

551 GGCACGGAAT CTGGCATCTG TTCGTATTGG GCGGCAGCAT CACCCAATTT 

601 GTCAGCGTGT ACGGTTACGT AATCTGA 



This corresponds to the amino acid sequence <SEQ ID 998; ORF 254. a>: 

a254 .pep 

1 MYTGERFNTY SHLSGLILAA AGLALMLLKT IGHGDGYRIF SVSVYGISLL 

51 LLYLSSSLYH G IAAGKLKSI LKKTDH CMIY VLIAGSYTPF ALV SLRNGPG 

101 WTVFSLSWLL AAAGIAQELT IGRKSEKR LL SIAIYIVMGW MVLAV MKSLT 

151 AS LP PAG LAW LAAGGMLYSV GIYWFVNDEK IRHGHGIWHL FVLGGSITQF 

201 VSVYGYVI* 

m254/a254 97.6% identity in 167 aa overlap 

10 20 

30 

m254.pep 

VSVYGISLLLLYLSSWLYHGIAAGKLKSIL 

I II I I I I I ! I I I I I i 

I I I I I I I I I I I I I I 
a254 

HLSGLILAAAGLALMLLKTIGHGDGYRIFSVSVYGISLLLLYLSSSLYHGIAAGKLKSIL 

20 30 40 50 60 

70 

40 50 60 70 80 

90 

m254 .pep 

KKT DHCMI YVL IAGS YT PFALVSLRNGPGWTVFSLSWLLAAAGI AQELT I GRKS EKRLLS 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

a254 

KKTDHCMIYVLIAGSYTPFALVSLRNGPGWTVFSLSWLLAAAGIAQELTIGRKSEKRLLS 

80 90 100 110 120 

130 



100 110 120 130 140 

150 

m254 . pep 

IVIYVVMGWMVLAVMKSLTASLPSAGLAWLAAGGMLYSVGIYWFVNDEKIRHGHGIWHLF 

I : I I : I I I I I I I I I I I I I I I I I I 
I I I II I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I 
a254 

IAIYIVMGWMVLAVMKSLTASLPPAGLAWLAAGGMLYSVGIYWFVNDEKIRHGHGIWHLF 

140 150 160 170 180 

190 



160 

m254 . pep VLGGSITQFVSVYGYVIX 
I I I I i I I I I I I I II I I I I 
a254 VLGGSITQFVSVYGYVIX 

200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 999>: 
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g255 . seq 








1 


atggttggac 


aggaagcctt 


gcggggtcag 


51 


cgcgttgcgt 


tacgctgtca 


aaacctgcgc 


101 


gcgttgatgc 


ccatcatcgc 


gtaggcgatt 


151 


aacgggttcg 


cccaaaccga 


cggggacgtt 


201 


tcgcgccgac 


ggaatccaag 


gatttgcgca 


251 


agttcggcga 


tttggctttg 


gttggcggca 


301 


gtgttcgctg 


ccttcaaacc 


ggattttttt 


351 


aggcggtgat 


ttccgtgccg 


aatttttctt 


401 


gctccggcgg 


caacgcgggc 


tgcggtttcg 


451 


gccccggtag 


tcgcgcgtac 


cgtatttgtg 


501 


gtccggggcg 


gaaggcggtg 


gcgatgtcgc 


551 


tcggtgttgc 


ggattag 





tttgggaaat 
tgggtaacgt 
tttggcaacg 
tcctgccgcc 
tagtcggcgt 



This corresponds to the amino acid sequence <SEQ ID 1000; ORF 255.ng>: 

g255 .pep 

1 MVGQEALRGQ FVAVFA AALR YAVKTCADFH AFDGVDAHHR VGDFGIEAVE 

51 NGFAQTDGDV GGFDMQFRAD GIQGFAHTVH IVFQFGDLAL VGGKKRILGN 

101 VFAAFKPDFF FADLGNVGGD FRAEFFFQPF FGNGSGGNAG CGFAGGTPAA 

151 APWARTVFV PIGIVGV SGA EGGGDVAWF AALVGVAD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1001 >: 



m255 . seq 




1 


GTGGTTGGAC 


51 


CGCGTTGCGT 


101 


GCGTTGATGC 


151 


AACAGGTTCG 


201 


TCGCGCCGAC 


251 


AGCTCGGCAA 


301 


GTGTTCGCAG 


351 


AGGCGGTGAT 


401 


GCTCCGGCAG 


451 


GCCGCGGTAG 


501 


GGCCGGGGCG 


551 


TCGGTATTGC 



This corresponds to the amino acid sequence <SEQ ID 1002; ORF 255>: 

m2 55 .pep 

1 WGQEALRGQ FVAVFAAALR YAVKTCADFH AFDGVDAHHR VGDFGIEAVK 

51 NRFAQADRDI GCFDMQLRAD GIQGFAHAVH IVFQLGNLAM VGGKKRILGN 

101 VFAAFKPDFF FADLGNVGGD FRAEFFFQPF FGNGSGSNAG GGFTGGAPAA 

151 AAWARAVFV PIGIVGVAGA EAGGDVAWF AALVGIAD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 255 shows 88.8% identity over a 188 aa overlap with a predicted ORF (ORF 255.ng) 
from N. gonorrhoeae: 

m255/g255 

10 20 30 40 50 60 

m2 5 5 . pep WGQEALRGQFVAVFAAALRYAVKTCADFHAFDGVDAHHRVGDFGI EAVKNRFAQADRD I 

:| 1 1 Mill! MIIM! Mill ' MINI I III Mill Mill IMMh; ! Ih' h 

g2 5 5 MVGQEALRGQ FVAVFAAALRYAVKT CAD FHAFDGVDAHHRVGDFG I EAVENGFAQTDGDV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m255 pep GCFDMQLRADG I QGFAHAVHIVFQLGNLAMVGGKKR I LGNVFAAFKPDFF FADLGNVGGD 

I I II I M M M M M h M M I h h M M 1 1 1 1 II I II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

g255 GGFDMQFRADG I QGFAHTVH I VFQFGDLALVGGKKR I LGNVFAAFKPDFF FADLGNVGGD 

70 80 90 100 110 120 

130 140 150 160 170 180 

m255 .pep FRAEFFFQPFFGNGSGSNAGGGFTGGAPAAAAWARAVFVPIGIVGVAGAEAGGDVAWF 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 : 1 1 ; 1 1 1 1 I II h 1 1 M 1 1 II I h ! I h 1 1 1 1 1 1 1 1 

g255 FRAEFFFQPFFGNGSGGNAGCGFAGGTPAAAPWARTVFVPIGIVGVSGAEGGGDVAWF 
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130 



140 



150 



160 



170 



180 



189 

m2 5 5 . pep AALVGI ADX 

llllhlll 
g255 AALVGVADX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1003>: 

a255 . seq 

1 GTGGTTGGAC AGGAAGCCTT GCGGGGTGAG TTCGTCGCCG TGTTCGCTGC 

51 CGCGTTGCGT TACGCTGTCA AAACCTGCGC CGATTTCCAC GCCTTTGACG 

101 GCGTTGATGC CCATCATGGC GTAGGCGATT TCGGCATCGA GGCGGTCGAA 

151 TACGGGTTCG CCCAAGCCGA CGGGGACGTT GGCGGCTTCA ATATGCAGCT 

201 TCGCGCCGAC GGAATCCAAG GATTTGCGCA CGCTGTCCAT ATAGTTTTCC 

251 AGCTCGGCAA TTTGGCTATG GTTGGCGGCA AAAAAAGGAT TTTGGGAAAT 

301 GTGTTCGCAG CCTTCAAACC GGATTTCTTT TTCGCCGACT TGGGTAACGT 

351 AGGCGGTGAT TTCCGTGCCG AATTTTTCTT TCAACCATTT TTTGGCAACG 

4 01 GCTCCGGCGG CAACGCGGGC GGCGGTTTCG CGGGCGGAAC TCCTGCCGCC 

4 51 GCCCCGGTAG TCGCGCGTGC CGTATTTGTG CCAATAGGTA TAGTCGGCGT 

501 GGCCGGGGCG GAAGCTGGTG GCGATGTTGC CGTAGTCTTT GCTGCGCTGG 

551 TCGGTATTGC GGATTAA 

This corresponds to the amino acid sequence <SEQ ID 1004; ORF 255. a>: 

a255 . pep 

1 WGQEALRGE FVAVFA AALR YAVKTCADFH AFDGVDAHHG VGDFGIEAVE 

51 YGFAQADGDV GGFNMQLRAD GIQGFAHAVH IVFQLGNLAM VGGKKRILGN 

101 VFAAFKPDFF FADLGNVGGD FRAEFFFQPF FGNGSGGNAG GGFAGGTPAA 

151 APWARAVFV PIGIVGVAG A E AGGDVAWF AALVGIAD * 

m255/a255 93.1% identity in 188 aa overlap 

10 20 30 40 50 60 

m255 . pep WGQEALRGQFVAVFAAALRYAVKTCADFHAFDGVDAHHRVGDFGIEAVKNRFAQADRDI 
I li M i I II: IMII II I I I I I I Ml II M I I j | M Ml MIIIMM: Mill |: 
a255 WGQEALRGE FVAVFAAALR YAVKTCAD FHAFDG VDAHHGVGD FG I E AVE YG FAQADG D V 

10 20 30 40 50 60 

70 80 90 100 110 120 

m255 . pep GCFDMQLRADGIQGFAHAVHIVFQLGNLAMVGGKKRILGNVFAAFKPDFFFADLGNVGGD 
I I : I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II | | M I I M I I 
a255 GG FNMQLRADG I QG FAHAVH I V FQLGNLAMVGGKKR I LGN VFAAFK P D FFFADLGN VGG D 

70 80 90 100 110 120 

130 140 150 160 170 180 

m255 . pep FRAE FFFQ P FFGNG S G S N AGGG FT GG A P AAAA WARAV FV P I G I VG V AG AE AGG D V A W F 

I I I I I M I M I I I ! M : M I I I I : M : i I I I I I M I M I I I I I I I I M I M I I I I I M I 
a255 FRAEFFFQPFFGNGSGGNAGGGFAGGTPAAAPWARAVFVPIGIVGVAGAEAGGDVAWF 

130 140 150 160 170 180 



189 

m2 55 . pep AALVGI ADX 

I I II I I I I I 
a255 AALVGI ADX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1005>: 

g256.aeq 

1 atgctcgcgg tacgcaatcg gggttggcac ggcgcagtcg tccatttccg 

51 cagctgcggc ggcgtagcga acaccgcccc ggtgttctac cacttgggtg 

101 ataccgccga aatcgccttt gctttggaca cgctcaccgc gcgttaccgt 

151 gaaatatacg ccgtcggcgt atcgctgggc ggcaacgcgc cggcaaaata 

201 tttgggcgaa cagggcaaaa aggcattgcc gcacgcctcg gccgccgtat 

251 ccgcccccgt tgatgcagag gcggcaggca gccgcttcga cagcggcatc 

301 acgcggctgc tctacacgcg ctacttcctc cgcacactga tacccaaagc 
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351 acgttcgctc caaggttttc agacggcatt tgccgcaggg tgcaaaacac 

401 tgggcgagtt tgacgaccgt ttcaccgcac cgctgcacgg ctttgccgac 

451 cggcacgact actaccgcca aacttcctgc aaaccgctgc tcaaacacgt 

501 tgccaaaccg ctgctcctgc tcaatgccgc caacgacccc ttcctgccgc 

551 ccgaagccct gccccgtgca gacgaagcgt ccgaagccgt taccctgttc 

601 caacctgcac acggcgggca cgccggcttt gtcagcagca ccggcggcag 

651 gctgcacctg caatggctgc cgcagaccgt cctgtcctat tttgacagct 

701 tccgcacaaa caggcgttaa 

This corresponds to the amino acid sequence <SEQ ID 1006; ORF 256.ng>: 

g2 56.pep 

1 MLAVRNRGWH GAWHFRSCG GVANTAPVFY HLGDTAEIAF ALDTLTARYR 

51 EIYAVGVSLG GNAPAKYLGE QGKKALPHAS AAVSAPVDAE AAGSRFDSGI 

101 TRLLYTRYFL RTLIPKARSL QGFQTAFAAG C KTLGEFDDR FTAPLHGFAD 

151 RHDYYRQTSC KPLLKHVAKP LLLLNAANDP FLPPEALPRA DEASEAVTLF 

201 QPAHGGHAGF VSSTGGRLHL QWLPQTVLSY FDSFRTNRR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1007>: 

m256.seq 

1 ATGCTTGCGG TACGCGATCG GGGTTGGCAC GGCGTAGTCG TCCATTTCCG 

51 CAGCTGCGGC GGCATTGCCA ACACCGCTCC GGTGTTCTAC CA. CTtGGCG 

101 ATACCGCCGA AATCGCCTTT ACTTTGGACA CGTTCGCCGC GCGTTACCGT 

151 GAAAtATACG CCGTCGGCGT ATCGCTGGGC GGCAACGCGC TGGCAAAATA 

2 01 TTTGGGCGAA CAGGGCAAAA AGGCATTGCC GCAAGCCGCT GCCGTCATCT 

2 51 CCGCCCCCGT CGATGCAGAG GCGGCAGGCA GACGCTTCGA CAGCGGCATC 

3 01 ACGCGGCTGC TCTACACGCG CTACTTCCTC CGCACCCTGA TACCCAAAGC 
351 AAAATCGCTC CAAGGTTTTC AGACGGCATT TGCCGCAGGG TGCAAAACAC 

4 01 TGGGCGAGTT TGACGACCGC TTCACCGCAC CGCTGCACGG CTTTGCCGAC 
4 51 CGGCACGACT ACTACCGCCA AACTTCCTGC AAACCGCTGC TCAAACACGT 
501 TGCCAAACCG CTGCTCCTGC TCAATGCCGT CAACGACCCC TTCCTGCCGC 
551 CCGAAGCCCT GCCCCGCGCA GACGAAGTAT CCGAAGCCGT TACCCTGTTC 
601 CAGCCGGCAT ATGGTGGTCA TGTCGGCTTT GTCAGCAGCA CCGGCGGCAG 
651 GCTGCACCTG CAATGGCTGC CGCAGACCGT CCTGTCCTAT TTCGACAGCT 
701 TCCGCACAAA CAGGCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1008; ORF 256>: 

m256 .pep 

1 MLAVRDRGWH GVWHFRSCG GIANTAPVFY XLGDTAEIAF TLDTFAARYR 

51 EIYAVGVSLG GNALAKYLGE QGKKALPQAA AVISAPVDAE AAGRRFDSGI 

101 TRLLYTRYFL RTLIPKAKSL QGFQTAFAAG C KTLGEFDDR FTAPLHGFAD 

151 RHDYYRQTSC KPLLKHVAKP LLLLNAVNDP FLPPEALPRA DEVSEAVTLF 

201 QPAYGGHVGF VSSTGGRLHL QWLPQTVLSY FDSFRTNRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 256 shows 92.9% identity over a 239 aa overlap with a predicted ORF (ORF 256.ng) 
from N. gonorrhoeae: 

m256/g256 

10 20 30 40 50 60 

m256.pep MLAVRDRGWHGVWHFRSCGGIANTAPVFYHLGDTAEIAFTLDTFAARYREIYAVGVSLG 

< i M: ! Ill I : NlMlh |:|| III [ h I I I I M I II MM -I I I I ! | || MM 
9256 MLAVRNRGWHGAWHFRSCGGVANTAPVFYHLGDTAE I AFALDTLTARYRE I YAVGVSLG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m256 . pep GNALAKYLGEQGKKALPQAAAVISAPVDAEAAGRRFDSGITRLLYTRYFLRTLIPKAKSL 

HI IMMIIMMIh|:|::||||MIIII I I I I I I II I I I I I I I I I I I I I I I s | | 
9 2 5 6 GNAP AKYLGEQGKKALPHAS AAVS AP VDAEAAGSRFDS G I TRLL YTR YFLRTL I P KARS L 

7 0 80 90 100 110 120 

^0 140 150 160 170 .180 

m2 5 6 . pep QGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAVNDP 

M M M I MMMMIMIMMMIIMMM M MM Ml IMM MIMMIMIM 

9 2 5 6 QGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAANDP 
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130 140 150 160 170 180 

190 200 210 220 230 240 

m2 5 6 . pep FLPPEALPRADEVSEAVTLFQPAYGGHVGFVSSTGGRLHLQWLPQTVLSYFDSFRTNRRX 

IIMIMMII|:|MIIIMII:||hllllMIIIMIIMIIIIIIIIIIII||||| 

9 2 5 6 FLPPEALPRADEASEAVTLFQPAHGGHAGFVSSTGGRLHLQWLPQTVLSYFDSFRTNRRX 

190 200 210 220 230 



a256 sea 






1 


ATGCTCGCGG 


TACGCGATCG 


51 


CAGCTGCGGC 


GGCGTAGCGA 


101 


ATACCGCCGA 


AATTGCCTTT 


151 


GAAATATACG 


CCGTCGGCGT 


201 


TTTGGGCGAA 


CAGGGCGAAA 


251 


CCGCACCCGT 


CGATGCAGAG 


301 


ACACGGCTGC 


TCTACACGCG 


351 


ACGGTCGCTC 


CAAGGTTTTC 


401 


TGGGCGAGTT 


TGACGACCGT 


451 


CGGCACGACT 


ACTACCGCCA 


501 


TGCCAAACCG 


CTGCTCCTGC 


551 


CCGAAGCGCT 


GCCCCGCGCA 


601 


CAGCCGACAC 


ACGGTGGTCA 


651 


GCTGCACCTG 


CAATGGTTGC 


701 


TCCGCACAAA 


CAGGCGTTAA 



AACTTCCTGC AAACCGCTGC TCAAACACGT 
TCAATGCCGT CAACGACCCC TTCCTGCCGC 
GACGAAGTGT CCGAAGCCGT TACCCTGTTC 
TGTCGGCTTT GTCGGCAGCA CCGGCGGCAG 
CGCAGACCGT CCTGTCCTAT TTCGACAGCT 

This corresponds to the amino acid sequence <SEQ ID 1010; ORF 256.a>: 

a256.pep 

1 MLAVRDRGWN GVWHFRSCG GVANTAPVFY HLGDTAEIAF TLDTLAARYR 

51 EIYAVGVSLG GNALAKYLGE QGENALPQAA AVISAPVDAE AAGNRFDSGI 

101 TRLLYTRYFL RTLIPKARSL QGFQTAFAAG CKTLGEFDDR FTAPLHGFAD 

151 RHDYYRQTSC KPLLKHVAKP LLLLNAVNDP FLPPEALPRA DEVSEAVTLF 

201 QPTHGGHVGF VGSTGGRLHL QWLPQTVLSY FDSFRTNRR* 

m256/a256 95.4% identity in 239 aa overlap 

10 20 30 40 50 60 

m256 . pep MLAVRDRGWHGVVVHFRSCGGIANTAPVFYXLGDTAEIAFTLDTFAARYREIYAVGVSLG 
I M I I I I I I : I i I I I I I I | I | : | | | | | | | | I I I I I M I I I I I I : I I | | | | | | | | | | | | | 
MLAVRDRGWNGVWHFRSCGGVANTAPVFYHLGDTAEIAFTLDTLAARYREIYAVGVSLG 
10 20 30 40 50 60 



a256 



70 80 90 100 110 120 

m256 . pep GNALAKYLGEQGKKALPQAAAVISAPVDAEAAGRRFDSGITRLLYTRYFLRTLIPKAKSL 
IMIIIIMIII::MIIMMIIIIIHIMI I I! M III Ml I I II i ! I M II f: I I 
^256 GNALAKYLGEQGENALPQAAAVISAPVDAEAAGNRFDSGITRLLYTRYFLRTLIPKARSL 

7 0 80 90 100 110 120 

130 140 150 160 170 180 

m256 . pep QGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAVNDP 

1 1 I 1 1 1 1 1 N 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j | 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 

QGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAVNDP 
130 140 150 160 170 180 



a256 



190 200 210 220 230 240 

m256 . pep FLPPEALPRADEVSEAVTLFQPAYGGHVGFVSSTGGRLHLQWLPQTVLSYFDSFRTNRRX 
Ml M I I Mi I I I I I I I I Ml I:: M I I I I I: I M I I It II I M I II I III I 1 I I I II I i 
FLPPEALPRADEVSEAVTLFQPTHGGHVGFVGSTGGRLHLQWLPQTVLSYFDSFRTNRRX 
190 200 210 220 230 240 



a256 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 101 1>: 

g256-l.seq 

1 ATGATTTTGA CACCGCCGGA CACGCCCTTT TTCCTCCGCA ACGGCAATGC 
51 CGACACGATT GCCGCCAAAT TCCTGCAACA CCCCGCACCC GCATACCGCC 
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101 GCGAGATGCT TCCCGACAGC ACGGGTAAAA CCAAAACCGC CTACGACTTT 

151 TCAGCAGGCG GCATTTCGCC CGATGCGCCG CTGGTCGTGC TGTTTCACGG 

201 TTTGGAAGGA AGCAGCCGCA GCCATTACGC GGTCGAACTG ATGCTCGCGG 

251 TACGCAATCG GGGTTGGCAC GGCGCAGTCG TCCATTTCCG CAGCTGCGGC 

301 GGCGTAGCGA ACACCGCCCC GGTGTTCTAC CACTTGGGTG ATACCGCCGA 

351 AATCGCCTTT GCTTTGGACA CGCTCACCGC GCGTTACCGT GAAATATACG 

401 CCGTCGGCGT ATCGCTGGGC GGCAACGCGC CGGCAAAATA TTTGGGCGAA 

451 CAGGGCAAAA AGGCATTGCC GCACGCCTCG GCCGCCGTAT CCGCCCCCGT 

501 TGATGCAGAG GCGGCAGGCA GCCGCTTCGA CAGCGGCATC ACGCGGCTGC 

551 TCTACACGCG CTACTTCCTC CGCACACTGA TACCCAAAGC ACGTTCGCTC 

601 CAAGGTTTTC AGACGGCATT TGCCGCAGGG TGCAAAACAC TGGGCGAGTT 

651 TGACGACCGT TTCACCGCAC CGCTGCACGG CTTTGCCGAC CGGCACGACT 

701 ACTACCGCCA AACTTCCTGC AAACCGCTGC TCAAACACGT TGCCAAACCG 

751 CTGCTCCTGC TCAATGCCGC CAACGACCCC TTCCTGCCGC CCGAAGCCCT 

801 GCCCCGTGCA GACGAAGCGT CCGAAGCCGT TACCCTGTTC CAACCTGCAC 

851 ACGGCGGGCA CGCCGGCTTT GTCAGCAGCA CCGGCGGCAG GCTGCACCTG 

901 CAATGGCTGC CGCAGACCGT CCTGTCCTAT TTTGACAGCT TCCGCACAAA 

951 CAGGCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1012; ORF 256-l.ng>: 

g256-l .pep 

1 MILTPPDTPF FLRNGNADTI AAKFLQHPAP AYRREMLPDS TGKTKTAYDF 

51 SAGGISPDAP LVVLFHGLEG SSRSHYAVEL MLAVRNRGWH GAVVHFRSCG 

101 GVANTAPVFY HLGDTAEIAF ALDTLTARYR EIYAVGVSLG GNAPAKYLGE 

151 QGKKALPHAS AAVSAPVDAE AAGSRFDSGI TRLLYTRYFL RTLIPKARSL 

201 QGFQTAFAAG CKTLGEFDDR FTAPLHGFAD RHDYYRQTSC KPLLKHVAKP 

251 LLLLNAANDP FLPPEALPRA DEASEAVTLF QPAHGGHAGF VSSTGGRLHL 

301 QWLPQTVLSY FDSFRTNRR* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1013>: 

m256-l. seq 

1 ATGATTTTAA CACCGCCGGA 

51 CGACACGATT GCCGCCAAAT 

101 GAGAGCTGCT TCCCGACAGC 

151 TCAGACGGCA TTTCGCCCGA 

201 GGAAGGAAGC AGCCGCAGCC 

251 GCGATCGGGG TTGGCACGGC 

301 ATTGCCAACA CCGCTCCGGT 

351 CGCCTTTACT TTGGACACGT 

401 TCGGCGTATC GCTGGGCGGC 

4 51 GGCAAAAAGG CATTGCCGCA 

501 TGCAGAGGCG GCAGGCAGAC 

551 ACACGCGCTA CTTCCTCCGC 

601 GGTTTTCAGA CGGCATTTGC 

651 CGACCGCTTC ACCGCACCGC 

701 ACCGCCAAAC TTCCTGCAAA 

7 51 CTCCTGCTCA ATGCCGTCAA 

801 CCGCGCAGAC GAAGTATCCG 

851 GTGGTCATGT CGGCTTTGTC 

901 TGGCTGCCGC AGACCGTCCT 

951 GCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1014; ORF 256- 1>: 

m256-l .pep 

1 MILTPPDTPF FLRNGNADTI AAKFLQRPAP AYRRELLPDS TGKTKVAYDF 

51 SDGISPDAPL VVLFHGLEGS SRSHYAVELM LAVRDRGWHG VVVHFRSCGG 

101 IANTAPVFYH LGDTAEIAFT LDTFAARYRE IYAVGVSLGG NALAKYLGEQ 

151 GKKALPQAAA VISAPVDAEA AGRRFDSGIT RLLYTRYFLR TLIPKAKSLQ 

201 GFQTAFAAGC KTLGEFDDRF TAPLHGFADR HDYYRQTSCK PLLKHVAKPL 

251 LLLNAVNDPF LPPEALPRAD EVSEAVTLFQ PAYGGHVGFV SSTGGRLHLQ 

301 WLPQTVLSYF DSFRTNRR* 

m256-l/g256-l 93.1% identity in 319 aa overlap 

10 20 30 40 50 59 

m256-l .pep MILTPPDTPFFLRNGNADTIAAKFLQRPAPAYRRELLPDSTGKTKVAYDFS-DGISPDAP 
I I I I I I I I I I I I I I I I I I I I ! I I I I I : I I I I I I I I : I I I I I I I I I : I I I II I I f ( I ! I 
g256-l MILTPPDTPFFLRNGNADTIAAKFLQHPAPAYRREMLPDSTGKTKTAYDFSAGGISPDAP 

10 20 30 40 50 60 



CACGCCCTTT 
TCCTGCAACG 
ACGGGTAAAA 
TGCGCCGCTG 
ATTACGCGGT 
GTAGTCGTCC 
GTTCTACCAC 
TCGCCGCGCG 
AACGCGCTGG 
AGCCGCTGCC 
GCTTCGACAG 
ACCCTGATAC 
CGCAGGGTGC 
TGCACGGCTT 
CCGCTGCTCA 
CGACCCCTTC 
AAGCCGTTAC 
AGCAGCACCG 
GTCCTATTTC 



TTCCTCCGCA 
CCCCGCGCCC 
CCAAAGTCGC 
GTCGTGCTGT 
CGAACTGATG 
ATTTCCGCAG 
TTGGGCGATA 
TTACCGTGAA 
CAAAATATTT 
GTCATCTCCG 
CGGCATCACG 
CCAAAGCAAA 
AAAACACTGG 
TGCCGACCGG 
AACACGTTGC 
CTGCCGCCCG 
CCTGTTCCAG 
GCGGCAGGCT 
GACAGCTTCC 



ACGGCAATGC 
GCATACCGCC 
CTACGACTTT 
TTCACGGTTT 
CTTGCGGTAC 
CTGCGGCGGC 
CCGCCGAAAT 
ATATACGCCG 
GGGCGAACAG 
CCCCCGTCGA 
CGGCTGCTCT 
ATCGCTCCAA 
GCGAGTTTGA 
CACGACTACT 
CAAACCGCTG 
AAGCCCTGCC 
CCGGCATATG 
GCACCTGCAA 
GCACAAACAG 



m256-l . pep 



60 70 80 90 100 110 119 
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I I I I I I I I I I I I I I I I I I I I I I I I I : I I I t I : M I I I I I I I : I I t I I f I i I I I I I I I I I I 
g256-l LVVLFHGLEGSSRSHYAVELMLAVRNRGWHGAVVHFRSCGGVANTAPVFYHLGDTAEIAF 
70 80 90 100 110 120 

120 130 140 150 160 170 179 

TLDTFAARYREIYAVGVSLGGNALAKYLGEQGKKALPQAAAVISAPVDAEAAGRRFDSGI 
: I I I : : I I ! I I I I I I ! I I I ! I I I I I I I f I I I I I I I 1 : i : I : : I I I I I I I I j I Mill 
ALDTLTARYREI YAVGVSLGGNAPAKYLGEQGKKALPHASAAVSAPVDAEAAGSRFDSGI 
130 140 150 160 170 180 

180 190 200 210 220 230 239 

TRLLYTRYFLRTLIPKAKSLQGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSC 
I t I ! I I I I I I I I I I I I I : I I I i i I I ! I I I M I i I I I I I I M I I I I I I I I I I I I I I I I I I i 
TRLLYTRYFLRTLIPKARSLQGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSC 
190 200 210 220 230 240 

240 250 260 270 280 290 299 

KPLLKHVAKPLLLLNAVNDPFLPPEALPRADEVSEAVTLFQPAYGGHVGFVSSTGGRLHL 
li II MINI If MM Ml II Mill M Mi IHMIIII I IIMII: I Nil IN II II 
KPLLKHVAKPLLLLNAANDPFLPPEALPRADEASEAVTLFQPAHGGHAGFVSSTGGRLHL 
250 260 270 280 290 300 

300 310 319 

m2 5 6-1. pep QWLPQTVLSYFDSFRTNRRX 
I I II M M I I ] I I M I I I I I 
g256~l QWLPQTVLSYFDSFRTNRRX 
310 320 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1015>: 

a256-l. 3eq 

1 ATGATTTTGA CACCGCCGGA 

51 CGACACGATT GCCGCCAAAT 

101 GCGAGCTGCT TCCCGACAGC 

151 TCAGACGGCA TTTCGCCCGA 

201 GGAGGGCGGC AGTGGCAGCC 

251 GCGATCGGGG TTGGAACGGC 

301 GTAGCGAACA CCGCCCCGGT 

351 TGCCTTTACT TTGGACACGC 

401 TCGGCGTATC GCTGGGCGGC 

451 GGCGAAAACG CGCTGCCGCA 

501 TGCAGAGGCG GCAGGCAACC 

551 ACACGCGCTA CTTCCTCCGC 

601 GGTTTTCAGA CGGCATTTGC 

651 CGACCGTTTC ACCGCACCGC 

701 ACCGCCAAAC TTCCTGCAAA 

751 CTCCTGCTCA ATGCCGTCAA 

801 CCGCGCAGAC GAAGTGTCCG 

851 GTGGTCATGT CGGCTTTGTC 

901 TGGTTGCCGC AGACCGTCCT 

951 GCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1016; ORF 256-l.a>: 

a256-l.pep 

1 MILTPPDTPF FLRNGNADTI AAKFLQRSAP AYRRELLPDS TGKTKTAYDF 

51 SDGISPDAPL VVLFHGLEGG SGSHYAVELM LAVRDRGWNG VVVHFRSCGG 

101 VANTAPVFYH LGDTAEIAFT LDTLAARYRE IYAVGVSLGG NALAKYLGEQ 

151 GENALPQAAA VISAPVDAEA AGNRFDSGIT RLLYTRYFLR TLIPKARSLQ 

201 GFQTAFAAGC KTLGEFDDRF TAPLHGFADR HDYYRQTSCK PLLKHVAKPL 

251 LLLNAVNDPF LPPEALPRAD EVSEAVTLFQ PTHGGHVGFV GSTGGRLHLQ 

301 WLPQTVLSYF DSFRTNRR* 

a256-l/m256-l 95.6% identity in 318 aa overlap 

10 20 30 40 50 60 

MILTPPDTPFFLRNGNADTIAAKFLQRSAPAYRRELLPDSTGKTKTAYDFSDGISPDAPL 
I I II I M I I I I I I I I II M I M I I I I I I I I II II I I II I M I I I : M I I II I I I I I I I I 
MILTPPDTPFFLRNGNADTIAAKFLQRPAPAYRRELLPDSTGKTKVAYDFSDGISPDAPL 
10 20 30 40 50 60 

70 80 90 100 110 120 

VVLFHGLEGGSGSHYAVELMLAVRDRGWNGVVVHFRSCGGVANTAPVFYHLGDTAEIAFT 
I II I I I I M : I I I I I I II I I I II I I I I :| II I I I I I I I I : I I I I II I II I I I I I I II I I 
VVLFHGLEGSSRSHYAVELMLAVRDRGWHGVVVHFRSCGGIANTAPVFYHLGDTAEIAFT 



m256-l .pep 
g256-l 



m256-l .pep 
g256-l 



m256-l . pep 
g256-l 



CACACCCTTT 
TCCTGCAACG 
ACGGGTAAAA 
TGCGCCGCTG 
ATTACGCGGT 
GTAGTCGTCC 
GTTCTACCAC 
TCGCCGCGCG 
AACGCGCTGG 
AGCCGCCGCC 
GCTTCGACAG 
ACACTGATAC 
CGCAGGGTGC 
TGCACGGCTT 
CCGCTGCTCA 
CGACCCCTTC 
AAGCCGTTAC 
GGCAGCACCG 
GTCCTATTTC 



TTCCTCCGCA 
CTCCGCACCT 
CCAAAACCGC 
GTCGTGCTGT 
CGAACTGATG 
ATTTCCGCAG 
TTGGGCGATA 
TTACCGTGAA 
CAAAATATTT 
GTCATCTCCG 
CGGCATCACA 
CCAAAGCACG 
AAAACACTGG 
TGCCGATCGG 
AACACGTTGC 
CTGCCGCCCG 
CCTGTTCCAG 
GCGGCAGGCT 
GACAGCTTCC 



ACGGCAATGC 
GCATACCGCC 
CTACGACTTT 
TTCACGGTTT 
CTCGCGGTAC 
CTGCGGCGGC 
CCGCCGAAAT 
ATATACGCCG 
GGGCGAACAG 
CACCCGTCGA 
CGGCTGCTCT 
GTCGCTCCAA 
GCGAGTTTGA 
CACGACTACT 
CAAACCGCTG 
AAGCGCTGCC 
CCGACACACG 
GCACCTGCAA 
GCACAAACAG 



a256-l .pep 
m256-l 

a256-l .pep 
m256-l 
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70 80 90 100 110 120 

130 140 150 160 170 180 

a256-l.pep LDTLAARYREIYAVGVSLGGNALAKYLGEQGENALPQAAAVISAPVDAEAAGNRFDSGIT 

111:11111111111 IIMMIil::|lllllll Ml liMIIM lllllll 

m256-l LDTFAARYREIYAVGVSLGGNALAKYLGEQGKKALPQAAAVISAPVDAEAAGRRFDSGIT 

130 140 150 160 170 180 

190 200 210 220 230 240 

a256-l.pep RLLYTRYFLRTLIPKARSLQGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCK 

I I I I I I I I I I I I I I I |:| I I I I I I I I I I I I || I I I | I || I | I | I f| I I I I | | 

m256-l RLLYTRYFLRTLIPKAKSLQGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCK 

190 200 210 220 230 240 

250 260 270 280 290 300 

a256-l.pep PLLKHVAKPLLLLNAVNDPFLPPEALPRADEVSEAVTLFQPTHGGHVGFVGSTGGRLHLQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I I I I I I : I I I I I I I I I 
m256-l PLLKHVAKPLLLLNAVNDPFLPPEALPRADEVSEAVTLFQPAYGGHVGFVSSTGGRLHLQ 

250 260 • 270 280 290 300 

310 319 
a2 5 6- 1 . pep WLPQTVLS YFDS FRTNRRX 
I I I I I I I I I I I I I I I I I I I 
m256-l WLPQTVLS YFDS FRTNRRX 

310 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 101 7>: 

g257 . seq 

1 atgggcaggc atttcgggcg cagacgtttt ctgacggctg ccgccgttgc 

51 tgtggccggt gcggcggttt cttttttgcc gaatcctttt gccgccggcg 

101 gcgaaaaacg caacatggat aaaaaacgcg atgaaaatgt gtttttctgg 

151 aaaggtgtcg cgctgggttc cggcgcggag ctgcgcctgt tcggcgtgga 

201 cgacagacag gcggcggatt tggtcaataa ggttttggcg gaagtggcgc 

251 gtttggaaaa aatgttcagc ctttaccgtg aagacagcct gatcagccgt 

301 ctgaaccgcg acggttatct gacttcgcct ccggcggatt ttttggaact 

351 gttgagcctg gccgcgatat tcacgcgctg a 

This corresponds to the amino acid sequence <SEQ ID 1018; ORF 257.ng>: 

g257 .pep 

1 MGRHFGRRRF LTAAAVAVAG A AVS FLPNPF AAGGEKRNMD KKRDENVFFW 

51 KGVALGSGAE LRLFGVDDRQ AADLVNKVLA EVARLEKMFS LYREDSLISR 

101 LNRDGYLTSP PADFLELLSL AAIFTR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 101 9>: 

m257 . seq 

1 ATGGGCAGGC ATTTCGGGCG . CAGCGTTTT CTGACGGTTG CCGCCGTTGC 

51 GGCGGGGaC . GCGGCGGTTT CTTTCCTGCC GAATCCTTTT GCCGCCGATG 

101 ATGAAAAACG CAAcGGGGAT GAAAAACGCA ATGAAAATGT GTTTTTCTGG 

151 AAAGGTGTCG CACTGGGTTC CGGTGCGGa . CTCCGTCTGT TCGGTGTGGA 

2 01 CGACAGGCGT GCGGCGGATT TGGTCAACAA GGTTTTGGCG GAAGTGGCGC 

251 GTTTGGAAAA ATTGTTCAGC CTTTACCGTG AAGACAGCCT GATCAGCCGC 

301 CTGAACAGGG ACGGTTATCT GACTTCGCCG TCGGCGGATT TTTTGGAACT . 

351 GkTGAGCCTG GCCGCGATAT TCACGCkCTG A 

This corresponds to the amino acid sequence <SEQ ID 1020; ORF 257>: 

m257 .pep 

1 MGRHFGXQRF LTVAAVAAGX A AVS FLPNPF AADDEKRNGD EKRNENVFFW 

51 KGVALGSGAX LRLFGVDDRR AADLVNKVLA EVARLEKLFS LYREDSLISR 

101 LNRDGYLTSP SADFLELXSL AAIFTX* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 257 shows 88.0% identity over a 125 aa overlap with a predicted ORF (ORF 257.ng) 
from N. gonorrhoeae: 

m257/g257 
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m2 5 7 . pep MGRHFGRQRFLTVAAVAAGTAAVSFLPNPFAADDEKRNGDEKRNENVFFWKGVALGSGAD 

llllllhlllhlllh' llllllllllll 1 1 1 1 h I h 1 1 1 M II 1 1 1 1 1 1 1 1 : 

92 5 7 MGRHFGRRRFLTAAAVAVAGAAVS FL PNP FAAGGE KRNMDKKRDENV FFWKGVALGS GAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 57 . pep LRLFGVDDRRAADLVNKVLAEVARLEKLFSLYREDSLISRLNRDGYLTSPSADFLELXSL 

I MM I M h 1 1 II 1 1 1 M I M I II I h INN I ' 1 1 i 1 1 Ml 1 1 II I, I MINI 1 1 

9257 LRLFGVDDRQAADLVNKVLAEVARLEKMFSLYREDSLISRLNRDGYLTSPPADFLELLSL 

70 80 90 100 110 120 

m257.pep AAIFTXX 
1 I I I I I 

g257 AAIFTRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1021>: 
a257 . seq 

• 1 ATGGGCAGGC ATTTCGGGCG CAGGCGTTTT TTGACAGTTG CCGCCGTTGC 

51 GGCGGCGGGC GCGGCGGTTT CTTTCCTGCC GAATCCTTTT GCCGCCGATG 

101 ATGAAAAACG CAATAAAGAT GAAAAACGCA ATGAAAATGT GTTTTTCTGG 

151 AAAGGTGTCG CACTGGGTTC CGGTGCGGAG CTCCGTCTGT TCGGTGTGGA 

201 CGACAGGCGT GCGGCGGATT TGGTCAACAA GGTTTTGGCG GAAGTGGCGC 

251 GTTTGGAAAA AATGTTCAGC CTTTACCGTG AAGACAGCCT GATCAGCCGT 

301 CTGAACCGTG ACGGTTATTT GACTTCGCCG CCGGCGGATT TTTTGGAACT 

351 GTTGAGCCTG GCCGTGATAT TCACGCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 1022; ORF 257.a>: 

a257 .pep 

1 MGRHFGRRRF LTVAAVAAAG A AVSFLPNPF AADDEKRNKD EKRNENVFFW 

51 KGVALGSGAE LRLFGVDDRR AADLVNKVLA EVARLEKMFS LYREDSLISR 

101 LNRDGYLTSP PADFLELLSL AVIFTR* 

m257/a257 92.0% identity in 125 aa overlap 

10 20 30 40 50 

60 

m257 .pep 

MGRHFGXQRFLTVAAVAAGXAAVSFLPNPFAADDEKRNGDEKRNENVFFWKGVALGSG7VX 

I I I I I I : I I I I I I ! I I I : I I I I I I I I I I I I I I I I I I 

I I I I I I I I I I I I I I I I I I I I 
a257 

MGRHFGRRRFLTVAAVAAAGAAVSFLPNPFAADDEKRNKDEKRNENVFFWKGVALGSGAE 

10 20 30 40 50 

60 

70 80 90 100 110 

120 

m257 .pep 

LRLFGVDDRRAADLVNKVLAEVARLEKLFSLYREDSLISRLNRDGYLTSPSADFLELXSL 

I I I I I M I I I I I I I I I I I I I I I I I I I I : | | | | | | | | | | | | | | | | | | | | | | MUM 

I I 

a257 

LRLFGVDDRRAADLVNKVLAEVARLEKMFSLYREDSLISRLNRDGYLTSPPADFLELLSL 

70 80 90 100 110 

120 



m257.pep AAIFTXX 
1:111 

a257 AVIFTRX 
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g258 . seq 








1 


atgcgccgct 


tcctaccgat 


cgcagccata 


51 


cggattgacg 


gcggcgaccg 


gcagcaccag 


101 


ggtggatagt 


ctcgttcagc 


gcaatgctgc 


151 


ttggcacgtt 


atgtcatatt 


gctgttgaaa 


201 


cggttcgcag 


attgccaaac 


gcctttccgg 


251 


tactgcccgg 


cttgttcctg 


ttcggcattt 


301 


acgattaatt 


cgtggttcgg 


caacgacacc 


351 


ccttaatttg 


agcaagtccg 


cactggattt 


401 


gcaacgccgt 


tcccgtacag 


atagacctca 


451 


ggcaatatgg 


gcagtgtgct 


ggaacactac 


501 


gcttgccctg 


tacaatgccg 


caagcgggaa 


551 


cgcaccaatt 


cgaccagccg 


cttcccgaca 


601 


cagcagaccg 


gttcggttcg 


gagtttggaa 


651 


cgcgcaggga 


tggttgtcgg 


caggtacgca 


701 


tgttcttccg 


ccagccgatt 


cccgaaaatg 


751 


attgaaaagg 


cgcgggcgaa 


atatgccgaa 


801 


tttgcagacc 


ttttttctgg 


taaccctgct 


851 


tttttcttgc 


gctggtaatg 


gcactgtatt 


901 


cccattctgt 


cgcttgccga 


gggcgcaaag 


951 


cagccagacg 


cgccccgtat 


tgcgcaacga 


1001 


agctgttcaa 


ccatatgacc 


gagcagcttt 


1051 


gaacgcaacc 


gccggcgcga 


ggaagccgcc 


1101 


gttggatggg 


ttgactaccg 


gtgtggtggt 


1151 


gccgtaccgc 


ggtgttttcc 


acttgtcatt 


1201 


taa 







This corresponds to the amino acid sequence <SEQ ID 1024; ORF 258.ng>: 

g258 .pep 

1 MRRFLPIAAI CAWLLYGLT AATGSTSSLA DYFWWIVSFS AML LLVLSAV 

51 LARYVILLL K DRRNGVFGSQ IAKRLS GMFT LVAVLPGLFL FGI SAQFING 

101 TINSWFGNDT HEALER SLNL SKSALDLAAD NAVSNAVPVQ IDLIGTASLS 

151 GNMGSVLEHY AGSGFAQLAL YNAASGKIEK SINPHQFDQP LPDKEHWEQI 

201 QQTGSVRSLE SIGGVLYAQG WLSAGTHNGR DYALFFRQPI PENVAQDAVL 

251 IEKARAKYAE LSYSKKGLQT FFLVT LLIAS LLSIFLALVM AL YFARRFVE 

301 PILSLAEGAK AVAQGDFSQT RPVLRNDEFG RLTKLFNHMT EQLSIAKEAD 

351 ERNRRREEAA RHYLECVLDG LTTGWVSYP LSCCRTAVFS TCHSSPLSYF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1025>: 

m258 . seq 

1 , ATGCGCCGTT TTCTACCGAT CGCAGCCATA TGCGCCGTCG ' TCCTGTTGTA 

51 CGGACTGACG GCGGCAACCG GCAGCACCAG TTCGCTGGCG GATTATTTCT 

101 GGTGGATTGT TGCGTTCAGC GCAATGCTGC TGCTGGTGTT GTCCGCCGTT 

151 TTGGCACGTT ATGTCATATT GCTGTTGAAA GACAGGCGCG ACGGCGTATT 

201 CGGTTCGCAG ATTGCCAAAC GCCTTTCTGG GATGTTTACG CTGGTTGCCG 

251 TACTGCCCGG CGTGTTTCTG TTCGGCGTTT CCGCACAGTT CATCAACGGC 

301 ACGATTAATT CGTGGTTCGG CAACGATACC CACGAGGCGC TTGAACGCAG 

351 CCTCAATTTG AGCAAGTCCG CATTGAATTT GGCGGCAGAC AACGCCCTCG 

4 01 GCAACGCCGT CCCCGTGCAG ATAGACCTCA TCGGCGCGGC TTCCCTGCCC 

451 GGGGATATGG GCAGGGTGCT GGAACATTAC GCCGGCAGCG GTTTTGCCCA 

501 GCTTGCCCTG TACAATGCCG CAAGCGGCAA AATCGAAAAA AGCATCAACC 

551 CGCACAAGCT CGATCAGCCG TTTCCAGGTA AGGCGCGTTG GGAAAAAATC 

601 CAACGGGCGG GTTCGGTCAG GGATTTGGAA AGCATAGGCG GCGTATTGTA 

651 CGCGCAGGGC TGGCTGTCGG CGGGTACGCA CAACGGGCGC GATTACGCCT 

701 TGTTTTTCCG TCAGCCGGTT CCCAAAGGCG TGGCAGAGGA TGCCGTCTTA 

751 ATCGAAAAGG CAAGGGCGAA ATATGCTGAG TTGAGTTACA GCAAAAAAGG 

801 TTTGCAGACC TTTTTCCTGG CAACCCTGCT GATTGCCTCG CTGCTGTCGA 

851 TTTTTCTTGC ACTGGTCATG GCACTGTATT TCGCCCGCCG TTTCGTCGAA 

901 CCCGTCCTAT CGCTTGCCGA GGGGGCGAAG GCGGTGGCGC AAGGCGATTT 

951 CAGCCAGACG CGCCCCGTGT TGCGCAACGA CGAGTTCGGA CGCTTGACCA 
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1001 AGTTGTTCAA CCACATGACC GAGCAGCTTT CCATCGCCAA AGAAGCAGAC 

1051 GAGCGCAACC GCCGGCGCGA GGAAGCCGCC AGGCATTATC TTGAATGCGT 

1101 GTTGGAGGGG CTGACCACGG GCGTGGTGGT GTTTGACGAA CAAGGCTGTC 

1151 TGAAAACsTT CAACAAAGCG GCGGAACAGA TTyTGGGGAT GCCGCTTACC 

1201 CCCcTGTGGG GCAGCAGCCG GCACGGTTGG CACGGCGTTT CGG CGCAGCA 

1251 GTCCCTGCTT GCCGAAGTGT TTGCCGCCAT CGGCGCGGCG GCAGGTACGG 

13 01 ACAAACCGGT CCATGTGAAA TATGCCGCGC CGGACGATGC CAAAATCCTG 

1351 CTGGGCAAGG CAACCGTCCT GCCCGAAGAC AACGGCAACg GCGTGGTAAT 

1401 GGTGATTGAC GACATCACCG TTTTGATACA CGCGCAAAAA GAAGCCGCGT 

1451 GGGGCGAAgT GGCGaAgCGG CTGGCACACG AAATCCGCAA TCCGCTCACG 

1501 CCCATCCAGC TTTCCGCCGA ACgGsTGGCG TkGAAATTGG GCGGGAAGCT 

1551 GGATGAGCAG GATGCGCAAA TCCTGACGCG TTCGACCGAC ACCATCGTCA 

1601 AACAGGTGGC GGCATTGAAG GAAATGGTCG AAGCATTCCG CAATTATGCG 

1651 CGTTCCCCTT CGCTCAAATT GGAAAATCAG GATTTGAACG CCTTAATCGG 

1701 CGATGTGTTG GCATTGTATG AAGCCGGTCC GTGCCGGTTT GCGGCGGACT 

1751 TGCCGGCGAA CCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 1026; ORF 258>: 

m258 .pep 

1 MRRFLPIAAI CAWLLYGLT AATGSTSSLA DYFWWIVAFS AML LLVLSAV 

51 LARYVILLL K DRRDGVFGSQ IAKRLS GMFT LVAVLPGVFL FGV SAQFING 

101 TINSWFGNDT HEALERSLNL SKSALNLAAD NALGNAVPVQ IDLIGAASLP 

151 GDMGRVLEHY AGSGFAQLAL YNAASGKIEK SINPHKLDQP FPGKARWEKI 

201 QRAGSVRDLE SIGGVLYAQG WLSAGTHNGR DYALFFRQPV PKGVAEDAVL 

251 IEKARAKYAE LSYSKKGLQT FFLA TLLIAS LLSIFLALVM AL YFARRFVE 

301 PVLSLAEGAK AVAQGDFSQT RPVLRNDEFG RLTKLFNHMT EQLSIAKEAD 

351 ERNRRREEAA RHYLECVLEG LTTGVWFDE QGCLKTFNKA AEQILGMPLT 

401 PLWGSSRHGW HGVSAQQSLL AEVFAAIGAA AGTDKPVHVK YAAPDDAKIL 

4 51 LGKATVLPED NGNGWMVID DITVLIHAQK EAAWGEVAKR LAHEIRNPLT 

501 PIQLSAERXA XKLGGKLDEQ DAQILTRSTD TIVKQVAALK EMVEAFRNYA 

551 RSPSLKLENQ DLNALIGDVL ALYEAGPCRF AADLPANR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 258 shows 90.9% identity over a 386 aa overlap with a predicted ORF (ORF 258.ng) 
from N. gonorrhoeae: 

m258/g258 

10 20 30 40 50 60 

MRRFLPIAAI CAWLLYGLTAATGSTSSLADYFWWIVAFSAMLLLVLSAVLARYVILLLK 
f I I I i I I I 1 i I I I I I I t I I I 1 I j I I I | I I I | I I | | | 1 x | | | | j | | | { | I I | f | I I J | I I I 
MRRFLPIAAI CAWLLYGLTAATGSTSSLADYFWWIVSFSAMLLLVLSAVLARYVILLLK 
10 20 30 40 50 60 

70 80 90 100 110 120 

DRRDGVFGSQIAKRLSGMFTLVAVLPGVFLFGVSAQFINGTINSWFGNDTHEALERSLNL 

M hi I llllill IIMIIIMIIIIhlll hi II I! I III IM IMIMM M! || || 

DRRNGVFGSQIAKRLSGMFTLVAVLPGLFLFGISAQFINGTINSWFGNDTHEALERSLNL 
70 80 90 100 110 120 



m258 .pep 
g258 

m258 .pep 
g258 



130 140 150 160 170 180 

m258 . pep SKSALNLAADNALGNAVPVQ I DL I GAASLPGDMGRVLEH YAGS G FAQLAL YNAASGKI EK 

llllhlllllh:|||IMIII||:||| |:|| I I I I I I I I M M I I M I I I I I I I I I 
g258 SKSALDLAADNAVSNAVPVQIDLIGTASLSGNMGSVLEHYAGSGFAQLAL YNAASGKIEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 58 . pep SINPHKLDQPFPGKARWEKIQRAGSVRDLESIGGVLYAQGWLSAGTHNGRDYALFFRQPV 

1 1 1 1 1 s 5 1 1 h I I h h I h : I 1 1 h I 1 1 1 1 I 1 1 I M 1 1 I II I 1 1 I 1 1 I I I 1 1 I II h 

9258 SINPHQFDQPLPDKEHWEQIQQTGSVRSLESIGGVLYAQGWLSAGTHNGRDYALFFRQPI 

190 200 210 220 230 240 
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250 260 270 280 290 300 

PKGVAEDAVLIEKARAKYAELSYSKKGLQTFFI^TLLIASLLSIFLALVMALYFARRFVE 

I- I hi II I M 1 1 1 II II II II i 1 1 1 ! 1 1 , i h 1 1 1 ' 1 1 1 i II II Mi II II M I M I 

PENVAQDAVLIEKARAKYAELSYSKKGLQTFFLVTLLIASLLSIFLALVM^YFARRFVE 
250 260 270 280 290 300 

310 320 330 340 350 360 

PVLSLAEGAKAVAQGDFSQTRPVLRNDEFGRLTKLFNHMTEQLSIAKEADERNRRREEAA 

h 1 1 1 1 II M I , I Ml 1 1 1 II : I M II 1 1 1 1 1 1 1 1 1 II 1 1 ! 1 1 ; i II M 1 , 1 1 1 1 1 ; 1 1 1 

PILSLAEGAKAVAQGDFSQTRPVLRNDEFGRLTKLFNHMTEQLSIAKEADERNRRREEAA 
310 320 330 340 350 360 

370 380 390 400 410 420 

RHYLECVLEGLTTGWVFDEQGCLKTFNKAAEQILGMPLTPLWGSSRHGWHGVSAQQSLL 

MM I II MM II MM MM 

RHYLECVLDGLTTGVWSYPLSCCRTAVFSTCHSSPLSYFX 
370 380 390 400 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1027>: 

a258 . seq 

1 ATGCGCCGTT TTCTACCGAT CGCAGCCATA TGCGCCGTCG TCCTGTTGTA 

51 CGGACTGACG GCGGCAACCG GCAGCACCAG TTCGCTGGCG GATTATTTCT 

101 GGTGGATTGT TGCGTTCAGC GCAATGCTGC TGCTGGTGTT GTCCGCCGTT 

151 TTGGCACGTT ATGTCATATT GCTGTTGAAA GACAGGCGCG ACGGCGTATT 

201 CGGTTCGCAG ATTGCCAAAC GCCTTTCCGG GATGTTTACG CTGGTTGCCG 

251 TACTGCCCGG CGTGTTTCTG TTCGGCGTTT CCGCACAGTT TATCAACGGC 

301 ACGATTAATT CGTGGTTCGG CAACGATACC CACGAGGCGC TTGAACGCAG 

351 CCTCAATTTG AGCAAGTCCG CATTGAATCT GGCGGCAGAC AACGCCCTTG 

4 01 GCAACGCCAT CCCCGTGCAG ATAGACCTCA TCGGCGCGGC TTCCCTGCCC 

4 51 GGGGAT AT GG GCAGGGTGCT GGAACATTAC GCCGGCAGCG GTTTTGCCCA 

501 GCTTGCCCTG TACAATGCCG CAAGCGGCAA AATCGAAAAA AGCATCAACC 

551 CGCACAAGCT CGATCAGCCG TTTCCAGGTA AGGCGCGTTG GGAAAAAATC 

601 CAACAGGCGG GTTCGGTCAG GGATTTGGAA AGCATAGGCG GCGTATTGTA 

651 CGCGCAGGGC TGGCTGTCGG CAGGTACGCA CAACGGGCGC GATTACGCCT 

701 TGTTTTTCCG TCAGCCGGTT CCCAAAGGCG TGGCAGAGGA TGCCGTCTTA 

751 ATCGAAAAGG CAAGGGCGAA ATATGCTGAG TTGAGTTACA GCAAAAAAGG 

801 TTTGCAGACC TTTTTCCTGG CAACCCTGCT GATTGCCTCG CTGCTGTCGA 

851 TTTTTCTTGC ACTGGTCATG GCACTGTATT TCGCCCGCCG TTTCGTCGAA 

901 CCCGTCCTAT CGCTTGCCGA GGGGGCGAAG GCGGTGGCGC AAGGCGATTT 

951 CAGCCAGACG CGCCCCGTGT TGCGCAACGA CGAGTTCGGA CGCTTGACCA 

1001 AGTTGTTCAA CCACATGACC GAGCAGCTTT CCATCGCCAA AGAAGCAGAC 

1051 GAGCGCAACC GCCGGCGCGA GGAAGCCGCC AG AC ATT AT C TCGAATGCGT 

1101 GTTGGAGGGG CTGACCACGG GCGTGGTGGT GTTTGACGAA CAAGGCTGTC 

1151 TGAAAACCTT CAACAAAGCG GCGGAACAGA TTTTGGGGAT GCCGCTTACC 

1201 CCCCTGTGGG GCAGCAGCCG GCACGGTTGG CACGGCGTTT CGGCGCAGCA 

1251 GTCCCTGCTT GCCGAAGTGT TTGCCGCCAT CGGCGCGGCG GCAGGTACGG 

1301 ACAAACCGGT CCATGTGAAA TATGCCGCGC CGGACGATGC CAAAATCCTG 

1351 CTGGGCAAGG CAACCGTCCT GCCCGAAGAC AACGGCAACG GCGTGGTAAT 

14 01 GGTGATTGAC GACATCACCG TTTTGATACA CGCGCAAAAA GAAGCCGCGT 

1451 GGGGCGAAGT GGCAAAACGG CTGGCACACG AAATCCGCAA TCCGCTCACG 

1501 CCCATCCAGC TTTCTGCCGA ACGGCTGGCG TGGAAATTGG GCGGGAAGCT 

1551 GGACGAGCAG GACGCGCAAA TCCTGACACG TTCGACCGAC ACCATCATCA 

1601 ' AACAAGTGGC GGCATTAAAA GAAATGGTCG AGGCATTCCG CAATTACGCG 

1651 CGTTCCCCTT CGCTCAAATT GGAAAATCAG GATTTGAACG CCTTAATCGG 

1701 CGATGTGTTG GCATTGTACG AAGCTGGTCC GTGCCGGTTT GCGGCGGAAC 

1751 TTGCCGGCGA ACCGCTGATG ATGGCGGCGG ATACGACCGC CATGCGGCAG 

1801 GTGCTGCACA ATATTTTCAA AAATGCCGCC GAAGCGGCGG AAGAAGCCGA 

1851 TGTGCCCGAA GTCAGGGTAA AATCGGAAGC GGGGCAGGAC GGACGGATTG 

1901 TCCTGACAGT TTGCGACAAC GGCAAGGGGT TCGGCAGGGA AATGCTGCAC 

1951 AATGCCTTCG AGCCGTATGT AACGGACAAA CCGGCTGGAA CGGGATTGGG 

2001 ACTGCCCGTG GTGAAAAAAA TCATTGAAGA ACACGGCGGC CGCATCAGCC 

2051 TGAGCAATCA GGATGCGGGC GGCGCGTGTG TCAGAATCAT CTTGCCAAAA 

2101 ACGGTAGAAA CTTATGCGTA G 



m258 .pep 
g258 

m258 .pep 
9258 

m2 5 8 . pep 
g258 
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This corresponds to the amino acid sequence <SEQ ID 1028; ORF 258.a>: 

a258 .pep 

1 MRRFLPIAAI CAWLLYGLT AATGSTSSLA DYFWW IVAFS AMLLLVLSAV 

51 LARYVILLLK DRRDGVFGSQ IAKRLS GMFT LVAVLPGVFL FGV SAQFING 

101 TINSWFGNDT HEALERS LNL SKSALNLAAD NALGNAI PVQ IDLIGAASLP 

151 GDMGRVLEHY AGSGFAQLAL YNAASGKIEK SINPHKLDQP FPGKARWEKI 

201 QQAGSVRDLE SIGGVLYAQG WLSAGTHNGR DYALFFRQPV PKGVAEDAVL 

251 IEKARAKYAE LSYSKKGLQT FFLAT LLIAS LLSIFLALVM AL YFARRFVE 

301 PVLSLAEGAK AVAQGDFSQT RPVLRNDEFG RLTKLFNHMT EQLSIAKEAD 

351 ERNRRREEAA RHYLECVLEG LTTGVWFDE QGCLKTFNKA AEQILGMPLT 

4 01 PLWGSSRHGW HGVSAQQSLL AEVFAAIGAA AGTDKPVHVK YAAPDDAKIL 

451 LGKATVLPED NGNGWMVID DITVLIHAQK EAAWGEVAKR LAHEIRNPLT 

501 PIQLSAERLA WKLGGKLDEQ DAQILTRSTD TIIKQVAALK EMVEAFRNYA 

551 RSPSLKLENQ DLNALIGDVL ALYEAGPCRF AAELAGEPLM MAADTTAMRQ 

601 VLHNIFKNAA EAAEEADVPE VRVKSEAGQD GRIVLTVCDN GKGFGREMLH 

651 NAFEPYVTDK PAGTGLGLPV VKKIIEEHGG RISLSNQDAG GACVRIILPK 

701 TVETYA* 



m258/a258 99.0% identity in 584 aa overlap 

10 20 30 40 50 60 

m258.pep MRRFLPIAAICAWLLYGLTAATGSTSSLADYFWWIVAFSAMLLLVLSAVLARYVILLLK 
I I i I I I I I I M I I I I I I I I I I I I I i I I I I I I I I I I I i I I I I i I I I I I I I I I I I I I 1 I I I I 

a258 MRRFLPIAAICAWLLYGLTAATGSTSSLADYFWWIVAFSAMLLLVLSAVLARYVILLLK 

10 20 30 40 50 60 



70 80 90 100 110 120 

m258 . pep DRRDGVFGSQIAKRLSGMFTLVAVLPGVFLFGVSAQFINGTINSWFGNDTHEALERSLNL 
I I I I I I I I I I I ! I It I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I 
a258 DRRDGVFGSQIAKRLSGMFTLVAVLPGVFLFGVSAQFINGTINSWFGNDTHEALERSLNL 

70 80 90 100 110 120 



130 140 150 160 170 180 

m258 . pep SKSALNLAADNALGNAVPVQIDLIGAASLPGDMGRVLEHYAGSGFAQLALYNAASGKIEK 
I I II I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I 
a258 SKSALNLAADNALGNAIPVQIDLIGAASLPGDMGRVLEHYAGSGFAQLAL YNAASGKIEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

m258 .pep S IN PHKL DQ PFPGKARWEK I QRAG S VRDLE S IGGVLY AQGWLS AGTHNGR D Y AL FFRQ P V 

I I I I I I I I I I I I I I I I I I i I I : I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
a258 S IN PHKLDQP FPGKARWEKIQQAG SVRDLE S IGGVL Y AQGWLS AGTHNGRDYAL FFRQ PV 

190 200 210 220 230 240 



250 260 270 280 290 300 

m258 . pep PKGVAEDAVLIEKARAKYAELSYSKKGLQTFFLATLLIASLLSIFLALVMALYFARRFVE 
I M I I I I I I I I I I I I I I I I I I! II I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I 
a258 PKGVAEDAVL IEKARAKYAELSYSKKGLQTFFLATLL I AS LLS I FLALVMALYFARRFVE 

250 260 270 280 290 300 



310 320 330 340 350 360 

m258 . pep PVLSLAEGAKAVAQGDFSQTRPVLRNDEFGRLTKLFNHMTEQLSIAKEADERNRRREEAA 
I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 1 I 1 I I I I I I I I I I I I I I I 
a258 PVLSLAEGAKAVAQGDFSQTRPVLRNDEFGRLTKLFNHMTEQLSIAKEADERNRRREEAA 

310 320 330 340 350 360 



370 380 390 400 410 420 

m258 . pep RHYLECVLEGLTTGVWFDEQGCLKT FNKAAEQI LGMPLT PLWGS SRHGWHGVS AQQSLL 

I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 11 I I 1 I I II I I II I I M ! 
a258 RHYLECVLEGLTTGVWFDEQGCLKT FN KAAEQILGMPLTPLWGSSRHGWHGVSAQQSLL 

370 380 390 400 410 420 



430 440 450 460 470 480 

m258 . pep AEVFAAIGAAAGTDKPVHVKYAAPDDAKILLGKATVLPEDNGNGVVMVIDDITVLIHAQK 
I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a258 AEVFAAIGAAAGTDKPVHVKYAAPDDAKILLGKATVLPEDNGNGWMVIDDITVLIHAQK 
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430 440 450 460 470 480 

490 500 510 520 530 540 

m258 . pep EAAWGEVAKRLAHEIRNPLTPIQLSAERXAXKLGGKLDEQDAQILTRSTDTIVKQVAALK 
I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I II i I I I I I I I I I I I I I I : I I I I I I I 
a258 EAAWGEVAKRLAHEIRNPLTPIQLSAERLAWKLGGKLDEQDAQI LTRST DT 1 1 KQVAALK 

490 500 510 520 530 540 

550 560 570 580 589 

m258 . pep EMVEAFRNYARSPSLKLENQDLNALIGDVLALYEAGPCRFAADLPANRX 

I I I M I I I I I I I I I I I I I i I I I I I I I I I I M I I I I I I I I I I I : I 
a258 EMVEAFRNYARSPSLKLENQDLNALIGDVLALYEAGPCRFAAELAGEPLMMAADTTAMRQ 

550 560 570 580 590 600 

a258 VLHNIFKNAAEAAEEADV'PEVRVKSEAGQDGRIVLTVCDNGKGFGREMLHNAFEPYVTDK 

610 620 630 640 650 660 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1029>: 

g259 . seq 

1 atgatgatgc acgcttctgt ccaaagtcgt ttcgcaccga tactttatgt 

51 tttgattttc tttgccggtt ttttgaccgc gcaaatctgg ttcaatcaga 

101 aagcctatac tgaagagctg cctccgcttc tgtccgcatt gtccgccgtc 

151 gcgctggtgt ggctggcgtg ggcgttcgtg tcggtgcgtt caaaggctaa 

201 ggcagaaaag ttctaccgcg aaaaaatgat acagaacgaa agcatacacc 

251 ccgtcctgca cgcttctttg caacacttgg aacacaagcc gcaaatgctc 

301 gccctgctgg tcaaaaacca cggcaaaggc atggcggaac aggtcaggtt 

351 caaggcggaa gtgctgcccg acgacgaaga cgcgcgcacg attgccgccg 

4 01 agttggcaaa aatggatatg ttcgcattgg ggacggacgc ggtcgcctcg 

451 ggcgaaacct atgggcgcgt gttcgccgat attttcgagt tgtcggcggc 

501 tttggaaagg cgcgcgttca aagggatact gaaactgacg gcggaatata 

551 aaaaacatct tcggcgatgc ctgccgttcg gaaacggcgt tggatttggg 

601 cgcgctcaat caggcgttga gggaaatctc gaaaacgccg gaaaagccta 

651 a 

This corresponds to the amino acid sequence <SEQ ID 1030; ORF 259.ng>: 

g259 .pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYT EEL PPLLS ALSAV 

51 ALVWLAWAFV SV RSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 

151 GETYGRVFAD IFELSAALER RAFKGILKLT AEYKKHLRRC LPFGNGVGFG 

201 RAQSGVEGNL ENAGKA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1031 >: 



m259 . seq 


(partial) 










1 


ATGATGATGC 


ACGCTTCTGT 


CCAAAGCCGT 


TTCGCACCGA 


TACTTTATGT 


51 


TTTGATTTTC 


TTTGCCGGTT 


TTTTGACCGC 


GCAAATCTGG 


TTCAATCAGA 


101 


AAGCCTATAC 


TGAAGAGCTG 


CCTCCGCTTC 


TGTCCGCATT 


GTCCGCCGTC 


151 


GCGCTGGTGT 


GGCTGGCGTG 


GGCGTTCGTG 


TCGGCGCGTT 


CAAAGGCCAA 


201 


GGCGGAAAAG 


TTCTACCGCG 


AAAAAATGAT 


ACAGAACGAA 


AGCATACACC 


251 


CCGTCsTGCA 


CGCCTCTTTG 


CAACACTTGG 


AACACAAGCC 


GCAAATACTC 


301 


GCCCTGCTGG 


TCAAAAACCA 


CGGCAAAGGG 


ATGGCGGAAC 


AGGTCAGGTT 


351 


CAAGGCGGAA 


GTGCTGCCCG 


ACGACGAAGA 


CGCGCGCACG 


ATTGCCGCCG 


401 


AGTTGGCAAA 


AATGGATATG 


TTCGCATTGG 


GGACkGACGC 


GGTCGCCTCG 


451 


GGCGAAACCT 


ATGGACGCGT 


GTTCGCCGAT 


ATTTTCGAGT 


TGTCGGmGGC 


501 


TTTGGAAGGG 


CGCGCGTTCA 


AAGGAATGTT 


GAAACTGACG 


GCGGAATATA 


551 


AA.AACATCT 


TCGGmGATGC 


CTGCCGTTCG 


GAAACGGCGT 


TGGAGTTGGG 


601 


CGCACTCAAT 


CAGGCGTTGC 


AGGAGATTTC 


AAAAACATCC 


GG. . 



This corresponds to the amino acid sequence <SEQ ID 1032; ORF 259>: 

m259 .pep (partial) 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYT EEL PPLLS ALSAV 

51 ALVWLAWAFV SA RSKAKAEK FYREKMIQNE SIHPVXHASL QHLEHKPQIL 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 
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430 440 450 460 470 480 

490 500 510 520 530 540 

m258 . pep EAAWGEVAKRLAHEIRNPLTPIQLSAERXAXKLGGKLDEQDAQILTRSTDTIVKQVAALK 
it I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 
a 258 EAAWGEVAKRLAHEIRNPLTPIQLSAERLAWKLGGKLDEQDAQILTRSTDTIIKQVAALK 

490 500 510 520 530 540 

550 560 570 580 589 

m258 . pep EMVEAFRNYARSPSLKLENQDLNALIGDVLALYEAGPCRFAADLPANRX 

I I I I i I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I 
a258 EMVEAFRNYARSPSLKLENQDLNALIGDVLALYEAGPCRFAAELAGEPLMMAADTTAMRQ 

550 560 570 580 590 600 

a 258 VLHNIFKNAAEAAEEADVPEVRVKSEAGQDGRIVLTVCDNGKGFGREMLHNAFEPYVTDK 

610 620 630 640 650 660 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1029>: 

g259 . seq 

1 atgatgatgc acgcttctgt ccaaagtcgt ttcgcaccga tactttatgt 

51 tttgattttc tttgccggtt ttttgaccgc gcaaatctgg ttcaatcaga 

101 aagcctatac tgaagagctg cctccgcttc tgtccgcatt gtccgccgtc 

151 gcgctggtgt ggctggcgtg ggcgttcgtg tcggtgcgtt caaaggctaa 

2 01 ggcagaaaag ttctaccgcg aaaaaatgat acagaacgaa agcatacacc 
251 ccgtcctgca cgcttctttg caacacttgg aacacaagcc gcaaatgctc 
301 gccctgctgg tcaaaaacca cggcaaaggc atggcggaac aggtcaggtt 

3 51 caaggcggaa gtgctgcccg acgacgaaga cgcgcgcacg attgccgccg 
401 agttggcaaa aatggatatg ttcgcattgg ggacggacgc ggtcgcctcg 
451 ggcgaaacct atgggcgcgt gttcgccgat attttcgagt tgtcggcggc 
501 tttggaaagg cgcgcgttca aagggatact gaaactgacg gcggaatata 
551 aaaaacatct tcggcgatgc ctgccgttcg gaaacggcgt tggatttggg 
601 cgcgctcaat caggcgttga gggaaatctc gaaaacgccg gaaaagccta 
651 a 

This corresponds to the amino acid sequence <SEQ ID 1030; ORF 259.ng>: 

g259.pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYT EEL PPLLSALSAV 

51 ALVWLAWAFV SV RSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 

151 GETYGRVFAD IFELSAALER RAFKGILKLT AEYKKHLRRC LPFGNGVGFG 

201 RAQSGVEGNL ENAGKA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1031>: 



m259 . seq 


(partial) 










1 


ATGATGATGC 


ACGCTTCTGT ' 


CCAAAGCCGT 


TTCGCACCGA 


TACTTTATGT 


51 


TTTGATTTTC 


TTTGCCGGTT 


TTTTGACCGC 


GCAAATCTGG 


TTCAATCAGA 


101 


AAGCCTATAC 


TGAAGAGCTG 


CCTCCGCTTC 


TGTCCGCATT 


GTCCGCCGTC 


151 


GCGCTGGTGT 


GGCTGGCGTG 


GGCGTTCGTG 


TCGGCGCGTT 


CAAAGGCCAA 


201 


GGCGGAAAAG 


TTCTACCGCG 


AAAAAATGAT 


ACAGAACGAA 


AGCATACACC 


251 


CCGTCsTGCA 


CGCCTCTTTG 


CAACACTTGG 


AACACAAGCC 


GCAAATACTC 


301 


GCCCTGCTGG 


TCAAAAACCA 


CGGCAAAGGG 


ATGGCGGAAC 


AGGTCAGGTT 


351 


CAAGGCGGAA 


GTGCTGCCCG 


ACGACGAAGA 


CGCGCGCACG 


ATTGCCGCCG 


401 


AGTTGGCAAA 


AATGGATATG 


TTCGCATTGG 


GGACkGACGC 


GGTCGCCTCG 


451 


GGCGAAACCT 


ATGGACGCGT 


GTTCGCCGAT 


ATTTTCGAGT 


TGTCGGmGGC 


501 


TTTGGAAGGG 


CGCGCGTTCA 


AAGGAATGTT 


GAAACTGACG 


GCGGAATATA 


551 


AA.AACATCT 


TCGGmGATGC 


CTGCCGTTCG 


GAAACGGCGT 


TGGAGTTGGG 


601 


CGCACTCAAT 


CAGGCGTTGC 


AGGAGATTTC 


AAAAACATCC 


GG. . 



This corresponds to the amino acid sequence <SEQ ID 1032; ORF 259>: 

m259.pep (partial) 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYT EEL PPLLS ALSAV 

51 ALVWLAWAFV SA RSKAKAEK FYREKMIQNE SIHPVXHASL QHLEHKPQIL 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 
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151 GETYGRVFAD IFELSXALEG RAFKGMLKLT AEYKXHLRRC LPFGNGVGVG 
2 01 RTQSGVAGDF KNIR . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 259 shows 94.3% identity over a 212 aa overlap with a predicted ORF (ORF 259.ng) 
from N. gonorrhoeae: 

m259/g259 

10 20 30 40 50 60 

m259 pep MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

g2 5 9 MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVAXjVWLAWAFV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m2 5 9 . pep SARSKAKAEKFYREKMIQNESIHPVXHASLQHLEH^^ 

I : I I I t I I I I I I I I I I I I I I I I I I I II I I I I I I I I I h I I I I I M I I I I I I I I I I I I i i 
g 259 SVRSKAKAEKFYREKMIQNESIHPVLHASLQHLEHKPQMLALLVKNHGKGMAEQVRFKAE 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 5 9 pep VLPDDEDART I AAELAKMDMFALGTDAVAS GETYGRVFAD I FELSXALEGRAFKGMLKLT 

I II I M M IN M Ml I ! MMMIillMI 1 1 1 1 IIMI 1 1 II I : 1 1 Ml ! hlli! 

g 259 VLPDDEDART I AAELAKMDMFALGTDAVAS GETYGRVFAD I FELSAALERRAFKG I LKLT 

130 140 150 160 170 180 

190 200 210 

m2 5 9 . pep AEYKKHLRRCLPFGNGVGVGRTQSGVAGDFKNIR 

I II I II MINIMI III I Mill I 

g2 5 9 AEYKKHLRRCLPFGNGVGFGRAQSGVEGNLENAGKAX 

190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1033>: 

a259.seq (partial) 

1 ATGATGATGC ACGCTTCTGT CCAAAGCCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGCGCGTT CAAAGGCTAA 

201 GGCGGAAAAG TTCTACCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCTTCTTTG CAACACTTGG AACACAAGCC GCAAATGCTC 

301 GCCCTGCTGG TCAAAAACCA CGGCAAAGGG ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

4 01 AGTTGGCAAA AATGGATATG TTTGCATTGG GGACGGACGC GGTCGCCTCG 

451 GGCGAAACCT ATGGACGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 

501 TTTGGAAGGG CGCGCGTTCA AAGGAATGTT GAAACTGACG GCGGAATATA 

551 AAAA.CATCT TCGGCGATGC CTGCCGTTCG GAAACGGCGT TGGAGTTGGG 

601 CGCGCTCAAT CAGGCGTTGC AGGAGATTTC AAAAACATCG GAAAAGTCCA 

651 A 

This corresponds to the amino acid sequence <SEQ ID 1034; ORF 259.a>: 

a259.pep (partial) 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYT EEL PPLLSALSAV 

51 ALVWLAWAFV SARSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 

151 GETYGRVFAD IFELSAALEG RAFKGMLKLT AEYKXHLRRC LPFGNGVGVG 

201 RAQSGVAGDF KNIGKVQ 



m259/a259 98.1% identity in 213 aa overlap 

10 20 30 40 ' 50 60 

m259 . pep MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 

| | | | I I 1 | I I I I | I | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
a 259 MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 



WO 99/57280 



600 



PCT/US99/09346 



10 20 30 40 50 60 

70 80 90 100 110 120 

m259 . pep SARSKAKAEKFYREKMIQNESIHPVXHASLQHLEHKPQILALLVKNHGKGMAEQVRFKAE 
1 I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I | I || : | | | | | | | | | | | | | | | | | | | | | 
a259 SARSKAKAEKFYREKMIQNESIHPVLHASLQHLEHKPQMLALLVKNHGKGMAEQVRFKAE 

70 80 90 100 110 120 

130 140 150 160 170 180 

m259 . pep VLPDDEDARTIAAELAKMDMFALGTDAVASGETYGRVFADIFELSXALEGRAFKGMLKLT 
I I I I I I I I I I I I I I I I I ! I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | 
a259 VLPDDEDARTIAAELAKMDMFALGTDAVASGETYGRVFADIFELSAALEGRAFKGMLKLT 

130 140 150 . 160 170 180 

190 200 210 

m259 . pep AEYKXHLRRCLPFGNGVGVGRTQSGVAGDFKNIR 

I I I I I I I I I I I I II I I I I T I I : I I I I I I I I I I I 
a259 AEYKXHLRRCLPFGNGVGVGRAQSGVAGDFKNIGKVQ 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1035>: 

g259-l.seq 

1 ATGATGATGC ACGCTTCTGT CCAAAGTCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGTGCGTT CAAAGGCTAA 

201 GGCAGAAAAG TTCTACCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCTTCTTTG CAACACTTGG AACACAAGCC GCAAATGCTC 

301 GCCCTGCTGG TCAAAAACCA CGGCAAAGGC ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

4 01 AGTTGGCAAA AATGGATATG TTCGCATTGG GGACGGACGC GGTCGCCTCG 

4 51 GGCGAAACCT ATGGGCGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 

501 TTTGGAA 

This corresponds to the amino acid sequence <SEQ ID 1036; ORF 259-1. ng>: 

g259-l .pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYTEEL PPLLSALSAV 
51 ALVWLAWAFV SVRSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 
101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 
151 GETYGRVFAD IFELSAALE 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1037>: 

m259-l .seq 

1 ATGATGATGC ACGCTTCTGT CCAAAGCCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGCGCGTT CAAAGGCCAA 

201 GGCGGAAAAG TTCTACCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCCTCTTTG CAACACTTGG AACACAAGCC GCAAATACTC 

301 GCCCTGCTGG TCAAAAACCA CGGCAAAGGG ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

401 AGTTGGCAAA AATGGATATG TTCGCATTGG GGACGGACGC GGTCGCCTCG 

451 GGCGAAACCT ATGGACGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 

501 TTTGGAAGGG CGCGCGTTCA AAGGAATGTT GAAACTGACG GCGGAATATA 

551 AAAACATCTT CGGCGATGCC TGCCGTTCGG AAACGGCGTT GGAGTTGGGC 

601 GCACTCAATC AGGCGTTGCA GGAGATTTCA AAAACATCGG AAAAGTCCAA 

651 ACGGATATTT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 1038; ORF 259-l>: 

m259-l .pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYTEEL PPLLSALSAV 
51 ALVWLAWAFV SARSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQIL 
101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 
151 GETYGRVFAD IFELSAALEG RAFKGMLKLT AEYKNIFGDA CRSETALELG 
201 ALNQALQEIS KTSEKSKRIF Y* 

g259-l/m259-l 98.8% identity in 169 aa overlap 
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10 20 30 40 50 60 

g259-l.pep MMMHASVQSRFAPILYVLI FFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I ! I I I I I I I I I I I I I I 
m259-l MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 

10 20 30 40 50 60 

70 80 90 100 110 120 

g259-l.pep S VRSKAKAEKFYREKMI QNES I HPVLHAS LQHLEHKPQMLALLVKNHGKGMAEQVRFKAE 
I : I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I : I I I I I I I t I I I I I I I I I I I I I 
m259-l SARSKAKAEKFYREKMIQNESIHPVLHASLQHLEHKPQILALLVKNHGKGMAEQVRFKAE 

70 80 90 100 110 120 

130 140 150 160 169 

g259-l.pep VLPDDEDARTIAAELAKMDMFALGTDAVASGETYGRVFADIFELSAALE 
! I I I I I I I I ! I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I 
m259-l VLPDDEDARTIAAELAKMDMFALGTDAVASGETYGRVFADIFELSAALEGRAFKGMLKLT 

130 140 150 160 170 180 

The following partial DNA sequence was identified in N: meningitidis <SEQ ID 1039>: 

a259-l.seq 

1 ATGATGATGC ACGCTTCTGT CCAAAGCCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGCGCGTT CAAAGGCTAA 

201 GGCGGAAAAG TTCTACCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCTTCTTTG CAACACTTGG AACACAAGCC GCAAATGCTC 

301 GCCCTGCTGG TCAAAAACCA CGGCAAAGGG ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

401 AGTTGGCAAA AATGGATATG TTTGCATTGG GGACGGACGC GGTCGCCTCG 

451 GGCGAAACCT ATGGACGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 

501 TTTGGAAGGG CGCGCGTTCA AAGGAATGTT GAAACTGACG GCGGAATATA 

551 AAAACATCTT CGGCGATGCC TGCCGTTCGG AAACGGCGTT GGAGTTGGGC ' 

601 GCGCTCAATC AGGCGTTGCA GGAGATTTCA AAAACATCGG AAAAGTCCAA 

651 ACGGATATTT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 1040; ORF 259-1. a>: 

a259-l.pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYTEEL PPLLSALSAV 

51 ALVWLAWAFV SARSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 

151 GETYGRVFAD IFELSAALEG RAFKGMLKLT AEYKNIFGDA CRSETALELG 

201 ALNQALQEIS KTSEKSKRIF Y* 

a259-l/m259-l 99.5% identity in 221 aa overlap 

10 20 30 40 50 60 

a259-l.pep MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m259-l MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 
10 20 30 40 50 60 

70 80 90 100 110 120 

a2 59-1. pep SARSKAKAEKFYREKMIQNESIHPVLHASLQHLEHKPQMLALLVKNHGKGMAEQVRFKAE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I : I I I I I I I I I I ! I I I I | | I | | | 
m259-l SARSKAKAEKFYREKMIQNESIHPVLHASLQHLEHKPQILALLVKNHGKGMAEQVRFKAE 

70 80 90 100 110 120 

130 140 150 160 170 180 

a2 5 9-1. pep VLPDDEDART I AAELAKMDMFALGTDAVASGETYGRVFADIFELSAALEGRAFKGMLKLT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | I I 
m259-l VLPDDEDART I AAELAKMDMFALGT DA VASGETYGRVFADIFELSAALEGRAFKGMLKLT 

130. 140 150 160 170 180 

190 200 210 220 

a2 59- 1 . pep AEYKNI FGDACRSETALELGALNQALQEI SKTSEKSKRI FYX 
I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I 
m2 5 9 - 1 AEYKNI FGDACRSETALELGALNQALQEI SKTSEKSKRI FYX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1041>: 

g260 . seq 
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l atgggtgcgg gtgtagtatt cgttgtcttt cagccgttct tcagcctgtt 

51 tcgagcgttg ttcgagggcg gagtcggtat agtcgaggga gcgcacgatg 

101 ccgctgaatg cgacttcttg tccgaggaat ttacccgtat ccggatcggt 

151 gatgttttta ttgattcggt aggtcagata acggcccggt tctttcaggc 

201 ctttggtgta aaccctggcg cctttggtgt acagcagcct gccttccggg 

251 cccgagagca ggcgcggcgc ggcagcggtt tctttgcggg aaacgatttg 

301 cgggtgctgc ataaagacgc ggtagaagtt gacatcgatg gcgggaatac 

351 cgtatccgga cacttcctta tccggactga ttttgacgac ggggatgccg 

401 tctgtctgtt ccaagccgag gcgcggttcg ccgccaacgt agcgcaacac 

451 caatacctgg cccggataaa tcaggtcggg attgtggatt tgatcccggt 

501 tcgcgcccca caggggggga ccattgccac gggctgtaca ggtatttgcc 

551 cgaaataccc cacagggtgt cgccctgttt ga 

This corresponds to the amino acid sequence <SEQ ID 1042; ORF 260.ng>: 

g260 .pep 

1 MGAG WFWF QPFFSLFRAL F EGGVGIVEG AHDAAECDFL SEEFTRIRIG 

51 DVFIDSVGQI TARFFQAFGV NPGAFGVQQP AFRAREQARR GSGFFAGNDL 

101 RVLHKDAVEV DIDGGNTVSG HFLIRTDFDD GDAVCLFQAE ARFAANVAQH 

151 QYLARINQVG IVDLIPVRAP QGGTIATGCT GICPKYPTGC RPV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1043>: 

m260 . seq 

1 ATGGGTGCGG GTATGGTATT CGTTGTCTTT CGGCCGTTCT CCAGCCTGTT 

51 TCGAGCGTTG TTCGAGGACA GAGTCGGTAT AGTCGAGGGA GCGCACGATG 

101 CCGCTGAATG CGACTTCCTG CCCGAGGAAT TTACCCGTAT CCGGATCGGT 

151 GATGTTTTTA TTGATTCGGT AGGTCAGGTA GCGGCCCGGC TCTTTCAGGC 

201. CTTTGGTGTA AACCCTGGTG CCTTTGGTGT ACAGCAGCCT GCCTTCCGGG 

251 CCCGAGwrCA sGCGCGGyGC GGCAGCGGTT TCTTTGCGGG AAACGATTTG 

3 01 CGGATGCCGC ATAAAGATGC GGTAGAAGTT GACATCGATG GCGGGAATAC 
351 CGTATCCGGA CACTTCCTTA TCCGGACTCA TTTTGACGAC GGGGATGCCG 
401 TCTGTCTGTT CCAAGCCGAG GCGCGGTTCG CCGTCAACGT GGCGCAACAC 

4 51 CAATACCTGG TCCGGATAAA TCAGGTCGGG ATTGTGGATT TGATCCCGGT 
501 TCGCGTyCCA CAG 

This corresponds to the amino acid sequence <SEQ ID 1044; ORF 260>: 

m260 .pep 

1 MGAG MVFWF RPFS5LFRAL F EDRVGIVEG AHDAAECDFL PEEFTRIRIG 

51 DVFIDSVGQV AARLFQAFGV NPGAFGVQQP AFRARXXARX GSGFFAGNDL 

101 RMPHKDAVEV DIDGGNTVSG HFLIRTHFDD GDAVCLFQAE ARFAVNVAQH 

151 QYLVRINQVG IVDLIPVRVP Q 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 260 shows 89.5% identity over a 171 aa overlap with a predicted ORF (ORF 260,ng) 
from N. gonorrhoeae: 

m260/g260 

10 20 30 40 50 60 

MGAGMVFWFRPFSSLFRALFEDRVGIVEGAHDAAECDFLPEEFTRIRIGDVFIDSVGQV 

IMMMIIMII MINIM II! M M M! M MM I MM! I M M: ill MM 

MGAGWFWFQPFFSLFRALFEGGVGIVEGAHDAAECDFLSEEFTRIRIGDVFIDSVGQI 
10 20 30 40 50 60 

70 80 90 100 110 120 

AARLFQAFGVNPGAFGVQQPAFRARXXARXGSGFFAGNDLRMPHKDAVEVDIDGGNTVSG 

M N 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II Mllllllllh IIIIIIIIIMIIIIM 

TARFFQAFGVNPGAFGVQQPAFRAREQARRGSGFFAGNDLRVLHKDAVEVDIDGGNTVSG 
70 80 90 100 110 120 

130 140 150 160 170 

HFLIRTHFDDGDAVCLFQAEARFAVNVAQHQYLVRINQVGIVDLIPVRVPQ 

Mill! II M M I M 1 1 II III MIMI M I MM II 1 1 1 1 M 1 1 1 Ml I 

HFLI RTDFDDGDAVCLFQAEARFAANVAQHQ Y LAR I NQVG IVDLI P VRAPQGGT I ATGCT 
130 140 150 160 170 180 



m260 .pep 
g260 

m260 .pep 
g260 

m2 60 .pep 
g260 
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603 



g26 0 GICPKYPTGCRPV 

190 



a260.seq 




1 


ATGGGTGCGG 


51 


TCGAGCGTTG 


101 


CCGCTGAATG 


151 


GATGTTTTTA 


201 


CTTTGGTGTA 


251 


CCCGAGAGCA 


301 


CGGGTGCCGC 


351 


CGTATCCGGA 


401 


TCTGTCTGTT 


451 


CAATACCTGG 


501 


TCGCGTCCCA 


551 


CGAAATGCCC 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1045>: 

GTATGGTATT CGTTGTCTTT CGGCCGTTCT CCAGCCTGTT 
TTCGAGGACA GAGTCGGTAT AGTCGAGGGA GCGCACGATG 
CGACTTCCTG CCCGAGGAAT TTACCCGTAT CCGGATCGGT 
TTGATTCGGT AGGTCAGGTA GCGGCCCGGC TCTTTCAGGC 
AACCCTGGTG CCTTTGGTGT ACAGCAGCCT GCCTTCCGGG 
GGCGCGGCGC GGCAGCGGTT TCTTTGCGGG AAACGATTTG 
ATAAAGATGC GGTAGAAGTT GACATCGATG GCGGGAATAC 
CACTTCCTTA TCCGGACTCA TTTTGACGAC GGGGATGCCG 
CCAAGCCGAG GCGCGGTTCG CCGTCAACGT GGCGCAACAC 
TCCAGATAAA TCAGGTCGGG ATTGTGGATT TGATCCCGGT 
CAGGCGGCC . GCATTGCCAC GGGCTGTACA GGTATTTGCC 
CACAGGGTGT CGCCCTGTTT GA 

This corresponds to the amino acid sequence <SEQ ID 1046; ORF 260.a>: 

a260.pep 

1 MGAGMVFVVF RPFSSLFRAL FEDRVGIVEG AHDAAECDFL PEEFTRIRIG 

51 DVFIDSVGQV AARLFQAFGV NPGAFGVQQP AFRAREQARR GSGFFAGNDL 

101 RVPHKDAVEV DIDGGNTVSG HFLIRTHFDD GDAVCLFQAE ARFAVNVAQH 

151 QYLVQINQVG IVDLIPVRVP QAAXIATGCT GICPKCPTGC RPV* 

m260/a260 97.1 % identity in 1 7 1 aa overlap 

10 20 30 40 50 . 60 

m260.pep MGAGMVFVVFRPFSSLFRALFEDRVGIVEGAHDAAECDFLPEEFTRIRIGDVFIDSVGQV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I li I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a2 60 MGAGMVFVVFRPFSSLFRALFEDRVGIVEGAHDAAECDFLPEEFTRIRIGDVFIDSVGQV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m260 . pep AARLFQAFGVNPGAFGVQQPAFRARXXARXGSGFFAGNDLRMPHKDAVEVDIDGGNTVSG 
I I I I I i i I i I I I I I I I I I I I I I I I I I ! I I I I I I I I I I 1 : I I I I I I I I I I I I I I I I I I 
a260 AARLFQAFGVNPGAFGVQQPAFRAREQARRGSGFFAGNDLRVPHKDAVEVDIDGGNTVSG 

70 80 90 100 110 120 

130 140 150 160 170 

m260 . pep HFLIRTHFDDGDAVCLFQAEARFAVNVAQHQYLVRINQVGIVDLIPVRVPQ 
I I I I I I I I I I I M I I I I I I I I I I I I I 1 I I I I I i I : I I I I I I I II I I II I I I 
a260 HFLIRTHFDDGDAVCLFQAEARFAVNVAQHQYLVQINQVGIVDLIPVRVPQAAXIATGCT 

130 140 150 160 170 180 

a260 GICPKCPTGCRPVX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1047>: 

g261.seq 

1 atggagcttg ggcatatcgt attccttgtg ctttgcgcgc gttcagacgg 

51 cctttttact ttccagacat tccgccagcc cgcgttcgcg caagatacag 

101 ctcgggcatt cgcggcagcc gccgacgata cccttgtagc aggtgtgggt 

151 ctgttcgcgg atgtagtcca acacgcccat ttcgtccgcc aacgcccacg 

201 tttgcgcctt ggtcaggtac atcagcggcg tgtggatttg aaaatcgtag 

251 tccatcgcca gattaagggt aacgttcatg gatttgacga acacgccgcg 

3 01 gcagtcggga tagcccgaaa aatcggtttc gcacacgccc gcgatgatgt 

351 gccggatacc ctgccctttg gcaaaaatgg cggcgtaaag caggaaaagc 

401 gcgttacgcc cgtccacaaa ggtattggga acgccgttgt cggcggtttc 

451 gatggcggcg gtttcgatgg cggcggtttc gtccatcagg gcgttgtgcg 

501 taatctgccg catcaggctc aaatcgagta cggtttgact gacacccaaa 

551 tcctgcgcga tccactctgc gcgttccagc tcgacggcat ggcgttgccc 

601 gtatcggaag gtgatggctt ggacgttttc gcgcccgtag gtttggattg 
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651 cctgaatcag gcaggtggtc gaatcctgac cgcccgagaa gatgaccaag 
701 gctttttggt ttga 
This corresponds to the amino acid sequence <SEQ ID 1048; ORF 261. ng>: 

g261 .pep 

1 MELGHIVFLV LCARSDGLFT FQTFRQPAFA QDTARAFAAA ADDTLVAGVG 

51 LFADWQHAH FVRQRPRLRL GQVHQRRVDL KIWHRQIKG NVHGFDEHAA 

101 AVGIARKIGF AHARDDVPDT LPFGKNGGVK QEKRVTPVHK GIGNAWGGF 

151 DGGGFDGGGF VHQGWRNLP HQAQIEYGLT DTQILRDPLC AFQLDGMALP 

201 VSEGDGLDVF APVGLDCLNQ AGGRILTARE DDQGFLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1049>: 

m261 . seq 

1 ATGGAGCTTG GGCATATCGT ATTCCTTATG GTTTGCGCGT GTTCAGACGG 

51 CCTTTTTACT TTCCAGATAT TCCGCCAGCC cGcGTTCGCG CAAGATACAG 

101 CTCGGGCATT CGCGgCAGCC GCCGACGATG CCGTTATAGC AGGTGTGGGT 

151 TTGCTCGCGG ATATAGTCCA GCACGCCCAT TTCGTCCGCC AACGCCCACG 

201 TTTGCGCCTT GGTCAGATAC ATCAGCGGCG TGTGGATTTG AAAATCATAG 

251 TCCATCGCCA AATTAAGGGT AACGTTCATC GATTTGACAA ACACGTCGCG 

3 01 GCAGTCGGGA TAGCCGGAGA AGTCGGTTTC GCACACGCCC GCGATGATGT 
351 GCCGTATCCC CTGCCCTTTG GCGTAAATCG CGGCATAGAG CAGGAAAAGC 

4 01 gCGTTGCGGC CGTCTACAAA GGTATTCGGA ACGCCGTTTT CGGCAGTTTC 
4 51 GATGGCGGCG GTGTCGTCCA TCAGGGCATT GTGCGTAATC TGCCGCATCA 
501 GgCTcAAGTC GAGTACGGTT TGTTTGACGC CCAAATCCTG CGCAATCCAG 
551 CGGGCACGTT CCAGCTCGAC GGCATGGCGT TGCCCGTATT GGAAAGTAAT 
6 01 GGCTTGGACG TTTTCGCGCC CGTAGGTTTG GATTGCCTGA ATCAGGCAGG 
651 TGGTCGAATC CTGACCGCCC GAAAAGATGA CCAAGGCTTG TTGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 1050; ORF 261>: 

m261.pep 

1 MELGHIVFLM VCACSDGLFT FQI FRQPAFA QDTARAFAAA ADDAVIAGVG 

51 LLADIVQHAH FVRQRPRLRL GQIHQRRVDL KIIVHRQIKG NVHRFDKHVA 

101 AVGIAGEVGF AHARDDVPYP LPFGVNRGIE QEKRVAAVYK GIRNAVFGSF 

151 DGGGWHQGI VRNLPHQAQV EYGLFDAQIL RNPAGTFQLD GMALPVLESN 

201 GLDVFAPVGL DCLNQAGGRI LTARKDDQGL LV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 261 shows 79.7% identity over a 237 aa overlap with a predicted ORF (ORF 261. ng) 
from N. gonorrhoeae: 

m26l/g261 

10 20 30 40 50 60 

m2 6 1 . pep ME LGH I VFLMVC AC SDGLFTFQI FRQ P AF AQDT ARAF AAAADD AV I AG VGLL AD I VQHAH 

Ml Ml II h:| I II III III I II I I I I M II I I Ml I II h- M I I hi M M M 
g2 6 1 MELGHI VFLVLCARSDGLFTFQTFRQPAFAQDTARAFAAAADDTLVAGVGLFADWQHAH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m26 1 . pep FVRQRPRLRLGQIHQRRVDLKIIVHRQIKGNVHRFDKHVAAVGIAGEVGFAHARDDVPYP 

III I MM II I hi II Ml II hill I MMM I hh III III -llllll I II I 

g261 FVRQRPRLRLGQVHQRRVDLKIWHRQIKGNVHGFDEHAAAVGIARKIGFAHARDDVPDT 

70 80 90 100 110 120 

130 140 150 160 170 

m261 . pep LPFGVNRGIEQEKRVAAVYKGIRNAVFGSFDGGGV VHQGIVRNLPHQAQVEYGLF 

I II I I h M 1 1 1 h hill I II h 1 1 1 1 1 II I h M I II II I h II M 

g261 LPFGKNGGVKQEKRVTPVHKGIGNAWGGFDGGGFDGGGFVHQGWRNLPHQAQIEYGLT 

130 140 150 160 170 180 

180 190 200 210 220 230 

m261 . pep DAQILRNPAGTFQLDGMALPVLESNGLDVFAPVGLDCLNQAGGRILTARKDDQGLLVX 

h 1 1 1 h I = 1 1 I II I II 1 1 h : I 1 1 1 II I 1 1 1 1 1 1 II I II I I I I I h II I h I I I 

g 2 6 1 DTQI LRDPLCAFQLDGMALPVSEGDGLDVFAPVGLDCLNQAGGRI LTAREDDQGFLVX 

190 200 210 220 230 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1051>: 

a261 . seq 

1 ATGGAGCTTG GGCATATCGT ATTCCTTATG GTTTGCGCGT GTTCAGACGG 

51 CCTTTTTACT TTCCAGATAT TCCGCCAGCC CGCGTTCGCG CAAGATACAG 

101 CTCGGGCATT CGCGGCAGCC GCCGACGATG CCGTTATAGC AGGTGTGGGT 

151 TTGCTCGCGG ATATAGTCCA GCGCGCCCAT TTCGTCCGCC AACGCCCAAG 

201 TTTGCGCCTT GGTCAGATAC ATCAGCGGCG TGTGGATTTG AAAATCATAG 

251 TCCATCGCCA GATTAAGGGT AACGTTCATG GATTTGACAA ACACGTCACG 

301 GCAGTCGGGA TAGCCGGAGA AGTCGGTTTC GCACACGCCC GCGATGATGT 

351 GCCGTATCCC CTGCCCTTTG GCGTAAATCG CGGCATAGAG CAGGAAAAGC 

4 01 GCGTTGCGGC CGTCTACAAA GGTATTCGGA ACGCCGTTTT CGGCAGTTTC 

4 51 GATGGCGGCG GTGTCGTCCA TCAGGGCATT GTGCGTAATC TGCCGCATCA 

501 GGCTCAAGTC GAGTACGGTT TGTTTGACGC CCAAATCCTG CGCAATCCAG 

551 CGGGCACGTT CCAGCTCGAC GGCATGGCGT TGCCCGTATT GGAAAGTAAT 

601 GGCTTGGACG TTTTCGCGCC .CGTAGGTTTG GATTGCCTGA ATCAGGCAGG 

651 TGGTCGAATC CTGACCGCCC GAAAAGATGA CCAAGGCTTT TTGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 1052; ORF 261. a>: 

a2 61 . pep 

1 MELGHIVFLM VCACSDGLFT FQIFRQPAFA QDTAR AFAAA ADDAVIAGVG 

51 LLADIVQRAH FVRQRPSLRL GQIHQRRVDL KIIVHRQIKG NVHGFDKHVT 

101 AVGIAGEVGF AHARDDVPYP LPFGVNRGIE QEKRVAAVYK GIRNAV FGSF 

151 DGGGVVHQGI VRNLPHQAQV EYGLFDAQIL RNPAGTFQLD GMALPVLESN 

201 GLDVFAPVGL DCLNQAGGRI LTARKDDQGF LV* 

m261/a261 97.8% identity in 232 aa overlap 

10 20 30 40 50 60 

m261 . pep MELGHIVFLMVCACSDGLFTFQIFRQPAFAQDTARAFAAAADDAVIAGVGLLADIVQHAH 
I I I I I I 1 I I I I I i I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | | | | : | | 
a2 61 MELGHIVFLMVCACSDGLFTFQIFRQPAFAQDTT^AFAAAADDAVIAGVGLLADIVQRAH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m261 . pep FVRQRPRLRLGQIHQRRVDLKIIVHRQIKGNVHRFDKHVAAVGIAGEVGFAHARDDVPYP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I 1 I I I | I i | | | | 
a261 FVRQRPSLRLGQIHQRRVDLKIIVHRQIKGNVHGFDKHVTAVGIAGEVGFAHARDDVPYP 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 61.pep LPFGVNRGIEQEKRVAAVYKGIRNAVFGSFDGGGWHQGIVRNLPHQAQVEYGLFDAQIL 
I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | 
a261 LPFGVNRGIEQEKRVAAVYKGIRNAVFGSFDGGGVVHQGIVRNLPHQAQVEYGLFDAQIL 

130 140 150 160 170 180 

190 200 210 220 230 

m261 . pep RNPAGTFQLDGMALPVLESNGLDVFAPVGLDCLNQAGGRILTARKDDQGLLVX 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I ! M I | | : | | | 
a2 6 1 RN PAGT FQLDGMALPVLE SNGLDV FAPVGLDCLNQAGGR I LTARK DDQGFL VX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1053>: 

g263 . seq 

1 atggcacgtt taaccgtaca caccctcgaa accgcccccg aagccgccaa 

51 accgcgcgta gaggccgtac ccaaaaacaa cggctttatc cccaacctca 

101 tcggcgtatt ggcaaacgcc cccgaagctt tggcgtttta ccaagaagtc 

151 ggcaagctca acgccgccaa cagcctgacc gccggcgaag tcgaagtgat 

201 ccggatcatc gccgtccgca ccaaccaatg cagcttctgc gtggcagggc 

251 acaccaaact cgcaaccctg aaaaaactcc tgtccgagca atccctcaat 

301 gccgcccgcg ctttggcggc aggtaaatct gacgatgcca aactcggcgc 

351 gcttgccgcc ttcacccaag ccgtaatggc gaaaaaaggc gcagtatccg 

401 acgacgaact caacgccttc ctcgaagcgg gctacaaccg gcagcaggca 
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451 gtcgaagtcg taatgggcgt agccttggca actttgtgca actacgccaa 
501 caacctcgcc caaaccgaaa tcaaccccaa attgcaggca tacgcctaa 

This corresponds to the amino acid sequence <SEQ ID 1054; ORF 263 .ng>: 

g263 .pep 

1 MARLTVHTLE TAPEAAKPRV EAVPKNNGFI PNLIGVLANA PEALAFYQEV 

51 GKLNAANSLT AGEVEVIRII AVRTNQCSFC VAGHTKLATL KKLLSEQSLN 

101 AARALAAGKS DDAKLGALAA FTQAVMAKKG AVSDDELNAF LEAGYNRQQA 

151 VEWMGVALA TLCNYANNLA QTEINPKLQ A YA * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1055>: 

m263.seq (partial) 

1 . . GCAGCAGGCG AATTTGACGA TGCCAAACTC GGCGCGCTCG CCGCCTTCAC 

51 CCAAGCCGTA ATGG CGAAAA AAGGCGCGGT ATCCGACGAG GAACTCAAAG 

101 CATTTTTCGA TGCGGGCTAC AACCAGCAGC AGGCAGTCGA AGTCGTGATG 

151 GGCGT . AsyC TgGCAACCCT GTGCAACTAC GTCAACAACC TCGGACAAAC 

201 CGAAATCAAC CCCGAATTGC AGGCTTACGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 1056; ORF 263>: 

m263.pep (partial) 

1 . .GCAGCAGGCG AATTTGACGA TGCCAAACTC GGCGCGCTCG CCGCCTTCAC 

51 CCAAGCCGTA ATGGCGAAAA AAGGCGCGGT ATCCGACGAG GAACTCAAAG 

101 CATTTTTCGA TGCGGGCTAC AACCAGCAGC AGGCAGTCGA AGTCGTGATG 

151 GGCGT, AsyC TgGCAACCCT GTGCAACTAC GTCAACAACC TCGGACAAAC 

201 CGAAATCAAC CCCGAATTGC AGGCTTACGC CTGA 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 263 shows 85.7% identity over a 77 aa overlap with a predicted ORF (ORF 263.ng) 
from N. gonorrhoeae: 

10 20 30 

AAGEFDDAKLGALAAFTQAVMAKKGAVSDE 

1 1 1 = 1 1 1 1 1 II II I II I M 1 1 1! II ! I h 

QCSFCVAGHTKl^TLKKLLSEQSLNAARALAAGKSDDAKLGALAAFTQAVMAKKGAVSDD 
80 90 100 110 120 130 

40 50 60 70 

ELKAFFDAGYNQQQAVEVVMGVXLATLCNYVNNLGQTEINPELQAYAX 

I hi I -1 1 1 hi Hill II II MIIMhllhl I II Ihlll II I 

ELNAFLEAGYNRQQAVEWMGVALATLCNYANNLAQTEINPKLQAYAX 
140 150 160 170 180 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1057>: 

a263.seq 

1 ATGGCACGTT TAACCGTACA CACCCTCGAA ACCGCCCCCG AAGCCGCCAA 

51 AGCGCGCGTC GAGGCGGTAC TTCAAAACAA CGGCTTTATC CCCAACCTTA 

101 TCGGCGTATT ATCAAACGCC CCCGAAGCCT TGGCGTTTTA CCAAGAAGTC 

151 GGCAAGCTCA ACGCCGCCAA CAGCCTGACC GCCGGCGAAG TCGAAGTAAT 

201 CCAGATTATT GCCGCCCGCA CCAACCAATG CGGCTTCTGC GTGGCAGGGC 

251 ACACCAAACT CGCAACCCTG AAAAAACTCC TTTCCGAACA ATCCGTCAAA 

301 GCCGCGCGCG CTTTGGCGGC AGGCGAATTT GACGATGCTA AACTCGGCGC 

351 GCTCGCCGCC TTTACCCAAG CCGTAATGGC AAAAAAAGGC GCGGTATCCG 

4 01 ACGAGGAACT CAAAGCATTT TTTGATGCGG GCTACAACCA GCAGCAGGCA 

451 GTCGAAGTCG TGATGGGCGT AGCCTTGGCA ACTTTGTGCA ACTACGTCAA 

501 CAACCTCGGA CAAACCGAAA TCAACCCCGA ATTGCAGGCT TACGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 1058; ORF 263.a>: 

a2 63.pep 

1 MARLTVHTLE TAPEAAKARV EAVLQNNGFI PNLIGVLSNA PEALAFYQEV 

51 GKLNAANSLT AGEVEVIQII AARTNQCGFC VAGHTKLATL KKLLSEQSVK 

101 AARALAAGEF DDAKLGALAA FTQAVMAKKG AVSDEELKAF FDAGYNQQQA 

151 VEVVMGVALA TLCNYV NNLG QTEINPELQA YA* 



m263/g263 

m263 .pep 
g263 

m263 .pep 
g263 
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m263/a263 97.4% identity in 77 aa overlap 

10 20 30 

m2 63 . pep AAGEFDDAKLGALAAFTQAVMAKKGAVSDE 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! 
a263 QCGFCVAGHTKLATLKKLLSEQSVKAARALAAGEFDDAKLGALAAFTQAVMAKKGAVSDE 
80 90 100 110 120 130 

40 50 60 70 

m2 63 . pep ELKAFFDAGYNQQQAVEWMGXXLATLCNYVNNLGQTEINPELQAYAX 
I I! I I I I i I I I I I I J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 
a263 ELKAFFDAGYNQQQAVEVVMGVALATLCNYVNNLGQTEINPELQAYAX 
140 150 160 170 180 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1059>: 

g264.seq 

1 ttgactttaa cccgaaaaac ccttttcctc ctcaccgccg cgttcggcac 

51 acactccctt cagacggcat ccgccgacgc agtggtcaag ccggaaaaac 

101 tgcacgcctc cgccaaccgc agctacaaag tcgccgaatt cacgcaaacc 

151 ggcaacgcct cgtggtacgg cggcaggttt cacgggcgca aaacttccgg 

201 cggagaccgc tacgatatga acgcctttac cgccgcccac aaaaccctgc 

251 ccatccccag ccatgtgcgc gtaaccaaca ccaaaaacgg caaaagcgtc 

3 01 atcgtccgcg tcaacgaccg cggccccttc cacggcaacc gcatcatcga 

351 cgtatccaaa gccgccgcgc aaaaattggg ctttgtcagc caagggacgg 

401 cacacgtcaa aatcgaacaa atcgtcccgg gccaatccgc accggttgcc 

451 gaaaacaaag acatctttat cgacttgaaa tctttcggta cggaacacga 

501 agcacaagcc tatctgaacc aagccgccca aaatttcgcc gcttcgtcat 

551 caagcccgaa cctctcggtt gaaaaacgcc gttacgaata cgttgtcaaa 

601 atgggcccgt ttgcctcgca ggaacgcgcc gccgaagccg aagcgcaggc 

651 acgcggtatg gttcgggcgg tactgacctc cggttga 

This corresponds to the amino acid sequence <SEQ ID 1060; ORF 264.ng>: 

g264 .pep 

1 LTLTRKTLFL LTAAFGTHSL QTAS ADAWK PEKLHASANR SYKVAEFTQT 

51 GNASWYGGRF HGRKTSGGDR YDMNAFTAAH KTLPIPSHVR VTNTKNGKSV 

101 IVRVNDRGPF HGNRIIDVSK AAAQKLGFVS QGTAHVKIEQ IVPGQSAPVA 

151 ENKDIFIDLK SFGTEHEAQA YLNQAAQNFA ASSSSPNLSV EKRRYEYWK 

201 MGPFASQERA AEAEAQARGM VRAVLTSG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1061>: 



m264 . seq 




1 


TTGACTTTAA 


51 


ACACTCCCTT 


101 


TGCACGCCTC 


151 


CCGAAAAACC 


201 


CGGCGGCAGG 


251 


TGAACGCCTT 


301 


CGCGTAACCA 


351 


CCGCGGCCCC 


401 


CGCAAAAATT 


451 


CAAATCGTCC 


501 


TATCGACTTG 


551 


ACCAAGCCGC 


601 


GTTGAAAAAC 


651 


GCAGGAACGC 


701 


CGGTATTGAC 



This corresponds to the amino acid sequence <SEQ ID 1062; ORE 264>: 

m264 .pep 

1 LTLTRKTLFL LTAAFGTHSL QTASA DAWK AEKLHASANR S YKVAGKRYT 

51 PKNQVAEFTQ TGNASWYGGR FHGRKTSGGE RYDMNAFTAA HKTLPIPSYV 

101 RVTNTKNGKS VIVRVNDRGP FHGNRIIDVS KAAAQKLGFV NQGTAHVKIE 

151 QIVPGQSAPV AENKDIFIDL KS FGTEHEAQ AYLNQAAQNF AVSSSGTNLS 

201 VEKRRYEYW KMGPFTSQER AAEAEAQARG MVRAVLTAG* 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 264 shows 91.6% identity over a 239 aa overlap with a predicted ORF (ORF 264.ng) 
from N. gonorrhoeae: 

m264/g264 

10 20 30 40 50 60 

LTLTRKTLFLLTAAFGTHSLQTASADAWKAEKLHASANRSYKVAGKRYTPKNQVAEFTQ 

Ml II I Ml I Ml III li MINIMI Ml MIMIMMMM MM 

LTLTRKTLFLLTAAFGTHSLQTASADAWKPEKLHASANRSYKVA EFTQ 

10 20 30 40 

70 80 90 100 110 120 

TGNASWYGGRFHGRKTSGGERYDMNAFTAAHKTLPIPSYVRVTNTKNGKSVIVRVNDRGP 

Ml II I II 1 1 MIMI M Mill! Ill MMMIM IMMM M M II MMI IMIM 

TGNASWYGGRFHGRKTSGGDRYDMNAFTAAHKTLPIPSHVRVTNTKNGKSVIVRVNDRGP 
50 60 70 80 90 100 



m2 64 .pep 
9264 

m2 64 . pep 
g264 



130 140 150 160 170 180 

m264 . pep FHGNR 1 1 DVS KAAAQKLGFVNQGTAHVKI EQ I VPGQS AP VAENKD I F I DLKS FGTEHEAQ 

I M 1 1 1 1 1 1 1 i 1 1 1 : 1 1 M 1 : 1 1 1 1 1 1 1 1 1 ] I M I II M I M ! f I ' M 1 1 1 1 1 i I i 1 1 ; 

g264 FHGNRIIDVSKAAAQKLGFVSQGTAHVKIEQIVPGQSAPVAENKDIFIDLKSFGTEHEAQ 
110 120 130 140 150 160 

190 200 210 220 230 240 

m264 . pep AYLNQAAQNFAVSSSGTNLSVEKRRYEYWKMGPFTSQERAAEAEAQARGMVRAVLTAGX 

MMIIIIMMMM MIMIMM I III IMMMIMIMI M IMIMI MMM 

g264 AYIiNQAAQNFAASSSSPNLSVEKRRYEYWKMGPFASQERAAEAEAQARGMVRAVLTSGX 
170 180 190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1063>: 

a264.seq 

1 TTGACTTTAA CCCGAAAAAC CCTTTTCCTC CTCACCGCCG CATTCGGCAT 

51 ACATTCCTTT CAGACGGCAT CCGCCGACGC AGTGGTCAGG GCAGAAAAAC 

101 TGCACGCCTC CGCCAACCGC AGCTACAAAG TCGCCGGAAA ACGCTACACG 

151 CCGAAAAACC AAGTCGCCGA ATTCACGCAA ACCGGCAACG CCTCGTGGTA 

201 CGGCGGCAGG TTTCACGGGC GCAAAACTTC CGGCGGAGAA CGATACGATA 

251 TGAACGCCTT TACCGCCGCC CACAAAACCC TGCCCATCCC CAGCTATGTG 

301 CGCGTAACCA ATACCAAAAA CGGCAAAAGC GTCATCGTCC GCGTCAACGA 

351 CCGCGGCCCC TTCCACGGCA ACCGCATCAT CGACGTATCC AAAGCCGCCG 

4 01 CGCAAAAATT GGGCTTTGTC AACCAAGGGA CGGCGCACGT CAAAATCGAA 

451 CAAATCGTCC CGGGCCAATC CGCACCGGTT GCCGAAAACA AAGACATCTT 

501 CATCGACTTG AAATCTTTCG GTACGGAACA CGAAGCACAA GCCTATCTGA 

551 ACCAAGCCGC CCAAAACCTG GCTTCATCGG CATCAAACCC GAACCTCTCG 

601 GTTGAAAAAC GCCGTTACGA ATACGTCGTC AAAATGGGAC CGTTTGCCTC 

651 GCAGGAACGC GCCGCCGAGG CCGAAGCTCA GGCGCGCGGT ATGGTTCGGG 

701 CGGTATTAAC CGCCGGTTGA 



This corresponds to the amino acid sequence <SEQ ID 1064; ORF 264.a>: 

a264 . pep 

1 LTLTRKTLFL LTAAFGIHSF QTAS ADAVVR AEKLHASANR SYKVAGKRYT 

51 PKNQVAEFTQ TGNASWYGGR FHGRKTSGGE RYDMNAFTAA HKTLPIPSYV 

101 RVTNTKNGKS VIVRVNDRGP FHGNRIIDVS KAAAQKLGFV NQGTAHVKIE 

151 QIVPGQSAPV AENKDIFIDL KS FGTEHEAQ AYLNQAAQNL ASSASNPNLS 

201 VEKRRYEYVV KMGPFASQER AAEAEAQARG MVRAVLTAG* 

m264/a264 96.2% identity in 239 aa overlap 

10 20 30 40 50 60 

m2 64 . pep LTLTRKTLFLLTAAFGTHSLQTASADAVVKAEKLHASANRSYKVAGKRYT PKNQVAEFTQ 

I I I I I I I I I I I I I I I I I i : I I I I I I I I I : I I I II I I | | | | | | | | | | | | | | || | | | | | | j 
a2 64 LTLTRKTLFLLTAAFG I HSFQTASADAWRAEKLHASANRSYKVAGKRYT PKNQVAEFTQ 

10 20 30 4 0 50 60 
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70 80 90 100 110 120 

m2 64 . pep TGNASWYGGRFHGRKTSGGERYDMNAFTAAHKTLPIPSYVRVTNTKNGKSVIVRVNDRGP 
I I I I I I I I I I I I ! I I M I I I I I I I I I I I I I I I I I I I I I I I I t I I II I I I I I I I I I I I I I I 
a 2 64 TGNASWYGGRFHGRKTSGGERYDMNAFTAAHKTLPIPSYVRVTNTKNGKSVIVRVNDRGP 

70 80 90 100 110 120 

130 140 150 160 170 180 

m264 .pep FHGNRIIDVSKAAAQKLGFVNQGTAHVKIEQIVPGQSAPVAENKDIFIDLKSFGTEHEAQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I t I I I I I I I I I I 
a2 64 FHGNRIIDVSKAAAQKLGFVNQGTAHVKIEQIVPGQSAPVAENKDIFIDLKSFGTEHEAQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 64 . pep AYLNQAAQNFAVSSSGTNLSVEKRRYEYWKMGPFTSQERAAEAEAQARGMVRAVLTAGX 
111111111:1 I : ! : I I I I I I I I I I II I I I I I I : I I I 1 I I I I I I I I I I I I I I I I I II I 
a264 AYLNQAAQNLASSASNPNLSVEKRRYEYVVKMGPFASQERAAEAEAQARGMVRAVLTAGX 

190 200 210 220 230 240 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1065>: 



m265 . seq 












1 


ATGTCGGTGA 


TTTTGCCGCC 


GACACGCGCC 


AACGCTGCTT 


TTTCGGCTTG 


51 


GGCGCGGCTG 


ATGATTTTGT 


CTTGTTTGTT 


GTGTTGGTGT 


GCGGCGTGTC 


101 


CGTGGTCGTC 


ATCGCCGTGT 


CCGTCGTGGT 


GGGCGAGCGC 


GGGGGCGGAA 


151 


ATGCTCAGCA 


GTGCGGTTGC 


GGCGGAGGTC 


AAGAGAAGGT 


GTTTGATGTT 


201 


CATAT . TTTT 


GCCTTTGTAA 


ATCGTGGGTT 


GGAAAATGTG 


GATATTAATA 


251 


AGGTATCAAA 


TAACCGTCAG 


CCGGCGGTCA 


ATACCGCCCG 


AAC CAT AC CG 


301 


CGCGCCTGAG 


CTTCGGCTTC 


GGCGGCGCGT 


TCCTGCGAGG 


TAAACGGTCC 


351 


CATTTTGACG 


ACGTATTCGT 


AA 







This corresponds to the amino acid sequence <SEQ ID 1066; ORF 265>: 

m265 .pep 

1 MSVILPPTRA NAAFSAWARL MILSCLLCWC AACPWSSSPC PSWWASAGAE 

51 MLS S AVA AEV KRRCLMFIXF AFVNRGLENV DINKVSNNRQ PAVNTARTIP 

101 RAXASASAAR SCEVNGPILT TYS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 265 shows 88.6% identity over a 123 aa overlap with a predicted ORF (ORF 265.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

MSVILPPTRANAAFSAWARLMILSCLLCWCAACPWSSSPCPSWWASAGAEMLSSAVAAEV 

Mlllllllhlllllllllllllll 1 1 1 Ml 1 1 1 III III 1 1 1 1 1 IN ( Mil! I 

MSVILPPTRAQAAFSAWARLMILSCLPCWCAACPWSSSPCPSWWASAGAEMPNSAVAAAV 
10 20 30 40 50 60 

70 80 90 100 110 120 

KRRCLMF I X FAFVNRGLENVD I NKVSNNRQP AVNTART I PRAXAS AS AARS CE VNGP I LT 

Illlllll Ihlhlhl Mlllllllll MINIMI lllllllllhllllll 

KRRCLMF I - FALVNQGLKNGDINKVSNNRQPEVSTARTIPRACASASAARSCEANGPILT 
70 80 90 100 110 



m265/g265 
m26 5 .pep 
g265 

m265 .pep 
g265 



m265.pep TYSX 
MM 

g265 TYSX 
120 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1067>: 

a265.seq 

1 ATGTCGGTGA TTTTGCCGCC GACACGCGCC AACGCTGCTT TTTCGGCTTG 

51 GGCGCGGCTG ATGATTTTGT CTTGTTTGCT GTGTTGGTGT GCGGCGTGTC 

101 CGTGGTCGTC ATCGCCGTGT CCGTCGTGGT GGGCGAGTGC GGGGGCGGAA 

151 ATGCCCATCA GTGCGGTTGC GGCGGCGGTC AAGAGAAGGC GTTTGAAGTT 

201 CATTTTTGCT CCTGCGAAGT ATCTGGT GGTGT TTGAAGGACG 

251 TAAAGGCGGG ACATCAACCG GCGGTTAATA CCGCCCGAAC CATACCGCGC 

301 GCCTGAGCTT CGGCCTCGGC GGCGCGTTCC TGCGAGGCAA ACGGTCCCAT 

351 TTTGACGACG TATTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 1068; ORF 265. a>: 

a265 .pep 

1 MSVILPPTRA NAAFSAWARL MILSCLLCWC AACPWSSSPC PSWWASAGAE 

51 MPISAVAAAV KRRRLKFIFA PAKYLX . . XC LKDVKAGHQP AVNTARTIPR 

101 A*ASASAARS CEANGPILTT YS* 



m265/a265 79.7% identity in 123 aa overlap 

10 20 30 40 50 60 

m265.pep MSVILPPTRANAAFSAWARLMILSCLLCWCAACPWSSSPCPSWWASAGAEMLSSAVAAEV 
I I | I I I I I I I I I I I i I I ! I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill I 
a2 65 MSVILPPTRANAAFSAWARLMILSCLLCWCAACPWSSSPCPSWWASAGAEMPISAVAAAV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m265.pep KRRCLMFIXFAFVNRGLENVDINKVSNNRQPAVNTARTIPRAXASASAARSCEVNGPILT 
III I It I : :: I ::: I I I I I I I M I I I I I I I I I I I I I I I : I I I I I I 

a 2 65 KRRRLKFI FAPAKYLXXCLKDVKAGHQPAVNTARTIPRAXASASAARSCEANGPILT 

70 80 90 100 110 



m265.pep TYSX 
I I I I 

a265 TYSX 
120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1069>: 

g266 . seq 

1 agttcagacg gcatcgccgc cgacaatgcc caaacagaaa gcccatcatg 

51 accgcatcca tgtacatcct tttggtcttg gcactcatct ttgccaacgc 

101 ccccttcctc acgaccagac tgttcggcgt ggccgcgctc aagcgcaaac 

151 atttcggaca ccacctgatc gagctggcgg caggtttcgc gctgaccgcc 

201 tctcttgcct acatcctcga atcccgtgcg ggagcggtac acaatcaggg 

251 ttgggagttt tacgccaccg tcgtctgcct gtacctcatt ttcgccttcc 

3 01 cgtgtttcgt gcggcggtat ttttggcaca cgcgcaacag ggaataa 

This corresponds to the amino acid sequence <SEQ ID 1070; ORF 266.ng>: 

g266.pep 

1 MQFRRHRRRQ CPNRKPIMTA SMYILLVLA L IFANAPFLTT RLFGVAALKR 

51 KHFGHHL IEL AAGFALTASL AYIL ESRAGA VHNQGWEFY A TWCLYLIFA 

101 FPCFV RRYFW HTRNRE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1071>: 

m266 . seq 

1 ATGCCGTTCC GCAACGCGtT cAGACGGCAT CGCCGCCGAC AACGCCTAAA 

51 CAGAAAGCCC ACCATGACCG CATCCATGTA CATCCTTTTG GTCTTGGCAC 

101 TCATCTTTGC CAACGCCCCC TTCCTCACGA CCAGACTGTT CGGCGTGGCC 

151 rCACTCAAGC GCAAACATTT CGGACACCAC ATGATCGAGC TGGCGGCAGG 

201 TTTCGCGCTG ACCGCCGTTC TTGCCTACAT CCTsGAATCC CGTGCAGGAT 

251 CGGTACACGA TCAGGGTTGG GAGTTTTATG CCACAGTCGT CTGCCTGTAC 

3 01 CTGATTTTTG CGTTTCCATG TTTTGTGTGG CGGTATTTTT GGCACACGCG 

351 CAACAGGGAA TAG 
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This corresponds to the amino acid sequence <SEQ ID 1072; ORF 266>: 

m26 6.pep 

1 MPFRNAFRRH RRRQRLNRKP TMTASMYILL VLA LIFANAP FLTTRLFGVA 

51 XLKRKHFGHH MIELAAGFAL TAVLAYILES RAGSVHDQGW EFYATVVCLY 

101 LIFAFPCFW RYFWHTRNRE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 266 shows 92. 1% identity over a 1 14 aa overlap with a predicted ORF (ORF 266.ng) 
from N. gonorrhoeae: 

m266/g266 

10 20 30 40 50 60 

m266 . pep MPFRNAFRRHRRRQRLNRKPTMTASMYI LLVLALI FANAPFLTTRLFGVAXLKRKHFGHH 

IMIIIII IIIJ llllll III I I III MM I II II II I III llllll III 
g266 MQFRRHRRRQCPNRKPIMTASMYILLVLALIFANAPFLTTRLFGVAALKRKHFGHH 

10 20 30 40 50 

70 80 90 100 110 120 

m26 6 .pep MIELAAGFALTAVLAYILESRAGSVHDQGWEFYATWCLYLIFAFPCFVWRYFWHTRNREX 

Mill III MM III IMMMMIMM MM llllll II Mill II Mill Ml Ml 

g266 LIE LAAGFALTAS LAY I LE S RAGAVHNQGWE F Y ATWCL YL I F AF P C FVRRY FWHTRNRE X 

60 70 80 90 100 110 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1073>: 



a266 . seq 




1 


ATGCCGTTCC 


51 


CAGAAAGCCC 


101 


TGATTTTTGC 


151 


CCGCTCAAGC 


201 


TTTCGCGCTG 


251 


CGGTACACGA 


301 


CTGATTTTTG 


351 


CAACAGGGAA 



This corresponds to the amino acid sequence <SEQ ID 1074; ORF 266.a>: 

a266.pep 

1 MPFRNAFRRH RRRQCPNRKP AMTASMYILL LLALIFA NAP FLTTKLFGIV 

51 PLKRKHFGHH LIELAAGFAL TAVLAYI LES RAGAVHDQGW EFY ATVVCLY 

101 LIFAFPCFVW RYFWHTRNRE * 

m266/a266 9 1 .7% identity in 1 20 aa overlap 

10 20 30 40 50 60 

m266 . pep MPFRNAFRRHRRRQRLNRKPTMTASMYI LLVLALI FANAPFLTTRLFGVAXLKRKHFGHH 

I I I I I I I I I I I I I I I I I I : I I I I I I I I I : I I I ! I I ! I I I I I I : I I I : : I I I I I I I I I 
a266 MPFRNAFRRHRRRQCPNRKPAMTASMYILLLLALIFANAPFLTTKLFGIVPLKRKHFGHH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m266 . pep MIELAAGFALTAVLAYILESRAGSVHDQGWEFYATVVCLYLIFAFPCFVWRYFWHTRNRE 
: I I I I 1 I I M I I I 1 I I I I I I I I I : I t I I I I I I I I i I t I ! I I I I I i t I I I i ! I I I I I I I i I 
a2 66 LIELAAGFALTAVLAYILESRAGAVHDQGWEFYATVVCLYLIFAFPCFVWRYFWHTRNRE 

70 80. 90 100 110 120 

m266.pep X 

I 

a266 X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1075>: 

g267 . seq 
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1 atgcaagtcg ccttttttct cgccgtggta ttcaaaaata tgggtttcca 

51 caatcgcatc ggtcgggcag gcctcttcgc agaaaccgca gaagatgcac 

101 ttggtcaggt cgatgtcgta acgcttggtg cggcgggtgc cgtcttcgcg 

151 ttcttccgat tcgatgttga tcgccattgc cggacacacc gcctcgcaca 

201 atttacacgc gatgcagcgt tcctctccgt tcggaaaacg gcgttgcgcg 

251 tgcagaccgc ggaaacgcac ggattgcggc gttttctctt cgggaaaata 

301 aattgtgtct ttgcgggcaa aaaagttttt gagcgttacg cccatgcctt 

3 51 tgaccagttc gccaagcaga aaggttttta ctaa 

This corresponds to the amino acid sequence <SEQ ID 1076; ORF 267.ng>: 

g26 7.pep 

1 MQVAFFLAW FKNMGFHNRI GRAGLFAETA EDALGQVDW TLGAAGAV FA 

51 FFRFDVDRHC RTHRLAQFTR DAAFLSVRKT ALRVQTAETH GLRRFLFGKI 

101 NCVFAGKKVF ERYAHAFDQF AKQKGFY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1077>: 

m267.seq 

1 GTGCAAGTCG CCTTTTTTCT CGCCGTGGTA TTCAAAAATA TGGGTTTCCA 

51 CAATCGCATC AGTCGGGCAT GCCTCTTCGC AGAAACCGCA GAAGATGCAC 

101 TTGGTCAGGT CGATGTCGTA ACGCTTGGTG CGGCGCGTAC CGTCTTCACG 

151 TTCTTCCGAT TCGATGTTAA TCGCCATTGC CGGACACACT GCCTCACACA 

201 ACTTACACGC GATACACCGC TCTTCGCCGT TCGGATACCG CcGCTGCGCG 

251 TGCAGACCGC GGAAACGCAC GGATTGCGGC GTTTTCTCTT CGGGGAAATA 

3 01 AATTGTGTCT TTGCGGGCGA AAAAGTTTTT GAGCGTTACG CCCATACCTT 

351 TTACCAATTC GCCAAGCAGA AAGGTTTTTA CTAA 

This corresponds to the amino acid sequence <SEQ ID 1078; ORF 267>: 

m267 .pep 

1 VQVAFFLAW FKNMGFHNRI SRACLFAETA EDALGQVDW TLGAARTVFT 

51 FFRFDVNRHC RTHCLTQLTR DTPLFAVRIP PLRVQTAETH GLRRFLFGEI 

101 NCVFAGEKVF ERYAHTFYQF AKQKGFY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 267 shows 82.7% identity over a 127 aa overlap with a predicted ORF (ORF 267.ng) 
from N. gonorrhoeae: 

m267/g267 



10 20 30 40 50 60 

m267 . pep VQVAFFLAWFKNMGFHNRISRACLFAETAEDALGQVDWTLGAARTVFTFFRFDVNRHC 
5 I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I II II I II I | I : I I : | | | | | | : | | | 
9 2 6 7 MQVAFFLAWFKNMGFHNRIGRAGLFAETAEDALGQVDWTLGAAGAVFAFFRFDVDRHC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m26 7.pep RTHCLTQLTRDTPLFAVRIPPLRVQTAETHGLRRFLFGEINCVFAGEKVFERYAHTFYQF 
IN hhllh : = = || r I I I J 1 I I I I I f I I I f I = | i | | 1 I | = | | ! | | | I ! = I || 
9267 , RTHRLAQFTRDAAFLSVRKTALRVQTAETHGLRRFLFGKINCVFAGKKVFERYAHAFDQF 

70 80 90 100 110 120 



m267.pep AKQKGFYX 

Ililllll 
g267 AKQKGFYX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1079>: 

a267 . seq 

1 GTGCAAGTCG CCTTTTTTCT CGCCGTGGTA TTCAAAAATA TGGGTTTCCA 

51 CAATCGCATC GGTCGGGCAG GCTTCTTCGC AGAAACCGCA GAAGATGCAC 

101 TTGGTCAGGT CGATGTCGTA ACGCTTGGTG CGGCGCGTGC CGTCTTCGCG 

151 TTCTTCCGAT TCGATGTTGA TCGCCATTGC GGGGCAAACG GCTTCACACA 

201 ATTTACACGC GATGCAGCGT TCCTCGCCGT TTGGATAACG GCGTTGCGCG 

251 TGCAGACCGC GGAAACGCAC GGATTGCGGC GTTTTCTCTT CGGGAAAATA 

301 AATCGTGTCT TTGCGGGCAA AAAAGTTTTT GAGCGTTACG CCCATACCTT 

351 TTACCAATTC GCCAAGCAGA AAGGTTTTTA CTAA 
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This corresponds to the amino acid sequence <SEQ ID 1080; ORF 267. a>: 

a2 67 . pep 

1 VQVAFFLAW FKNMGFHNRI GRAGFFAETA EDALGQVDVV TLGAARAVFA 

51 FFRFDVDRHC GANGFTQFTR DAAFLAVWIT ALRVQTAETH GLRRFLFGKI 

101 NRVFAGKKVF ERYAHTFYQF AKQKG FY * 

m267/a267 82.7% identity in 127 aa overlap 

10 20 30 40 50 60 

m2 67 . pep VQVAFFLAVVFKNMGFHNRISRACLFAETAEDALGQVDVVTLGAARTVFTFFRFDVNRHC 
I I I I I I I I I I I I I I I I I I I I : I I : I S I ! I I I I i I I I I I I I I I I I I : | | : | | | 1 | | : | | | 
a2 67 VQVAFFL AW FKNMGFHNR I GRAGFFAETAEDALGQVDVVTLGAARAV FA FFRFDVDRHC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m267.pep RTHCLTQLTRDTPLFAVRIPPLRVQTAETHGLRRFLFGEINCVFAGEKVFERYAHTFYQF 
: I I : I I I : : : I I I 1 1 I I I I I I I I I I I I I I I : I I I I I I : I I I I I 1 M M I I I 
a2 67 GANGFTQFTRDAAFLAVWITALRVQTAETHGLRRFLFGKINRVFAGKKVFERYAHTFYQF 

70 80 90 100 110 120 



m267.pep AKQKGFYX 
I I I I I I I I 
a 2 67 AKQKGFYX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1081>: 

G268.seq 

1 atgaaaaaaa atttacccgc actggcattg gcaagtatgc tgattttgtc 

51 gggctgcgac cgtttgggaa taggcaaccc gttttcc'gga aaggaaattt 

101 cctgcggaag cgaagagact aaagagattt tggtcaaact ggtccgcgac 

151 aatgtcgaag gtgaaaccgt caaaactttt gacgacgacg cattcaaaga 

201 ccaagcattt gccgatatcg gcatatcgca tatccgcaga atggtcgaac 

251 gtttgggcat aaccgtcgat gaagtccgaa ctaccgagaa aaccgacacg 

301 tccagcaaac tcaaatgtga agccgcgtta aaactggacg tgcccgacga 

351 tgttgtcgat tatgccgtcg ccgccaacca atctataggc aacagccata 

401 agaaaacgcc cgactttttt gaaccctact accgcaaaga aggcgcgtat 

451 tatgtcaaaa ctatttctta cagcgtccag ccgacagacg acaaaagcaa 

501 aatctttgcc gaactcagtc aggcacacga tatcatccat ccgctcagcg 

551 agctggtgtc tatggcactg attaaagagc cgttggacaa agcgaaacaa 

601 aggaacgaaa aacttgaagc ggcagaagcc accgcgcagg aagcgaggga 

651 ggcagaagaa gcggcggcgc aggaggcatt gggtcgggag caggaagccg 

701 cccgcgtatc cgaatgggaa gaacgctaca agctgtcgcg cagcgagttc 

751 gagcagtttt ggaaaggatt gcctcaaact gtacagaata agctgcaagc 

801 ctcgcagaaa acatggaaaa gcggtatgga caagatctgt gccaacaatg 

851 cgaaagccga aggtgaaacg ccaaacggca taaaagtcag tgagttggcg 

901 tgtaaaacgg cagaaaccga agcacgcttg gaagagctgc acaaccgtaa 

951 aaaagccctt atcgacgaaa tggtcaggga agaggacaag aaagaactgc 

1001 caaagcggct ctga 

This corresponds to the amino acid sequence <SEQ ID 1082; ORF 268.ng>: 

m268.pep 

1 MKKNLPALAL ASMLILSG CD RLGIGNPFSG KEISCGSEET KEILVKLVRD 

51 NVEGETVKTF DDDAFKDQAF ADIGISHIRR MVERLGITVD EVRTTEKTDT 

101 SSKLKCEAAL KLDVPDDWD YAVAANQSIG NSHKKTPDFF EPYYRKEGAY 

151 YVKTISYSVQ PTDDKSKIFA ELSQAHDIIH PLSELVSMAL IKEPLDKAKQ 

201 RNEKLEAAEA TAQEAREAEE AAAQEALGRE QEAARVSEWE ERYKLSRSEF 

251 EQFWKGLPQT VQNKLQASQK TWKSGMDKIC ANNAKAEGET PNGIKVSELA 

3 01 CKTAETEARL EELHNRKKAL IDEMVREEDK KELPKRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1083>: 

m2 68.seq (partial) 

1 . . ATGGCACTGA TTAAAGAGCC GTTGGACAAA GTGAAACAAA GGAACGAAGA 
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51 ACTTGAAGCG GCAGAAGAAG CGGCGGCGCA GGAGGCATTG GGTCGGGAGC 

101 AGGAAGCCGC CCGCGTATCC GAATGGGAAG AACGCTACAA GCTGTCGCGC 

151 AG . CAGTTCG AGCAGTTCTG GAAAGGATTG CCTCAAACCG TACAGAATAA 

201 GCTGCAACCn TCACAGAAAA CATGGAAAAG CGGGATGGAT AAAATCTGTG 

251 CCAACAATGC GAAAGCTGAA GGTAAAACGC CAAACGGCAT AAAATTCAGC 

3 01 GAACTGGCAT GCAAAACGGC GAAAACCGAA GCACGCTTGG AAGAGCTGCA 
351 CAACCGTAAA AAAGCCCTTA TCGACGAAAT GGyCAGGGAA GCGGACAmGA 
401 AAGAACTGTC AAAGCGGCTs TGA 

This corresponds to the amino acid sequence <SEQ ID 1084; ORF 268>: 

m268 .pep (partial) 

1 . . MALIKEPLDK VKQRNEELEA AEEAAAQEAL GREQEAARVS EWEERYKLSR 

51 XQFEQFWKGL PQTVQNKLQP SQKTWKSGMD KICANNAKAE GKTPNGIKFS 

101 ELACKTAKTE ARLEELHNRK KALIDEMXRE ADXKELSKRL * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 268 shows 86.0% identity over a 150 aa overlap with a predicted ORF (ORF 268.ng) 
from N. gonorrhoeae: 

m268/g268 

10 20 
MALI KEPLDKVKQRNEELEAAE 

I I I I I I I I I I : I I I I h I I I I I 
SVQPTDDKSKI FAELSQAHDI IHPLSELVSMALI KEPLDKAKQRNEKLEAAEATAQEARE 
160 170 180 190 200 210 

30 40 50 60 70 80 

- - EAAAOEALGREQEAARVSEWEERYKLSRSQFEQFWKGLPQTVQNKLQPSQKTWKSGMD 

Mil 1 1 1 1 ii ill iiiiMiiiii i ii i him I ::i ii in ill i iiiiiiiiii 

AEEAAAQEALGREQEAARVSEWEERYKLSRSEFEQFWKGLPQTVQNKLQASQKTWKSGMD 
220 230 240 250 260 270 

90 100 110 120 130 140 

KICANNAKAEGKTPNG I KFS ELACKTAKTE ARLEELHNRKKAL I DEMXREADXKELSKRLX 

I !: 1 1 1 1 1 II hi 1 1 i II II 1 1 III I H 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 II I III M 1 1 

KICANNAKAEGETPNGIKVSELACKTAETEARLEELHNR 
280 290 300 310 320 330 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1085>: 

a2 68 . seq 

1 ATGGCACTGA TTAAAGAGCC GTTGGACAAA GCGAAACAAA GGAACGAAGA 

51 ACTTGAAGCG GCAGAAGAAG CGGCGGCGCA GGAGGCATTG GGTCGGGAGC 

101 AGGAAGTCGA CCGCGTATCC GAATGGGAAG AACGCTACAA GCTGTCGCGC 

151 AGCGAGTTCG AGCAGTTCTG GAAAGGATTG CCTCAAACCG TACAGAATAA 

201 GCTGCAAGCC TCACAGAAAA CATGGAAAAG CGGGATGGAT AAAATCTGTG 

251 CCAACAATGC GAAAGCTGAA GGTGAAACGC CAAACGGCAT AAAATTCAGC 

301 GAACTGGCAT GCAAAACGGC GGAAACCGAA GCACGCTTGG AAGAGCTGCA 

351 CAACCGTAAA AAAGCCCTTC TCGACGAAAT GGCCAGGGAA GCGGACAAGA 

4 01 AAGAACTGCC AAAGCGGCTC TGA 

This corresponds to the amino acid sequence <SEQ ID 1086; ORF 268. a>: 

a268 .pep 

1 MALIKEPLDK AKQRNEELEA AEEAAAQEAL GREQEVDRVS EWEERYKLSR 

51 SEFEQFWKGL PQTVQNKLQA SQKTWKSGMD KICANNAKAE GETPNGIKFS 

101 ELACKTAETE ARLEELHNRK KALLDEMARE ADKKELPKRL * 

m268/a268 9 1 .4% identity in 1 40 aa overlap 

10 20 30 40 50 60 

m2 68 .pep MALIKEPLDKVKQRNEELEAAEEAAAQEALGREQEAARVSEWEERYKLSRXQFEQFWKGL 
I 1 I I I I I I I I : II I I I I I I I I I I I I I I I I I I I I I I : i I I I I I I II I I I I : I I I I I I I I 
a2 68 MALIKEPLDKAKQRNEELEAAEEAAAQEALGREQEVDRVSEWEERYKLSRSEFEQFWKGL 



m268 .pep 
g268 

m268 .pep 
g268 

m2 68 .pep 
g268 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m2 68 .pep PQTVQNKLQPSQKTWKSGMDKICANNAKAEGKTPNGIKFSELACKTAKTEARLEELHNRK 
I I I I I I I I I I I I I I I I II I I I I I I I M I I I : I I I I I I I I I I I I I I i : II I I I I I I I I I I 
a2 68 PQTVQNKLQASQKTWKSGMDKICANNAKAEGETPNGIKFSELACKTAETEARLEELHNRK 
70 80 90 100 110 120 

130 140 
KALIDEMXREADXKELSKRLX 
! I I : I I I MM Ml MM 
KALLDEMAREADKKELPKRLX 
130 140 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1087>: 

m268-l . seq 

1 GTGCAATCCC GATATGATGG TTTGCATAAA TTTAAACATA TATGTTCCGC 

51 AGCTATGGCA CTGATTAAAG AGCCGTTGGA CAAAGTGAAA CAAAGGAACG 

101 AAGAACTTGA AGCGGCAGAA GAAGCGGCGG CGCAGGAGGC ATTGGGTCGG 

151 GAGCAGGAAG CCGCCCGCGT ATCCGAATGG GAAGAACGCT ACAAGCTGTC 

201 GCGCAGCGAG TTCGAGCAGT TCTGGAAAGG ATTGCCTCAA ACCGTACAGA 

251 ATAAGCTGCA AGCCTCACAG AAAACATGGA AAAGCGGGAT GGATAAAATC 

301 TGTGCCAACA ATGCGAAAGC TGAAGGTAAA ACGCCAAACG GCATAAAATT 

351 CAGCGAACTG GCATGCAAAA CGGCGAAAAC CGAAGCACGC TTGGAAGAGC 

401 TGCACAACCG TAAAAAAGCC CTTATCGACG AAATGGCCAG GGAAGCGGAC 

4 51 AAGAAAGAAC TGTCAAAGCG GCTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1088; ORF 268-l>: 

m268-l .pep 

1 VQSRYDGLHK FKHICSAAMA LIKEPLDKVK QRNEELEAAE EAAAQEALGR 
51 EQEAARVSEW EERYKLSRSE FEQFWKGLPQ TVQNKLQASQ KTWKSGMDKI 
101 CANNAKAEGK TPNGIKFSEL ACKTAKTEAR LEELHNRKKA LIDEMAREAD 
151 KKELSKRL* 



m268-l/g268 82.3% identity in 164 aa overlap 

10 20 30 

m268-l.pep VQSRYDGLHKFKHICSAAMALIKEPLDKVKQRNE 

: I : I : : : : i I I I II II M I : I I I I I 
g2 68 KEGAYYVKTISYSVQPTDDKSKIFAELSQAHDIIHPLSELVS — MALIKEPLDKAKQRNE 

150 160 170 180 190 200 

40 50 60 70 80 

m2 68-1. pep ELEAAE EAAAQEALGREQEAARVSEWEERYKLSRSEFEQFWKGLPQTVQN 

: \ II I I M II II I II I M II II M II I M M M I M M 11 I II I I II I I I 

g2 68 KLEAAEATAQEAREAEEAAAQEALGREQEAARVSEWEERYKLSRSEFEQFWKGLPQTVQN 
210 220 230 240 250 260 

90 100 110 120 130 140 

m2 68-1. pep KLQASQKTWKSGMDKICANNAKAEGKTPNGIKFSELACKTAKTEARLEELHNRKKALIDE 
I I I I II I I M II I I I I M II M II I M I I I II I I I I M 1 I M I I M M I I I I I I I M II 
g2 68 KLQASQKTWKSGMDKICANNAKAEGETPNGIKVSELACKTAETEARLEELHNRKKALIDE 

270 280 290 300 310 320 

150 159 
m2 68 - 1 . pep MAREADKKELSKRLX 
I : ! I Mill MM 
g2 68 MVREEDKKELPKRLX 
330 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 1089>: 

a268-l.seq 

1 GTGCAATCCC GATATGATGG TTTGCATAAA TTTAAACATA TATGTTCCGC 

51 AGCTATGGCA CTGATTAAAG AGCCGTTGGA CAAAGCGAAA CAAAGGAACG 

101 AAGAACTTGA AGCGGCAGAA GAAGCGGCGG CGCAGGAGGC ATTGGGTCGG 

151 GAGCAGGAAG TCGACCGCGT ATCCGAATGG GAAGAACGCT ACAAGCTGTC 

201 GCGCAGCGAG TTCGAGCAGT TCTGGAAAGG ATTGCCTCAA ACCGTACAGA 

251 ATAAGCTGCA AGCCTCACAG AAAACATGGA AAAGCGGGAT GGATAAAATC 

301 TGTGCCAACA ATGCGAAAGC TGAAGGTGAA ACGCCAAACG GCATAAAATT 

351 CAGCGAACTG GCATGCAAAA CGGCGGAAAC CGAAGCACGC TTGGAAGAGC 



m2 68 .pep 
a268 
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401 TGCACAACCG TAAAAAAGCC CTTCTCGACG AAATGGCCAG GGAAGCGGAC 
4 51 AAGAAAGAAC TGCCAAAGCG GCTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1090; ORF 268-l.a>: 

a268~l .pep 

1 VQSRYDGLHK FKHICSAAMA LIKEPLDKAK QRNEELEAAE EAAAQEALGR 
51 EQEVDRVSEW EERYKLSRSE FEQFWKGLPQ TVQNKLQASQ KTWKSGMDKI 
101 CANNAKAEGE TPNGIKFSEL ACKTAETEAR LEELHNRKKA LLDEMAREAD 
151 KKELPKRL* 

a268-l/m268-l 95.6% identity in 158 aa overlap 

10 20 30 40 50 60 

a268-l pep VQSRYDGLHKFKHICSAAMALIKEPLDKAKQRNEELEAAEEAAAQEALGREQEVDRVSEW 
I | | | I I I I I II I I I I I I I M I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : Mill 
m268-l VQSRYDGLHKFKHICSAAMALIKEPLDKVKQRNEELEAAEEAAAQEALGREQEAARVSEW 

10 20 30 40 50 60 

70 80 90 100 110 120 

a2 68-1. pep EERYKLSRSEFEQFWKGLPQTVQNKLQASQKTWKSGMDKICANNAKAEGETPNGIKFSEL 
I | || I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I 
m268-l EERYKLSRSEFEQFWKGLPQTVQNKLQASQKTWKSGMDKICANNAKAEGKTPNGIKFSEL 

70 80 90 100 110 120 



130 140 150 159 

a2 68-1. pep ACKTAETEARLEELHNRKKALLDEMAREADKKELPKRLX 

I I I I I : I I I I I I I I M I I M I : I I I I I i ! I Ill 

m2 68-l ACKTAKTEARLEELHNRKKALIDEMAREADKKELSKRLX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1091>: 

g269.seq 

1 atggtttggc gtgtgaattg cgcggcaacg gcggcgctga ttttttcgtc 

51 cagcccttgg atttgggcgg tggtgtgggt gtggtcgcgg tcggcttttt 

101 cctgcaaacc ttgcgccagc cttgacgcgt ccagtgcgcc ggcgttggcg 

151 gtttcgccgt gggactttat ccggaacacg gcttcgccca aggtgtcggc 

201 ggctttgatg cacagtttta aaaccagggc tttggggcgg ttttctgcgc 

251 cgcccgttgc cattttgctg tccaatcgcg gggttaaaaa accgttgtcg 

301 tttaagtcgc cgtccgtcca agtcgatacg agcgcgcttc tttgcctttc 

351 attgcggtct tcgtaa 

This corresponds to the amino acid sequence <SEQ ID 1092; ORF 269.ng>: 

g269 .pep 

1 MWJRVNCAAT AALIFSSSPW IWAWWVWSR SAFS CKPCAS LD AS SAP ALA 

51 VSPWDFIRNT ASPKVSAALM HSFKTRALGR FSAPPVAILL SNRGVKKPLS 

101 FKSPSVQVDT SALLCLSLRS S* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1093>: 



m269 . seq 












1 


ATGGTTTGGC 


GTGTGAATTG 


CGCGGCAACG 


GCGGTGCTGA 


TTTTTTCGTC 


51 


CAGCCCTTGG 


ATTTGGGCGG 


CGGTGTGGGT 


GTGGTCTCGG 


TCGGCTTTGT 


101 


CTTGCAAACC 


TTGCGCCaCG 


TGCCCGCGTC 


CAGCGCCTGC 


GTTGATGGTT 


151 


TCGCCGTGGG 


ACTTTATCCA 


AAACACGGCT 


TCGCCCAAGG 


TGTCGGCGGC 


201 


TTTGATGCAC 


AGTTTTAAAA 


CCAGGGCTTT 


GGGGCGGTTT 


TCGTCGCCGC 


251 


CTGTCGCCAT 


TTTGCTGTCC 


GAGCGCGGGG 


TTAAAAAGCC 


GTTGTCGTTT 


301 ' 


AAATTTTCGT 


CCGTCCAAGT 


CGATACGAGC 


GCGCTTCTCT 


GCCTTTCGTT 


351 


GCGGTCTTCG 


TAA 









This corresponds to the amino acid sequence <SEQ ID 1094; ORF 269>: 

m269 .pep 

1 MWJRVNCAAT AVLIFSSSPW IWAAVWVWSR SALS CKPCAT CPRPAPALMV 

51 SPWDFIQNTA SPKVSAALMH SFKTRALGRF SSPPVAILLS ERGVKKPLSF 

101 KFSSVQVDTS ALLCLSLRSS * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 269 shows 87.6% identity over a 121 aa overlap with a predicted ORF (ORF 269.ng) 
from N. gonorrhoeae: 



m269 - pep 
g269 

m269 .pep 
g269 

m2 69 .pep 
g269 



MVWRVNCAATAVLIFSSSPWIWAAVWVWSRSALSCKPCATCP-RPAPALMVSPWDFIQNT 

1 1 1 1 1 Mill hi I II III II I hllll lllhl 1 1 1 1 h 1 1 1 1 IMIMhM 

MVWRVNCAATAALIFSSSPWIWAWWVWSRSAFSCKPCASLDASSAPALAVSPWDFIRNT 



59 



60 



119 



ASPKVSAALMHSFKTRALGRFSSPPVAILLSERGVKKPLSFKFSSVQVDTSALLCLSLRS 

1 1 1 ii ii 1 1 1 1 1 1 1 1 1 in ii hi nun hi 1 1 iii i in minium iii iii 

ASPKVSAALMHSFKTRALGRFSAPPVAILLSNRGVKKPLSFKSPSVQVDTSALLCLSLRS 120 

SX 121 
I I 

SX 122 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1095>: 

a269. seq 

1 ATGGTTTGGC GTGTGAATTG CGCGGCAACG GCGGTGCTGA TTTTTTCGTC 

51 CAGCCCTTGG ATTTGGGCGG CGGTGTGGGT GTGGGCGCGG TCTGCTTTGT 

101 CTTGGAGGTT TTGCGCCAGC GTGCCCGCGT CCAGCGCGCC GGCGTTGACG 

151 GTTTCGCCGT GGGACTTTAT CCAGAACACG GCTTCGCCCA AGGTGTCGGC 

201 GGCTTTGATG CACAGTTTTA AAACCAGGGC TTTGGGGCGG TTTTCGTCGC 

251 CGCCTGTCGC CATTTTGCTG TCCGGGCGCG GGGTTAAAAA GCCGTTGTCG 

301 TTTAAATTTT CGTCCGTCCA AGTCGATACG AGCGCGCTTC TCTGCCTTTC 

351 GTTGTGGTCT TCGTAA 

This corresponds to the amino acid sequence <SEQ ID 1096; ORF 269.a>: 

a269.pep 

1 MVWRVNCAAT AVLIFSSSPW IWAAVWVWAR SALS WRFCAS VPASSAPALT 

51 VSPWDFIQNT ASPKVSAALM HSFKTRALGR FSSPPVAILL SGRGVKKPLS 

101 FKFSSVQVDT SALLCLSLWS S* 



m269/a269 90. 1 % identity in 1 2 1 aa overlap 

10 20 30 



40 



50 



59 



m2 69.pep MVWRVNCAATAVLIFSSSPWIWAAVWVWSRSALSCKPCATCP-RPAPALMVSPWDFIQNT 

I I I | I I I I I I II II I I I I I ! I I I I I I I I : I II I I : I I : I I I I I I I I I I I I I I I 
a2 69 MVWRVNCAATAVLIFSSSPWIWAAVWVWARSALSWRFCASVPASSAPALTVSPWDFIQNT 
10 . 20 30 40 50 60 

60 70 80 90 100 110 119 

m269.pep ASPKVSAALMHSFKTRALGRFSSPPVAILLSERGVKKPLSFKFSSVQVDTSALLCLSLRS 
| | | | j | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I ! 
a2 69 ASPKVSAALMHSFKTRALGRFSSPPVAILLSGRGVKKPLSFKFSSVQVDTSALLCLSLWS 
70 80 90 100 110 120 



m2 69 .pep 
a269 



120 
SX 
! ! 
sx 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1097>: 



g270 . seq 








1 


atgaataaaa 


accgcaaatt 


actgcttgcc 


51 


tgccgccttc 


aagctcgttt 


tgttgcaatg 


101 


aagccgtggc 


ggcgcaatgc 


gatttgaccg 


151 


ggaagccgtg 


tccgcgccgc 


cgccgtttca 


201 


ttatatcgaa 


cacgcgcccg 


ccggcacgga 


251 


gtatgaaaaa 


tatggatatg 


ggtttcaacc 


301 


ccgtcgggga 


cttggcaggc 


agcacgcatc 


351 


aggcaggcgc 


gattttacgg 


cggacattac 


401 


agacggcatt 


taccgccgaa 


taa 



This corresponds to the amino acid sequence <SEQ ID 1098; ORF 270.ng>: 
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g2 70.pep 

1 MNKNRKLLLA ALLLTAFA AF KLVLLQWWQA QQPQAVAAQC DLTEGCTLPD 

51 GSRVRAAAVS TKKPFDIYIE HAPAGTEQVS ISFSMKNMDM GFNRYMFERQ 

101 PSGTWQAARI RLPVCVEGRR DFTADITIGS RTFQTAFTAE * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1099>: 

m270.seq 

1 ATGAATAAAA ACCGTAAATT ACTGCTTGCC GCACTGCTGC TGATTGCCTT 

51 TGCCGCCGTC AAGCTCGTTT TGTTGCAATG GTGGCAGGCG Ca . CAGCCGC 

101 AAGCTGTGGC GGCGCAATGC GATTTGACCG AGGGTTGCAC GCTGCCGGAC 

151 GGAAGCCGCG TCCGCGCCGC CGCcGTTTCA ACCAAAAAAC CGTTTGATAT 

201 TTATATCGAA CACGCGCCCG CCGGCACGGA ACAGGTCAGC ATCAGCTTCA 

251 GTATGAAAAA TATGGATATG GGTTTCaACC GCTATATGTT CGAGCGGCAA 

3 01 cCGTCGGGGA CTTGGCAGGC AGTACGCATC CGCCTGCCCA TCTGTGTCGA 

3 51 AGGCAGGCGC GATTTTACGG CGGACATTAC AATCGGCAGT CGGACATTTC 

401 AGACGGCATT TACCGCCGAA TAA 

This corresponds to the amino acid sequence <SEQ ID 1 100; ORF 270>: 

m2 7 0.pep 

1 MNKNRKLLLA ALLLIAFA AV KLVLLQWWQA XQPQAVAAQC DLTEGCTLPD 

51 GSRVRAAAVS TKKPFDIYIE HAPAGTEQVS ISFSMKNMDM GFNRYMFERQ 

101 PSGTWQAVRI RLPICVEGRR DFTADITIGS RTFQTAFTAE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 270 shows 96.4% identity over a 140 aa overlap with a predicted ORF (ORF 270.ng) 
from N. gonorrhoeae: 

m270/g270 

10 20 30 40 50 60 

MNKNRKLLLAALLLIAFAAVKLVLLQWWQAXQPQAVAAQCDLTEGCTLPDGSRVRAAAVS . 

IIIIMIIMIIM II II MINIMI: 1 1 1 1 1 1 1 1 1 1 1 1 f I f 1 1 1 1 1 1 1 1 1 f 1 1 i I 

MNKNRKLLLAALLLTAFAAFKLVLLQWWQAQQPQAVAAQCDLTEGCTLPDGSRVRAAAVS 
10 20 30 40 . 50 60 

70 80 90 100 110 120 

TKKPFDIYIEHAPAGTEQVSISFSMKNMDMGFNRYMFERQPSGTWQAVRIRLPICVEGRR 

1 1 M II 1 1 Ml I M 1 1 1 II M I M MMM 1 1 M II M M 1 1 II I M M I MM M I I M 

TKKPFDIY I EHAPAGTEQVS I SFSMKNMDMGFNRYMFERQPSGTWQAARI RLPVCVEGRR 
70 80 90 100 110 120 

130 140 
D FTAD I T I GS RTFQTAFTAEX 

1 1 Ml I II M II! Ill Mill 

DFT AD I T I GS RT FQT AFT AEX 
130 140 



m270 .pep 
g270 

m2 70 .pep 
g270 

m270.pep 
g270 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 101>: 

a270 . seq 

1 ATGAATAAAA ACCGTAAATT ACTGCTTGCC GCACTGCTGC TGATTGCCTT 

51 TGCCGCCGTC AAGCTCGTTT TGTTGCAATG GTGGCAGGCG CAGCAGCCGC 

101 AAGCTGTGGC GGCGCAATGC GATTTGACCG AGGGTTGCAC GCTGCCGGAC 

151 GGAAGCCGCG TCCGCGCCGC CGCCGTTTCA ACCAAAAAAC CGTTTGATAT 

201 TTATATCGAA CACGCGCCCG CCGGCACGGA ACAGGTCAGC ATCAGCTTCA 

251 GTATGAAAAA TATGGATATG GGTTTCAACC GCTATATGTT CGAGCGGCAA 

301 CCGTCGGGGA CTTGGCAGGC AGTACGCATC CGCCTGCCCA TCTGTGTCGA 

351 AGGCAGGCGC GATTTTACGG CGGACATTAC AATCGGCAGC CGGACATTTC 

4 01 AGACGGCATT TACCGCCGAA TAA 

This corresponds to the amino acid sequence <SEQ ID 1 102; ORF 270.a>: 

a270 .pep 

1 MNKNRKLLLA ALLLIAFAAV KLVLLQWWQA QQPQAVA AQC DLTEGCTLPD 

51 GSRVRAAAVS TKKPFDIYIE HAPAGTEQVS ISFSMKNMDM GFNRYMFERQ 
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101 PSGTWQAVRI RLPICVEGRR DFTADITIGS RTFQTAFTAE * 

m270/a270 99.3% identity in 1 40 aa overlap 

10 20 30 40 50 60 

m270 . pep MNKNRKLLLAALLLIAFAAVKLVLLQWWQAXQPQAVAAQCDLTEGCTLPDGSRVRAAAVS 
I I I I I I I i 1 I I I I I I I I I I ] I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
a270 MNKNRKLLLAALLLIAFAAVKLVLLQWWQAQQPQAVAAQCDLTEGCTLPDGSRVRAAAVS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m270 .pep TKKPFDIYIEHAPAGTEQVS I SFSMKNMDMGFNRYMFERQPSGTWQAVRI RLPICVEGRR 

I ! I I I I I I I I f I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | j | | | 

a270 TKKPFDIYIEHAPAGTEQVS I SFSMKNMDMGFNRYMFERQPSGTWQAVRI RLPICVEGRR 

70 80 90 100 110 120 

130 140 
m270.pep DFTADITIGSRTFQTAFTAEX 
I I I I I I II I I I I I I I I I I I I I 
a270 DFTADITIGSRTFQTAFTAEX 

130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 103>: 

g271 . seq 

1 atgttcagtt cgcggatggc gaggatttgg gcgacggggg taacgttgtg 

51 tatggtcagt ccgtgtccgg cgttgacgac caagcccaaa tcgccggcga 

101 aatgcgcgcc gttttggatg cgctcgaact gcctgatttg ttcggcgtgg 

151 ctttgtgcgt cggcatatgc gccggtgtgc agctcgacaa cgggcgcgcc 

201 gacatcacgg gcggcttgga tttgcctgtc gtcggcatcg ataaacaagg 

251 acacgcgtat gcccgcgtcg gtcaggattt tggcgaattc ggcgattttt 

301 tcctgttgcg ccaatacgtc caaaccgcct tcggtcgtga tttcctgccg 

351 tttttcaggc acgatgcaca cgtcttccgg catcacttta agcgcgtttt 

401 cgagcatttc ttccgtcaac gccatttcaa ggttcaggcg cgtgcggatg 

451 gcgtttttga cggcaaatac atccgcgtct ttgatgtggc ggcggtcttc 

501 gcgcaggtgc atggtaatca ggtctgcacc gtgcgtttcg gcaaccagtg 

551 ccgcctccac ggggctggga taa 

This corresponds to the amino acid sequence <SEQ ID 1 104; ORF 271.ng>: 

g271 .pep 

1 MFSSRMARIW ATGVTLCMVS PCPALTTKPK SPAKCAPFWM RSNCLICSAW 

51 LCASAYA PVC SSTTGAPTSR AAWICLSSAS INKDTRMPAS VRILANSAIF 

101 SCCANTSKPP SWISCRFSG TMHTSSGITL SAFSSISSVN AISRFRRVRM 

151 AFLTANTSAS LMWRRSSRRC MVIRSAPCVS ATSAASTGLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 105>: 

m271 . seq 

1 AwGTTCAGTT CGCGGATGGC GAGGATTTGG GCGATGGGGG TAACGTTGTG 

51 TATGGTCAGT CCGTGTCCGG CGTTGACGAC CAAGCCCAAA TCGCCGGCGA 

101 AATGCGCGCC GTTTTGGATG CGCTCGAACT GCCTGATTTG TTCGGCGTGG 

151 CTGCGCGCGT CGGCATACGC GCCTGTGTGC AGCTCGACAA CGGGCGCGCC 

201 GACATCACGG GCGGCTTGGA TTTGCCTGTC GTCGGCATCG ATAAACAAAG 

251 ACACGCGTAT GCCTGCGTCG GTCAGGATTT TGGTGAACCC GGCGATTTTT 

301 TCCTGTTGCG CCAATACGTC CAAACCGCCT TCGGTCGTGA TTTCCTGACG 

351 TTTTTCAGGC ACGATGCACA CGTCTTCCGG CATCACTTTC AAAGCGTTTT 

401 CCAACATTTC TTCCGTCAAC GCCATTTCAA GGTTCAGGCG CGTGCGGATG 

451 GCGTTTTTGA CGGCAAACAC GTCCGCGTCT TTGATGTGGC GGCGGTCTTC 

501 GCGCAGGTGC ATGGTAATCA AATCCGCACC GTGCGTTTCG GCAACCAGTG 

551 CCGCCTCCAC GGGGCTGGGA TAA 

This corresponds to the amino acid sequence <SEQ ID 1 106; ORF 271>: 

m271 .pep 

1 XFSSRMARIW AMGVTLCMVS PCPALTTKPK SPAKCAPFWM RSNCLICSAW 

51 LRASAYA PVC SSTTGAPTSR AAWICLSSAS INKDTRMPAS VRILVNPAIF 

101 SCCANTSKPP SWISXRFSG TMHTSSGITF KAFSNISSVN AISRFRRVRM 

151 AFLTANTSAS LMWRRSSRRC MVIKSAPCVS ATSAASTGLG * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 271 shows 95.2% identity over a 189 aa overlap with a predicted ORP (ORF 271. ng) 
from N. gonorrhoeae: 

m27l/g271 

10 20 30 40 50 60 

m271 . pep XFSSRMARIWAMGVTLCMVSPCPALTTKPKSPAKCAPFWMRSNCLICSAWLRASAYAPVC 

Illlllllll I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I II I 
g271 MFSSRMARIWATGVTLCMVSPCPALTTKPKS PAKCAPFWMRSNCLICSAWLCASAYAPVC 

10 20 30 40 50 60 



70 80 90 100 110 120 

m2 71 .pep SSTTGAPTSRAAWICLSSASINKDTRMPASVRILVNPAIFSCCANTSKPPSWISXRFSG 

1 1 II I II I II II II illlllllll III II II 1 1 hi Illlllllll llllllll INI 

g271 SSTTGAPTSRAAWICLSSASINKDTRMPASVRILANSAIFSCCANTSKPPSWISCRFSG 

70 80 90 100 110 120 



130 140 150 160 170 180 

m271 .pep TMHTSSGITFKAFSNISSVNAISRFRRVRMAFLTANTSASLMWRRSSRRCMVIKSAPCVS 
I I I I I I I I I :: I I I : I f I I I I I I I I I I I I I I I I I I I I f I I 1 I I I M I I I I I I I : I I I t I I 
g 271 TMHTSSGITLSAFSSISSVNAISRFRRVRMAFLTANTSASLMWRRSSRRCMVIRSAPCVS 

130 140 150 160 170 180 



190 

m271 .pep ATSAASTGLGX 
I I I I I I I I I I I 
g271 ATSAASTGLGX 

190 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 107>: 

a271 . seq 

ATGTTCAGTT CGCGGATGGC GAGGATTTGG 
TATGGTCAGT CCGTGTCCGG CGTTGACGAC 
AATGCGCGCC GTTTTGGATG CGCTCGAACT 
CTGCGCGCGT CGGCATACGC GCCTGTGTGC 
GCGGCTTGGA 

GCCCGCGTCG GTCAGGATTT TGGTGAATTC 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



G AC AT C AC GG 
ACACGCGTAT 
TCTTGTTGCG 
TTTTTCCGGC 



GCGATGGGGG 
CAAGCCCAAA 
GCCTGATTTG 
AGCTCGACAA 
TTTGCCTGTC GTCGGCATCG 



TAACGTTGTG 
TCGCTGGCAA 
TTCGGCGTGG 
CGGGCGCGCC 
ATAAACAAGG 
GGCAATTTTG 



CCAATACGTC CAAGCCGCCT TCGGTCGTGA TTTCCTGACG 
ACGATGCACA CGTCTTCCGG CATCACTTTA 



AGCGCGTTTT 
CGTGCGGATG 
GGCGGTCTTC 

ATGGTAATCA GGTCGGCACC GTGCGTTTCG GCAACCAGTG 
GGGGCTGGGA TAA 



CGAGCATTTC TTCCGTCAAC GCCATTTCAA GGTTCAGGCG 
GCGTTTTTGA CAGCAAACAC GTCCGCGTCT TTGATGTGGC 
GCGCAGGTGC 
CCGCCTCCAC 



This corresponds to the amino acid sequence <SEQ ID 1 108; ORF 271 .a>: 

a271 .pep 

1 MFSSRMARIW AMGVTLCMVS PCPALTTKPK SLAKCAPFWM RSNCLICSAW 

51 LRASAYAPVC SSTTGAPTSR AAWICLSSAS INKDTRMPAS VRILVNSAIL 

101 SCCANTSKPP SVVIS*RFSG TMHTSSGITL SAFSSISSVN AISRFRRVRM 

151 AFLTANTSAS LMWRRSSRRC MVIRSAPCVS ATSAASTGLG * 



m271/a271 96.3% identity in 1 89 aa overlap 



10 20 30 40 50 60 

m271 .pep XFSSRMARIWAMGVTLCMVSPCPALTTKPKSPAKCAPFWMRSNCLICSAWLRASAYAPVC 
I | | | I I I I I I I I I ] I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
a271 MFSSRMARIWAMGVTLCMVSPCPALTTKPKSLAKCAPFWMRSNCLICSAWLRASAYAPVC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m271.pep SSTTGAPTSRAAWICLSSASINKDTRMPASVRILVNPAIFSCCANTSKPPSVVISXRFSG 
I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I II I I : I I I I I I I I I I I I I I i I I M I 
a271 SSTTGAPTSRAAWICLSSASINKDTRMPASVRILVNSAILSCCANTSKPPSVVISXRFSG 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m271.pep TMHTSSGITFKAFSNISSVNAISRFRRVRMAFLTANTSASLMWRRSSRRCMVIKSAPCVS 
t I I I I I I I I : : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 
a271 TMHTSSGITLSAFSSISSVNAISRFRRVRMAFLTANTSASLMWRRSSRRCMVIRSAPCVS 

130 140 150 160 170 180 

190 

m2 7 1 . pep ATS AASTGLGX 

I I I I I 1 I I I I I 
a271 ' ATSAASTGLGX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 109>: 

g272 . seq 

1 atgactgcaa aggaagaact gttcgcatgg 

51 caaaggttcc gacctgtttg tgacgaccca 

101 tggacggcaa aatcacccgc atcacggacg 

151 tgtatggaaa tcgccttttc gattatgagt 

201 ttcatcgacc aacgagtgca atttcgccat 

251 gcttccgcgt caatgcgatg atacagcgcg 

301 cgcgcgatta ccagcaagat tcccaagttt 

351 ggccttgaag gatgttgcgc tgaaaaaacg 

401 gcggcaccgg ctcgggcaaa tcgacttcgc 

451 cgcaatgaaa attcgttcgg acacatcatc 

501 gtttgtccac gaacacaaaa actgcatcat 

551 tggacacgga aaactggatg gcggcgttga 

601 ccggatgtga tccttatcgg cgaaatccgc 

651 cgccatcgcc tttgccgaaa cggggcattt 

701 ccaacagcac caatcaggcg ctcgaccgca 

751 gagcggcgcg aacaattgct gacggatttg 

801 tatttcgcaa cgcctcgttc cgcgagacgg 

851 cagtcgaggt gctgctcaat tcgcccctga 

901 ggcaacatcc atgaaatcaa agaagtgatg 

951 tatgcagacc ttcgaccaac acctttacca 

1001 tttccttgca ggatgccttg aaaaatgccg 

1051 ttggcggtac agttgcgcag ccgcagggca 

1101 ggaactgctc tga 

This corresponds to the amino acid sequence <SEQ ID 1110; ORF 272.ng>: 

g272 -pep 

1 MTAKEELFAW LRHMNKNKGS DLFVTTHFPP AMKLDGKITR ITDEPLTAEK 

51 CMEIAFSIMS AKQAEEFSST NECNFAISLP DTSRFRVNAM IQRGATALVF 

101 RAITSKIPKF ESLNLPPALK DVALKKRGLV I FVGGTGSGK STSLASLIDY 

151 RNENSFGHII TIEDPIEFVH EHKNCIITQR EVGVDTENWM AALKNTLRQA 

201 PDVILIGEIR DRETMDYAIA FAETGHLCMA TLHANSTNQA LDRIINFFPE 

251 ERREQLLTDL SLNLQAFISQ RLVPRDGGKG RVAAVEVLLN SPLISELIHN 

301 GNIHEIKEVM KKSTTLGMQT FDQHLYQLYE KGEISLQDAL KNADSAHDLR 

351 LAVQLRSRRA QSSDPDLELL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1 1>: 

m272 . seq 

1 ATGACCGCAA AGGAAGAACT GTTCGCATGG CTGCGCCATA TGAwCCAAAA 

51 CAAAGGTTCC GACCTGTTCG TGACAACCCA TTTCCCGCCC GCAATGAAGC 

101 TGGACGGCAA AATCACCCGC ATCACGGACG AACCGCTGAC GGCGGAAAAA 

151 TGTATGGAAA TCGCCTTTTC GATTATGAGT GCGAAGCAGG CGGAAGAATT 

201 TTCATCGACC AACGAGTGCA ACTTCGCCAT CAGCCTGCCG GACACCAGCC 

251 GCTTCCGCGT CAATGCGATG ATACAGCgCG GCGCGACGGC GTTGGTATTC 

301 CGTACGATTA CCAGCAAGAT TCCCAAGTTT GAAAGCCTGA ACCTGCCGCC 

3 51 AGTCTTGAAG GATGTCGCGC TGAAAAAACG CGGGCTGGTT ATTTTTGTCG 

4 01 GCGGCACCGG CTCGGGTAAA TCGACTTCGC TTGCCTCGCT TATCGACTAC 
451 CGCAATGAAA ATTCGTTCGG ACACATCATC ACCATCGAAG ACCCGATCGA 
501 GTTTGTCCAC GAACACAAAA ACTGCATCAT CACCCAGCGC GAGGTCGGCG 



ctgcgccata 
tttcccgccc 
aaccgctgac 
gcgaagcagg 
cagcctgccg 
gtgcgacggc 
gaaagcctga 
cgggctggtt 
tcgcctcgct 
accatcgaag 
tacccagcgc 
aaaatacgct 
gaccgtgaaa 
gtgtatggcg 
tcatcaactt 
tcgctcaacc 
cggcaagggc 
tttcggagtt 
aaaaaatcca 
attgtatgaa 
attccgcaca 
caaagttccg 



tgaacaaaaa 
gctatgaagc 
ggcggaaaaa 
cggaagaatt 
gacaccagcc 
gttggtattc 
acctgccgcc 
atttttgtcg 
tatcgactac 
atccgatcga 
gaggtcggcg 
gcgtcaggcg 
caatggacta 
acgctgcacg 
cttccccgag 
ttcaggcgtt 
agggtggcgg 
gattcacaac 
ctaccctggg 
aaaggcgaga 
tgatttgcgt 
accccgattt 
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551 TGGATACGGA AAACTGGATG GcGGCGTTGA 

601 CCTGATGTCA TCCTTATCGG CGAAATCCGT 

651 CGCCATTGCC TTTGCCGAAA CGGGGCATTT 

701 CCAACAGCAC CAATCAGGCA CTCGACCGCA 

751 GAGCGGCGCG AACAATTGCT GACGGATTTG 

801 TATTTCGCAA CGCCTCGTTC CGCGAGACGG 

851 CAGTCGAGGT GCTGCTCAAT TCGCCCCtGA 

901 GGCAACATCC ATGAAATCAA AGAAGTGATG 

951 TATG CAGACC TTCGATCAAC ACCTTTACCA 

1001 TTTCCCTGCA AGAAGCATTG AAAAATGCCG 

1051 TTGGCGGTAC AGTTGCGCAG CCGCCGCGCG 

1101 GGnACTGCTC TGA 

This corresponds to the amino acid sequence <SEQ ID 1 1 12; ORF 272>: 

m272.pep 

1 MTAKEELFAW LRHMXQNKGS DLFVTTHFPP AMKLDGKITR ITDEPLTAEK 

51 CMEIAFSIMS AKQAEEFSST NECNFAISLP DTSRFRVNAM IQRGATALVF 

101 RTITSKIPKF ESLNLPPVLK DVALKKRGLV IFVGGTGSGK STSLASLIDY 

151 RNENS FGHII TIEDPIEFVH EHKNCIITQR EVGVDTENWM AALKNTLRQA 

2 01 PDVILIGEIR DRETMDYAIA FAETGHLCMA TLHANSTNQA LDRIINFFPE 

251 ERREQLLTDL SLNLQAFISQ RLVPRDGGKG RVAAVEVLLN SPLISELIHN 

301 GNIHEIKEVM KKSTTLGMQT FDQHLYQLYE KGDISLQEAL KNADSAHDLR 

351 LAVQLRSRRA QSXSPDLXLL * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORP from N. gonorrhoeae 

ORF 272 shows 97.6% identity over a 370 aa overlap with a predicted ORF (ORF 272.ng) 
from N. gonorrhoeae: 

m272/g272 

10 20 30 40 50 60 

MTAKEELFAWLRHMXQNKGSDLFVTTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
| | | || I I I I I I I I I : I I I I I I I I I 1 I I M M M I I I I I I 1 I M I I I I I I I I I I I M M I 
MTT^KEELFAWLRHMNKNKGSDLFVTTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
10 20 30 40 50 60 

70 80 90 100 110 120. 

AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRTITSKIPKFESLNLPPVLK 
| t I I 1 1 I I I I I i I I I I I I I I i M I I I I I i I I I M I I I I I I t : I I I I I I I I I I M I I I : t I 
AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRAITSKIPKFESLNLPPALK 
70 80 90 100 110 120 

130 140 150 160 170 180 

DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHIITIEDPIEFVHEHKNCIITQR 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 M I M 1 1 1 M M I M 1 1 II 1 1 1 1 1 i 

DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHIITIEDPIEFVHEHKNCIITQR 
130 140 150 160 170 180 

190 200 210 220 230 240 

EVGVDTENWMAALKNTLRQAPDVILIGEIRDRETMDYAIAFAETGHLCMATLHANSTNQA 

| | | | | t I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M M I I I 
EVGVDTENWMAALKNTLRQAPDVILIGEIRDRETMDYAIAFAETGHLCMATLHANSTNQA 

190 200 210 220 230 240 

250 260 270 280 290 300 

LDRIINFFPEERREQLLTDLSLNLQAFISQRLVPRDGGKGRVAAVEVLLNSPLISELIHN 

| I I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I II I 1 I I I I I I I I I I I I I I 

LDRI IN FFPEERREQLLTDLSLNLQAFISQRLVPRDGGKGRVAAVEVLLN SPLISELIHN 
250 260 270 280 290 300 

310 320 330 340 350 360 

GNIHEI KE VMKKSTTLGMQT FDQHLYQL YEKG D I S LQEALKN ADS AH DLRLAVQLRSRRA 

II III II ||||ll.lllllllllllll:llll:|llllllllllllll 

GNIHEIKEVMKKSTTLGMQTFDQHLYQLYEKGEISLQDALKNADSAHDLRLAVQLRSRRA 
310 320 330 340 350 360 



AAAACACGCT 
GACCGCGAAA 
GTGTATGGCG 
TCATCAACTT 
TCGCTCAACC 
CGGCAAGGGC 
TTTCGGAGTT 
AAAAAATCCA 
ATTGTATGAA 
ATTCCGCACA 
CAaAGTTyCA 



GCGTCAGGCG 
CAATGGACTA 
ACGCTGCACG 
TTTCCCCGAG 
TTCAGGCGTT 
AGGGTGGCGG 
GATTCACAAC 
CTACCCTGGG 
AAAGG CGAT A 
CGATTTGCGT 
GCCCCGATTT 



m272.pep 
g272 

m272.pep 
g272 

m272.pep 
g272 

m272.pep 
g272 

m272.pep 
g272 

m272 .pep 
g272 
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m272.pep 
g272 



370 
QSXSPDLXLLX 
II : I I I III 
QSSDPDLELLX 

370 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 13>: 

a272 . seq 

ATGACCGCAA AGGAAGAACT GTTCGCATGG 
CAAAGGTTCC GACCTGTTCG TGACGACCCA 
TGGACGGCAA AATCACCCGC ATCACGGACG 
TGTATGGAAA TCGCCTTTTC GAT TAT GAG T 
AACGAGTGCA ACTTCGCCAT 
CAATGCGATG ATACAGCGCG 
CCAGCAAGAT TCCCAAGTTT 
GATGTCGCGC TGAAAAAACG 
CTCGGGCAAA TCGACTTCGC 
ATTCGTTCGG ACACATCATC 
GAACACAAAA ACTGCATCAT 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



TTCATCGACC 
GCTTCCGCGT 
CGTGCGATTA 
GGTCTTGAAG 
GCGGCACCGG 
CGCAATGAAA 
GTTTGTCCAC 
TGGATACGGA 
CCGGATGTGA 
CGCCATTGCT 
CCAACAGCAC 
GAGCGGCGCG 
TATTTCGCAA 
CAGTCGAGGT 
GGCAATATCC 
TAT G GAG ACT 
TTTCCTTGCA 
TTGGCGGTAC 
GGAACTGCTC 



AAACTGGATG GCGGCGTTGA 
TTCTGATCGG CGAAATCCGC 
TTTGCCGAAA CGGGGCATTT 
CAATCAGGCA CTCGACCGCA 
AACAATTGCT GACGGATTTG 



CGCCTCGTTC 
GCTGCTCAAT 
ATGAAATCAA 
TTCGACCAAC 
GGATGCCTTG 
AGTTGCGCAG 
TGA 



CGCGAGACGG 
TCGCCCCTGA 
AGAAGTGATG 
ACCTTTACCA 
AAAAATGCCG 
CCGCCAGGCG 



CTGCGCCATA 
TTTCCCGCCC 
AACCGCTGAC 
GCGAAGCAGG 
CAGTCTGCCG 
GTGCGACGGC 
GAAAGCCTGA 
CGGGCTGGTT 
TTGCCTCGCT 
ACCATCGAAG 
CACCCAGCGC 
AAAACACGCT 
GACCGCGAAA 
GTGTATGGCG 
TCATCAACTT 
TCGCTCAACC 
CGGCAAGGGC 
TTTCGGAGTT 
AAAAAATCCA 
ATTGTATGAA 
ATTCCGCACA 
CAAAGTTCCG 



TGAACAAAAA 
GCAATGAAGC 
GGCGGAAAAA 
CGGAAGAATT 
GACACCAGCC 
GTTGGTATTC 
ACCTGCCGCC 
ATTTTTGTCG 
TATCGACTAC 
ACCCGATCGA 
GAGGTCGGCG 
GCGTCAGGCA 
CAATGGACTA 
ACGCTGCACG 
TTTCCCCGAG 
TTCAGGCATT 
AGGGTGGCGG 
GATTCACAAC 
CTACCCTGGG 
AAAGGCGAGA 
CGATTTGCGT 
GTCCCGATTT 



This corresponds to the amino acid sequence <SEQ ID 1 1 14; ORF 272. a> 
a272 .pep 

1 MTAKEELFAW LRHMNKNKGS DLFVTTHFPP AMKLDGKITR 

51 CMEIAFSIMS AKQAEEFSST NECNFAISLP DTSRFRVNAM 

101 RAITSKIPKF ESLNLPPVLK DVALKKRGLV IFVGGTGSGK 

151 RNENSFGHI I TIEDPIEFVH EHKNCIITQR EVGVDTENWM 

201 PDVILIGEIR DRETMDYAIA FAETGHLCMA TLHANSTNQA 

251 ERREQLLTDL SLNLQAFISQ RLVPRDGGKG RVAAVEVLLN 

301 GNIHEIKEVM KKSTTLGMQT FDQHLYQLYE KGEISLQDAL 

351 LAVQLRSRQA QSSGPDLELL * 



m272/a272 97.6% identity in 370 aa overlap 

10 20 30 



40 



50 



ITDEPLTAEK 
IQRGATALVF 
STSLASLIDY 
AALKNTLRQA 
LDRIINFFPE 
SPLISELIHN 
KNADSAHDLR 



60 



m272 .pep MTAKEELFAWLRHMXQNKGSDLFVTTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
I I II I I I I I I II II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 11 I I I 
a272 MTAKEELFAWLRHMNKNKGSDLFVTTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
10 20 30 40 50 60 

70 80 90 100 110 120 

m272.pep AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRTITSKIPKFESLNLPPVLK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I II I I I I 
a272 AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRAITSKIPKFESLNLPPVLK 

70 80 90 100 110 120 

130 140 150 160 170 180 

m272 .pep DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHI ITIEDPIEFVHEHKNCIITQR 

I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I 
a272 DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHI ITIEDPIEFVHEHKNCIITQR 

130 140 150 160 170 180 
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190 200 210 220 230 240 

m272 .pep EVGVDTENWMAALKNTLRQAPDVILIGEIRDRETMDYAIAFAETGHLCMATLHANSTNQA 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I f I 1 I I I I I I I I 
a272 EVGVDTENWMAALKNTLRQAPDVILIGEIRDRETMDYAIAFAETGHLCMATLHANSTNQA 

190 200 210 220 230 240 



250 260 , 270 280 290 300 

m272 .pep LDRI INFFPEERREQLLTDLSLNLQAFISQRLVPRDGGKGRVAAVEVLLNSPLISELIHN 

I I I I I I I I I I I I I I I I I I II I I 1 I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I It I I I 
a272 LDRI INFFPEERREQLLTDLSLNLQAFISQRLVPRDGGKGRVAAVEVLLNSPLISELIHN 

250 260 270 280 290 300 



310 320 330 340 350 360 

m272 . pep GNIHEIKEVMKKSTTLGMQTFDQHLYQLYEKGDISLQEALKNADSAHDLRLAVQLRSRRA 
I I I II I I I I I I 1 I I I II I I I I I I I I I I I I I I I : I I I I ; I I I I I I I I I I I I I I I I I I I I : I 
a272 GNIHEIKEVMKKSTTLGh3QTFDQHLYQLYEKGEISLQDALKNADSAHDLRLAVQLRSRQA 
310 320 330 340 350 360 



370 

m272.pep QSXSPDLXLLX 
II -III III 
a272 QSSGPDLELLX 
370 



g273 . seq 




1 


atgagtcttc 


51 


caacgaaaat 


101 


gcaaacacac 


151 


ccattcaccg 


201 


tgccggcaag 


251 


gttgttcctt 


301 


cgggaaattg 


351 


gtttgagaag 


401 


gtcgctcacc 


451 


tgcccacttg 


501 


tttttcttac 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 15>: 

aggcggtatt tatatacccc ccaagccgta ccgcacaata 
caggaaaacg gcggtaaagc tcataaacag ggacaaagcg 
cgaccgccgt caggatatag gcgtattcga ggccggaact 
ttttcctgcc gtttcttgtc gcttttgaaa taaaggatga 
cagcgcggca gccgcgcccg acattggcat tgtgttcatt 
aacggttaaa aacccgcccg gccgtgcaac cgttttaagg 
caaaatttgt ttgcgggcgc gtgccgctga aatcaaggcg 
tgtttccnac gcgcccgccc tatgtgccga aatattattt 
tgcaaaatcg ccaagaacgc gctttgcgga atttccacgt 
tttcatacgg cgtttgcctg ctttttgttt ttcaagcagt 
gcgtaa 

This corresponds to the amino acid sequence <SEQ ID 1 1 16; ORF 273. ng>: 

g273 .pep 

1 MSLQAVFIYP PSRTAQYNEN QENGGKAHKQ GQSGKHTDRR QDIGVFEAGT 

51 PFTVFLPFLV AFEIKDDAGK QRGSRARHWH CVHCCSLTVK NPPGRATVLR 

101 REIAKFVCGR VPLKSRRFEK CFXRARPMCR NIICRSPAKS PRTRFAEFPR 

151 C PLVSYGVCL LFVFQAVF SY A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 17>: 



CGACCGCTGT CAGGATATAG GCGTATTCAA GGCCGGAACT 



AATGCTTAAA AACCCGCCTG TCCGTGCAAC CGTTTTAAGG 
CAAAATTTGT TTGCGGGCGC GTGCCCCTGA AATCAGGGCG 



CATACGGCGT TTACCTGCCT TTTGTkTwTC AAGCAGTTTT 
TAA 

This corresponds to the amino acid sequence <SEQ ID 1 1 18; ORF 273>: 

m273 .pep 

1 MSLQAVFIYP PSRTAQYNEN QENGGKAHKQ GQSGKHADRC QDIGVFKAGT 
51 . PFPVFLPLLV AFEIKDDAGK QRGSRARH*H CVHCCSLMLK NPPVRATVLR 
101 RQIAKFVCGR VPLKSGRFEG CSRRAALCAG VICRSPAKSP RTRFAEFPHC 
151 PLVSYGVYLP FVXQAVF SYA * 

Computer analysis of this amino acid sequence gave the following results: 



m273 . seq 




1 


ATGAGTCTTC 


51 


CAACGAAAAT 


101 


GCAAACACGC 


151 


CCATTCCCCG 


201 


TGCCGGCAAG 


251 


GTTGTTCCTT 


301 


CGGCAAATTG 


351 


GTTTGAGGGG 


401 


GCTCACCTGC 


451 


CCACTTGTTT 


501 


TTCTTACGCG 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 273 shows 86.0% identity over a 171 aa overlap with a predicted ORF (ORF 273.ng) 
from N. gonorrhoeae: 

m273/g273 

10 20 30 40 50 60 

m273 . pep MSLQAVFIYPPSRTAQ'YNENQENGGKAHKQGQSGKHADRCQDIGVFKAGTPFPVFLPLLV 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I !l!lll:(|||| it I I : I I 
g273 MSLQAVFIYPPSRTAQYNENQENGGKAHKQGQSGKHTDRRQDIGVFEAGTPFTVFLPFLV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m273 . pep AFEIKDDAGKQRGSRARHXHCVHCCSLMLKNPPVRATVLRRQIAKFVCGRVPLKSGRFEG 

I I I ! I 1 I I I I I I I I I I I I I I I I I I I I : 1 I I I 1 I I I I I I : i I I I I I I I I I I | | Ml 
g27 3 AFEIKDDAGKQRGSRARHWHCVHCCSLTVKNPPGRATVLRREIAKFVCGRVPLKSRRFEK 

70 80 90 100 110 120 

130 140 150 160 170 

m273 . pep CSRRA-ALCAGVICRSPAKSPRTRFAEFPHCPLVSYGVYLPFVXQAVFSYAX 
i 11 : I : : I I I I I i I I I I I I I I I I I : I I t I I I I I i I I I I I I i ! I 1 
g273 CFXRARPMCRNIICRSPAKSPRTRFAEFPRCPLVSYGVCLLFVFQAVFSYAX 

130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 19>: 

a273.seq 

1 ATGAGTCTTC AGGCGGTATT TGTATACCCC CCAAGCCGTA CCGCACAATA 

51 CAACGAAAAT CAGGAAAACG GCGGTAAAGC TCATAAACAG GGACAAAGCG 

101 GCAAACACGC CGACCGCCGT CAGGATATAG GCGTATTCCA GACCGGAACT 

151 CCATTCACCG TTTTCCTGCC GCTTTTTGTC GCTTTTGAAA TAAAGGATGA 

201 TGCCGGCAAG CAGCGCGGCA GCCGCGCCCG ACATTAGCAT AATGTTCATT 

251 GTTGTTCCTT AACGGTTAAA AACCCGCCCG TCCGTGCAAC CGTTTTTAAG 

301 AGGCGGTAAA TCACAAAGTT TGTTGGCGGA CGTGCTCTCT TACAATCAGG 

351 GCGGTTTAAG GGGCATGATG CACTGCCCCG TGTGCCGGAT ATTATTTGTC 

4 01 GCTCACCTGC AAAATTGCCA AGAACGCGCT TTGCGGGATT TCCACATTGC 

4 51 CCACTTGTTT CATACGGCGT TTGCCTGCTT TTTGTTTTTC AAGCAGTTTT 

501 TTCTTACGCG TAA 

This corresponds to the amino acid sequence <SEQ ID 1 120; ORF 273.a>: 

a273 .pep 

1 MSLQAVFVYP PSRTAQYNEN QENGGKAHKQ GQSGKHADRR QDIGVFQTGT 

51 PFTVFLPLFV AFEIKDDAGK QRGSRARH*H NVHCCSLTVK NPPVRATVFK 

101 RR*ITKFVGG RALLQSGRFK GHDALPRVPD IICRSPAKLP RTRFAGFPHC 

151 PLVSYGVCLL FVFQAV FSYA * 

m273/a273 80.1% identity in 171 aa overlap 

10 20 30 40 50 60 

m273.pep MSLQAVFI YPPSRTAQYNENQENGGKAHKQGQSGKHADRCQDIGVFKAGTPFPVFLPLLV 

I I I I ! I I : I I I I I I I I I I II I I 1 I I I I I I I I I I I 1 I I I I I I I I I 1 : : I I II I i I I I : I 
a273 MSLQAVFVYPPSRTAQYNENQENGGKAHKQGQSGKHADRRQDIGVFQTGTPFTVFLPLFV 

10 20 30 40 50 60 

70 80 90 100 • 110 119 

m273 . pep AFEIKDDAGKQRGSRARHXHCVHCCSLMLKNPPVRATVL-RRQIAKFVCGRVPLKSGRFE 

II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 1 I I : II I : I I | I 1 : I : I I I I : 
a273 AFEIKDDAGKQRGSRARHXHNVHCCSLTVKNPPVRATVFKRRXITKFVGGRALLQSGRFK 

70 80 90 100 110 120 

120 130 140 150 160 170 

m273.pep GCSRRAALCAGVICRSPAKSPRTRFAEFPHCPLVSYGVYLPFVXQAVFSYAX 
I : : : I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I 
a273 GHDALPRV-PDIICRSPAKLPRTRFAGFPHCPLVSYGVCLLFVFQAVFSYAX 
130 140 150 160 170 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 121>: 

g274 .seq 

1 ATGGCGGGGC CGATTTTTGT CGTCatCGCC AgcgTCGCTA TGTTTTTTGT 

51 CGCGCAGCAG CACGCGACAG ATTTGGTTAC GGACGATTAT TATAAGGATG 

101 G C AAG CAT AT CGACATCCAG CTTCATCGGG ATGAAGAAGC CGTCAGACGG 

151 CATATCGGGG TGCAGGTCCT CATTTCTCCC GATATGAATG CGG CAAAAGT 

2 01 GTTTGTCGGc ggCgagtTTG ACGGCAAACA GCCTTTGAAC CTGCTGCTGA 
251 TGCACCCGAC CCGCAAGGCG GACGATCAAA CCGTCGCCCT CAAGCCCGTC 

3 01 GGCAGCGCGC AGAACGGCAG GGCGGAATAT GAGGCGGTgt tcaaAACCCT 

3 51 TCCGCCGGCC AACCACTGGT ATGTGCGCGT GGAggacgCG GCAGGCGTGT 

4 01 GGCGCGTCGA GAACAAATGG ATTACCAGCC AGGGCAATGC GGTCGATTTG 
451 ACCCCGATGG ACAAACTTTT CAATAATGCA GGAAGCAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 1 122; ORF 274.ng>: 

g274 .pep 

1 MAGPIFWIA SVAMFFVAQQ HA TDLVTDDY YKDGKHIDIQ LHRDEEAVRR 

51 HIGVQVLISP DMNAAKVFVG GEFDGKQPLN LLLMHPTRKA DDQTVALKPV 

101 GSAQNGRAEY EAVFKTLPPA NHWYVRVEDA AGVWRVENKW ITSQGNAVDL 

151 TPMDKLFNNA GSK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 123>: 

m274 . seq 

1 ATGGCGGGGC CGATTTTTGT CGTCATCGCC AGCGTCGCTA TGTTTTTTGT 

51 CGCGCAGCAG CACGCGACAG ATTTGGTTAC GGACGATTAT TATAAAGACG 

101 GCAAACATAT CGACATCCAG CTTCATCGGG ATGAAGAAGC CGTCAGACGG 

151 CATATCGGGG TGCAGGTTCT CATTTCCCCC GATATGAATG CGGCAAAAGT 

201 GTTTGTCGGC GGCGAGTTTG ACGGCAAACA GCCTTTGAAC CTGCTGCTGA 

251 TGCACCCGAC CCGCAAGGCG GACGATCAAA CCGTCGCCCT CAAGCCCGTC 

3 01 GGCAGCGCGC AGAACGGCAG GGCGGAATAT GAGGCGGTGT TCAAAACCCT 

3 51 TTCGCCGACC AACCACTGGT ATGTGCGCGT GGAGGACGCG GCAGGCGTGT 

4 01 GGCGCGTCGA GAACAAATGG ATTACCAGCC AAGG CAATGC GGTCGATTTG 
4 51 ACCCCGATGG ACAAGCTTTT CAATAATACT GAAAGCAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 1 124; ORF 274>: 

m274 .pep 

1 MAGPIFWIA SVAMFFVAQQ HA TDLVTDDY YKDGKHIDIQ LHRDEEAVRR 

51 HIGVQVLISP DMNAAKVFVG GEFDGKQPLN LLLMHPTRKA DDQTVALKPV 

101 GSAQNGRAEY EAVFKTLS PT NHWYVRVEDA AGVWRVENKW ITSQGNAVDL 

151 TPMDKLFNNT ESK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 274 shows 97.5% identity over a 163 aa overlap with a predicted ORF (ORF 274.ng) 
from N. gonorrhoeae: 

g274/m274 

10 20 30 40 50 60 

g274 . pep MAGPIFVVIASVAMFFVAQQHATDLVTDDYYKDGKHIDIQLHRDEEAVRRHIGVQVLISP 
I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I | I I I | | | | | 
m27 4 MAGPI FVVI ASVAMFFVAQQHATDLVTDDYYKDGKHI DIQLHRDEEAVRRHIGVQVLI S P 

10 20 30 40 50 60 

70 80 90 100 110 120 

g27 4 .pep DMNAAKVFVGGEFDGKQPLNLLLMHPTRKADDQTVALKPVGSAQNGRAEYEAVFKTLPPA 
I I I I I I I i I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : 
m27 4 DMNAAKVFVGGEFDGKQPLNLLLMHPTRKADDQTVALKPVGSAQNGRAEYEAVFKTLSPT 

70 80 90 100 110 120 

130 140 150 160 

g274 . pep NHWYVRVEDAAGVWRVENKWITSQGNAVDLTPMDKLFNNAGSKX 
I I I I I I I I I I I I I I I I I I I I I I I 1 I | | 1 | | | | | | | | | | | : | | j 
m274 NHWYVRVEDAAGVWRVENKWITSQGNAVDLT PMDKLFNNTESKX 

130 140 150 160 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 125>: 
a274 . seq 

1 ATGGCGGGGC CGATTTTTGT CGTCATCGCC AGCGTCGCTA TGTTTTTTGT 

51 CGCGCAGCAG CACGCGACAG ATTTGGTTAC GGACGATTAT TATAAAGACG 

101 GCAAGCATAT CGACATCCAG CTTCATCGGG ATGAAGAAGC CGTCAGACGG 

151 CATATCGGGG TGCAGGTTCT CATTTCCCCC GATATGAATG CGGCAAAAGT 

201 GTTTGTCGGC GGCGAGTTTG ACGGCAAACA GCCTTTGAAC CTGCTGCTGA 

251 TGCACCCGAC CCGCAAGGCG GACGATCAAA CCGTCGCCCT CAAGCCCGTC 

301 GGCAGCGCGC AGAACGGCAG GGCGGAATAT GAGGCGGTGT TCAAAACCCT 

351 TTCGCCGACC AACCACTGGT ATGTGCGCGT GGAGGACGCG GCAGGCGTGT 

4 01 GGCGCGTCGA GAACAAATGG ATTACCAGCC AAGGCAATGC GGTCGATTTG 

4 51 ACCCCGATGG ACAAACTTTT CAATAATACT GAAAGCAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 1 126; ORP 274.a>: 

a274 .pep 

1 MAGPIFVVIA SVAMFFVAQQ HA TDLVTDDY YKDGKHIDIQ LHRDEEAVRR 
51 HIGVQVLISP DMNAAKVFVG GEFDGKQPLN LLLMHPTRKA DDQTVALKPV 
101 GSAQNGRAEY EAVFKTLSPT NHWYVRVEDA AGVWRVENKW ITSQGNAVDL 
151 TPMDKLFNNT ESK* 

m274/a274 100.0% identity in 163 aa overlap 

10 20 30 40 50 60 

MAGPI FVVI AS VAMFFVAQQHATDLVTDDY YKDGKH I DIQLHRDEEAVRRH I GVQVLI S P 
I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I 1 I I I I I i I I I f II I I 
MAGPIFVVIASVAMFFVAQQHATDLVTDDYYKDGKHIDIQLHRDEEAVRRHIGVQVLISP 
10 20 30 40 50 60 

70 80 90 100 110 120 

DMNAAKVFVGGEFDGKQPLNLLLMHPTRKADDQTVALKPVGSAQNGRAEYEAVFKTLSPT 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
DMNAAKVFVGGEFDGKQPLNLLLMHPTRKADDQTVALKPVGSAQNGRAEYEAVFKTLSPT 
70 80 90 100 110 120 

130 140 150 160 

NHWYVRVEDAAGVWRVENKWITSQGNAVDLTPMDKLFNNTESKX 

I I I I I 1 1 I I I I M I I I I I I I I I I I I I 1 1 I I I I I I I I I I I 1 1 I 1 1 
NHWYVRVEDAAGVWRVENKWITSQGNAVDLTPMDKLFNNTESKX 
130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 127>: 

g276 . seq 

1 atgattttgc cgccatccat gacgatgatg cggtcggcgg attcgacggt 

51 ggtcaggcgg tgggcgacga tgatgccggt gcggttttcc atcaggcgtt 

101 cgagcgcttg ttggacgagg cgttcggatt cgttgtccaa tgcgctggtg 

151 gcttcgtcca ataataatat cggcgcgtct ttcaaaatgg cgcgggcgat 

201 ggcgacgcgt tgccgctgtc cgccggataa gttgctgccg ttcgatccga 

251 tgggctggtg cagtccgagc ggggatgcgt cgatcaggct ttgcaggttg 

3 01 gcggcttgga gggcggacag gacttcggct tcgcccgcgt cgggacggct 

351 gtatcggacg ttttcaaaca gggtgtcgtc aaacaggaat acgtcttggg 

401 agacgagggc gaattgggcg cgcaggcagt cgagtttgat gtcggcgatg 

451 tcgataccgt ctatgcagat gttgccggca gacggttcga caaagcgggg 

501 cagaaggttg acgacggtgg atttgccgct gccggaacgt ccgaccaggg 

551 cgacgcgttc gccttgtctg atgtcgaggt tgaagttgtc gagggctttg 

601 atgccgtctg aacggtattc gacatcgacg ttgcggaagc tgatgcgccc 

651 ttcgacacgc tgcggcgcga gcgtgccttt gtcctgttcg ggcggggtgt 

701 cgagaaatgc acatacgccg tcggcggcga ggaacatcgt ctgcataggg 

751 atgctgatgt tggcaaggct tttgatgggg gcgtacattt gcagcatcgc 

801 gacgatgaat gccataaatt cgccgatggt ggtgtag 

This corresponds to the amino acid sequence <SEQ ID 1 128; ORF 276.ng>: 

g276 .pep 



m274 .pep 
a274 

m274 .pep 
a274 

m274 .pep 
a274 



WO 99/57280 



PCT/US99/09346 



628 

1 MILPPSMTMM RSADSTWRR WATMMPVRFS IRRSSACWTR RSDSLSNALV 

51 ASSNNNIGAS FKMARAMATR CRCPPDKLLP FDPMGWCSPS GDASIRLCRL 

101 AAWRADRTSA SPASGRLYRT FSNRVSSNRN TSWETRANWA RRQSSLMSAM 

151 SIPSMQMLPA DGSTKRGRRL TTVDLPLPER PTRATRSPCL MSRLKLSRAL 

201 MPSERYSTST LRKLMRPSTR CGASVPLSCS GGVSRNAHTP SAARNIVCIG 

251 MLMLARLLMG AYI CSIATMN AINSPMW* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 129>: 

m276.seq 

1 ATGATTTTGC CGTCGTCCAT CACGATGATG CGGTCGGCCC CTTCGATGGT 

51 GGTCAGGCGG TGGGCGACGA TGATGCCGGT GCGGTTTTCC ATCAGGCGTT 

101 CGAGCGCCTG TTGGACGAGG CGTTCGGATT CGTTGTCTAA TGCGCTGGTG 

151 GCTTCGTCCA ATAATAATAT CGGCGCGTCT TTCAAAATGG CGCGGGCAAT 

201 GGCGACGCGT TGCCGCTGTC CGCCGGATAA GTTGCTGCCG TTCGATCCGA 

2 51 TGGGCTGGTG CAGTCCGAGC GGGGAGCTGT CAATCAGGCT TTGCAGGTTG 

301 GCGGTTTGGA GGGCGAACAG GACTTCGGCT TCGCCCGCGT CGGGACGGCT 

351 GTATCGGACG TTTTCAAACA GGGTGTCGTC AAACAGGAAT ACGTCTTGGG 

4 01 AGACGAGGGC GAATTGGGCG CGCAGGCAGT CGAGTTTGAT GTCGGCGATG 

4 51 TCGATACCGT CTATGCAGAT GTTGCCGGCA GACGGTTCGA CAAAGCGGGG 

501 CAGCAGGTTG ACGACGGTGG ATTTGCCGCT GCCGGAACGT CCGACCAGGG 

551 CGACGCGTTC GCCTTGTCTG ATGTCGAGGT TGAAGTTGTC GAGGGCTTTG 

601 ATGCCGTCTG AACGGTATTC GACATCGACG TTGCGGAAGC TGATGCGCCC 

651 TTCGACACGC TGCGGTGCGA GCGTGCCCTT GTCCTGTTCG GGCGGGGTGT 

701 CGAGAAATGC ACATACACCG TCGGCGGCGA GGAACATCGT CTGCATAGGG 

751 ATGCTGATGT TGGCAAGGCT TTTGATGGGG GCGTACATTT GCAGCATCGC 

801 GACGATGAAT GCCATAAATT CGCCGATGGT GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 1 130; ORF 276>: 

m2 76 .pep 

1 MILPSSITMM RSAPSMWRR WATMMPVRFS IRRSSACWTR RSDSLSNALV 

51 ASSNNNIGAS FKMARAMATR CRCPPDKLLP FDPMGWCSPS GELSIRLCRL 

101 AVWRANRTSA SPASGRLYRT FSNRVSSNRN TSWETRANWA RRQSSLMSAM 

151 SIPSMQMLPA DGSTKRGSRL TTVDLPLPER PTRATRSPCL MSRLKLSRAL 

201 MPSERYSTST LRKLMRPSTR CGASVPLSCS GGVSRNAHTP SAARNIVCIG 

251 MLMLARLLMG AYI CSIATMN AINSPMW* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 276 shows 96.8% identity over a 278 aa overlap with a predicted ORF (ORF 276.ng) 
from N. gonorrhoeae: 

m276/g276 

10 20 30 40 50 60 

m27 6.pep MI LPS S ITMMRSAPSMVVRRWATMMPVRFS IRRSSACWTRRS DSLSNALVAS SNNNIGAS 

I I I I I : I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
g27 6 MILPPSMTMMRSADSTWRRWATMMPVRFSIRRSSACWTRRS DSLSNALVAS SNNNIGAS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m27 6 . pep FKMARAMATRCRCPPDKLLPFDPMGWCSPSGELSIRLCRLAVWRANRTSASPASGRLYRT 
I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I M : I I I : I I I I I I I I I I I I I I 
g27 6 FKMARAMATRCRCPPDKLLPFDPMGWCSPSGDASIRLCRLAAWRADRTSASPASGRLYRT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m27 6 . pep FSNRVSSNRNTSWETRANWARRQSSLMSAMSIPSMQMLPADGSTKRGSRLTTVDLPLPER 
I I I I I I I M I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 11 i I I I I I I I 
g27 6 FSNRVSSNRNTSWETRANWARRQSSLMSAMSIPSMQMLPADGSTKRGRRLTTVDLPLPER 

130 140 150 160 170 180 

190 200 210 220 230 240 

m27 6 . pep PTRATRSPCLMSRLKLSRALMPSERYSTSTLRKLMRPSTRCGASVPLSCSGGVSRNAHTP 

I I I I I I I I I I I I I I I I I I I I I I I I 1 1 II I 1 1 I I I I I I 1 1 I i I I I I I I I I I I I I I I I I I I I 

g27 6 PTRATRSPCLMSRLKLSRALMPSERYSTSTLRKLMRPSTRCGASVPLSCSGGVSRNAHTP 

190 200 210 220 230 240 
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250 260 270 279 

m27 6 pep SAARNIVCIGMLMLARLLMGAYICSIATMNAINSPMVVX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

g 27 6 SAARNIVCIGMLMLARLLMGAYICSIATMNAINSPMVVX 

250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13 1>: 
a276.seq 

1 ATGATTTTGC CGTCGTCCAT TACGATGATG CGGTCGGCCC CTTCGATGGT 

51 GGTCAGGCGG TGGGCGACGA TGATGCCGGT GCGGTTTTCC ATCAGGCGTT 

101 CGAGCGCCTG TTGGACGAGG CGTTCGGATT CGTTGTCCAA TGCGCTGGTG 

151 GCTTCGTCCA ATAATAATAT CGGCGCGTCT TTCAAAATGG CGCGGGCAAT 

201 GGCAACGCGT TGCCGCTGTC CGCCGGATAA GTTGCTGCCG TTCGATCCGA 

251 TGGGCTGGTG CAGTCCGAGC GGTGATGCGT CGATCAGGCT TTGCAGGTTA 

301 GCGGCTTGGA GGGCGGATAG GACTTCGGCT TCGCCCGCGT CGGGACGGCT 

351 ATATCGGACG TTTTCAAACA GGGTGTCGTC AAACAGGAAT ACGTCTTGGG 

401 AGACGAGGGC AAATTGGGCG CGCAGGCAGT CGAGTTTGAT GTCGGCGATG 

451 TCGATACCGT CTATGCAGAT GTTGCCGGCA GACGGTTCGA CAAAGCGGGG 

501 CAGCAGGTTG ACGACGGTGG ATTTGCCGCT GCCGGAACGT CCGACCAGGG 

551 CGACGCGTTC GCCTTGTCTG ATGTCGAGGT TGAAGCCGTC GAGGGCTTTG 

601 ATGCCGTCCG AACGGTATTC GACATCGACG TTGCGGAAGC TGATGCGCCC 

651 TTCGACACGC TGCGGTGCGA GCGTGCCTTT GTCCTGTTCG GGCGGGGTGT 

701 CGAGAAATGC ACATACGCCG TCGGCGGCGA GGAACATCGT CTGCATAGGG 

751 ATGCTAATGT TGGCAAGGCT TTTGATGGGG GCGTACATTT GCAGCATCGC 

801 GACGATGAAT GCCATAAATT CGCCGATGGT GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 1 132; ORF 276.a>: 
a276.pep 

1 MILPSSITMM RSAPSMVVRR WATMMPVRFS IRRSSACWTR RSDSLSNALV 

51 ASSNNNIGAS FKMARAMATR CRCPPDKLLP FDPMGWCSPS GDASIRLCRL 

101 AAWRADRTSA SPASGRLYRT FSNRVSSNRN TSWETRANWA RRQSSLMSAM 

151 SIPSMQMLPA DGSTKRGSRL TTVDLPLPER PTRATRSPCL MSRLKPSRAL 

201 MPSERYSTST LRKLMRPSTR CGASVPLSCS GGVSRNAHTP SAARNIVCIG 

251 MLMLARLLMG AYI CSIATMN AINSPMVV* 

m276/a276 98.2% identity in 278 aa overlap 

10 20 30 40 50 60 

m276.pep MILPSSITMMRSAPSMVVRRWATMMPVRFSIRRSSACWTRRSDSLSNALVASSNNNIGAS 
| | | | | | | | | | | | I I I I I I I I 1 1 I I 1 I i 1 1 1 1 1 I I I 1 1 I I I I I I I I I I I I I l i I I I I 1 I I I 

a2 7 6 MILPSSITMMRSAPSMVVRRWATMMPVRFSIRRSSACWTRRSDSLSNALVASSNNNIGAS 
10 20 30 40 50 60 

70 80 90 100 110 120 

m27 6 pep FKMARAMATRCRCPPDKLLPFDPMGWCSPSGELSIRLCRLAVWRANRTSASPASGRLYRT 
| | | | | | | I I I I I I I I ! I I I I I I I I I I I I I I I : I I I I I I 1 I : I I I : I I I I I I ! i I I I I I I 
a 27 6 FKMARAMATRCRCPPDKLLPFDPMGWCSPSGDASIRLCRLAAWRADRTSASPASGRLYRT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m276.pep FSNRVSSNRNTSWETRANWARRQSSLMSAMSIPSMQMLPADGSTKRGSRLTTVDLPLPER 

I I I I I I I I I I I I I I I I I I ( I I I M I I I M I I I i I I I I I I I I I I I I ! I I I I I ! 

a 27 6 FSNRVSSNRNTSWETRANWARRQSSLMSAMSIPSMQMLPADGSTKRGSRLTTVDLPLPER 
130 140 150 160 170 180 

190 200 210 220 230 240 

m276.pep PTRATRSPCLMSRLKLSRALMPSERYSTSTLRKLMRPSTRCGASVPLSCSGGVSRNAHTP 
I J I I I I I I I I I I I I I I I I I I I I I I I I I i M I I I I I I I I I I i M I I I I I I I I I I I M I I I 
a276 PTRATRSPCLMSRLKPSRALMPSERYSTSTLRKLMRPSTRCGASVPLSCSGGVSRNAHTP 
190 200 210 220 230 240 

250 260 270 279 

m27 6.pep SAARNIVCIGMLMLARLLMGAYICSIATMNAINSPMVVX 
I I I I I I I I I I I I i I I I I I I i I I I I I I I I I I 1 I I I I I I t I 
a 27 6 SAARNIVCIGMLMLARLLMGAYICSIATMNAINSPMVVX 
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250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 133>: 

g277.seq (partial) 

1 ..atggtacacg tcgccgtagc ttacggtatt gccgtccggc gtttttgccc 

51 aaacgaggtc atagacgttt tccacgcctt gcaggtacat cgccaagcgt 

101 tcgatgccgt aggtaatttc gccgagtacg ggcgtgcaat cgataccgcc 

151 gacttgttgg aaataggtaa actgggttac ttccatgccg ttgagccaga 

201 cttcccagcc caaaccccac gcaccgaggg tggggttttc ccagtcgtct 

251 tcgacaaagc ggatgtcgtg gactttggga tcgatgccca attcgcgcag 

301 ggagtcgaga tagaggtctt ggatattggc gggggcgggt ttgagggcga 

351 cttggaattg gtaatagtgt tgcaggcggt tggggttgtc gccgtagcgg 

401 ccgtctttgg ggcggcggct gggttggacg taggcggcaa accaaggctc 

451 ggggccgagc gcgcgcaggc aggtggcggg atgggatgtg ccggcaccga 

501 cttccatgtc gaagggttgg atgacggtgc agcctttgtc tgcccagaag 

551 gtttgcagtt tgaagatgat ttgttggaag gtaagcatgg cttattgttc 

601 gataaaataa aggttttatt ttactgtttc catagccgct tgaatagatt 

651 tatctcgaag acagcctga 

This corresponds to the amino acid sequence <SEQ ID 1 134; ORF 277.ng>: 

g277.pep (partial) 

1 . .MVHVAVAYGI AVRRFCPNEV IDVFHALQVH RQAFDAVGNF AEYGRAIDTA 

51 DLLEIGKLGY FHAVEPDFPA QTPRTEGGVF PWFDKADW DFGIDAQFAQ 

101 GVEIEVLDIG GGGFEGDLE L VIVLQAVGW AVAAVF GAAA GLDVGGKPRL 

151 GAERAQAGGG MGCAGTDFHV EGLDDGAAFV CPEGLQFEDD LLEGKHGLLF 

201 DKIKVLFYCF HSRLNRFISK TA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 135>: 



m2 77 . seq 




1 


ATGCCCCGCT 


51 


TTTCTTCGGC 


101 


CGCAGCAGCC 


151 


GACTTCGTTT 


201 


GCGTTTTTGC 


251 


ATCGCCAAGC 


301 


GTCGATGCCG 


351 


CGTTGAGCCA 


401 


TCCCAGTCGT 


451 


CAATTCGCGC 


501 


GCTTGAGGGC 


551 


TCGCCGTAGC 


601 


AAACCAAGGC 


651 


TGCCGGCACC 


701 


TCTGCCCAGA 


751 


GGCTTATGA 



This corresponds to the amino acid sequence <SEQ ID 1 136; ORF 277>: 

m277.pep 

1 MPRFEDKLVG RQGEGGVFFG KQAFGLRFW VELAQQPVGI AVFEWGGLL 
51 DFVLWHVAV GDGVAVERFC PNEWDVFYT LQVHRQAFDA VGDFAEYGRA 
101 VDAADLLEIG KLGYFHAVEP DFPAQTPRAE GGVFPWFDK ADWDFGIDA 
151 QFAQRVEIEV LDIGGSGLEG DLE LVIVLQA VGWAVAAVF GAAAGLDVGG 
201 KPRLGAECAQ AGGGMGCAGT DFHVEGLDDG AAFVCPECLQ FEDDLLEGKH 
251 GL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 277 shows 90.0% identity over a 221 aa overlap with a predicted ORF (ORF 277.ng) 
from N. gonorrhoeae: 

g277/m277 



g277 .pep 



10 20 30 

MVHVAVAYGIAVRRFCPNEVIDVFHALQVH 



WO 99/57280 



PO7US99/09346 



631 



: I I I I I : I : I I : I I I I I I I : I I I :: | | | j 
m277 GLRFVVVELAQQPVGIAV FEVVGGLLDFVLVVHVAVGDGVAVERFCPNEVVDVFYTLQVH 

30 40 50 60 70 80 

40 50 60 70 80 90 

g27 7 . pep RQAFDAVGNFAEYGRAIDTADLLEIGKLGYFHAVEPDFPAQTPRTEGGVFPVVFDKADVV 
M I M M M I II M I II I I I | I | | | ! | | | | | | |f | M: M | I I I I Ml M | I I 

m277 RQAFDAVGDFAEYGRAVDAADLLEIGKLGYFHAVEPDFPAQTPRAEGGVFPVVFDKADVV 
90 100 110 120 130 140 

100 110 120 130 140 150 

g277 . pep DFGIDAQFAQGVEIEVLDIGGGGFEGDLELVIVLQAVGVVAVAAVFGAAAGLDVGGKPRL 
I I I I M I I I I I I I I I I I I II : I : I I I I I I I I I I | || | | | | | | | | | | | | || | | | | | | | j j 
m277 DFGIDAQFAQRVEIEVLDIGGSGLEGDLELVIVLQAVGVVAVAAVFGAAAGLDVGGKPRL 
150 160 170 180 190 200 

160 170 180 190 200 

g277 . pep GAERAQAGGGMGCAGTDFHVEGLDDGAAFVCPEGLQFEDDLLEGKHGLL 

IN M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I | | | | | | 
m2 7 7 GAECAQAGGGMGCAGTDFHVEGLDDGAAFVCPECLQFEDDLLEGKHGLX 
210 220 230 240 250 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 137>: 



a277 . seq 




1 


ATGCCCCGCT 


51 


TTTCTTCGGC 


101 


CGCAGCAGCC 


151 


GACTTCGTTT 


201 


GCGTTTTTGC 


251 


ATCGCCAAGC 


301 


GTCGATGCCG 


351 


CGTTGAGCCA 


401 


TCCCAGTCGT 


451 


CAATTCGCGC 


501 


GCTTGAGGGC 


551 


TCGCCGTAGC 


601 


AAACCAAGGC 


651 


TGCCGGCACC 


701 


TCTGCCCAGA 


751 


GGCTTATGA 



TTTCCACGCC TTGCAGGTAC 
TCGCCGAGTA CGGGGGTGCA 



CCGACTTGTT GGAAATAGGT GAACTGGGTT 



TTGGATATTG 
GTTGCAGGCG 



ACTTCCATAC 
GGTGGGGTTT 
GGTCGATGCC 
GCGGGAGCGG 
GTTGGGGTTG 
CGTAGGCGGC 
GGATGGGATG 



This corresponds to the amino acid sequence <SEQ ID 1138; ORF 277.a> 
a277 . pep 

MPRFEDKLVG RQGEGGVFFG KQAFGLRFVV 

DFVLVV HVAV SYCITVQRFC PNEVIDVFHA 

VDAADLLEIG ELGYFHTVEP DFPAQTPRAE 

QFAQGVEIEV LDIGGSGLEG DLE LVIVLQA 

KPRLGAECAQ TGGGMGCAGT DFHVEGLDDG AAFVCPECLQ 



1 
51 
101 
151 
201 
251 



GL^ 



VELAQQP I G I AVFEVVGGLF 
LQVHRQAFDA VGDFAEYGGA 
GGVFPVVFDK ADVVHFGVDA 
VGVVAVATVF GAAAGLDVGG 
FEDDLLEGKH 



m277/a277 92.5% identity in 252 aa overlap 

10 20 30 40 50 60 

m277 . pep MPRFEDKLVGRQGEGGVFFGKQAFGLRFVVVELAQQPVGIAVFEVVGGLLDFVLVVHVAV 
I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I I | : | | | | | | | | | | 
a277 MPRFEDKLVGRQGEGGVFFGKQAFGLRFVVVELAQQPIGIAVFEVVGGLFDFVLVVHVAV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m277 . pep GDGVAVERFCPNEVVDVFYTLQVHRQAFDAVGDFAEYGRAVDAADLLEIGKLGYFHAVEP 
' I : I I I I I I I : I I I :: I I I I I I I i I I I I I I I | | I I I I I i I I I I I I : I || I I : I I I 
a277 SYCITVQRFCPNEVIDVFHALQVHRQAFDAVGDFAEYGGAVDAADLLEIGELGYFHTVEP 
70 80 90 100 110 120 



130 140 150 160 170 180 
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m277 . pep DFPAQTPRAEGGVFPVVFDKADVVDFGIDAQFAQRVEIEVLDIGGSGLEGDLELVIVLQA 
II I I I I I I I I I I I I I I I I I I I I I I 11:111111 I I I I I I I I I I I I I I I I M I I I I I II 
a277 DFPAQTPRAEGGVFPVVFDKADVVHFGVDAQFAQGVEIEVLDIGGSGLEGDLELVIVLQA 
130 140 150 160 170 180 



190 200 210 220 230 240 

m277 .pep VGVVAVAAVFGAAAGLDVGGKPRLGAECAQAGGGMGCAGTDFHVEGLDDGAAFVCPECLQ 
I I I I M I : I I I I I I I I I I I I I I I I I I I I I I : I M I I I I I I I I II I I I I I I I II I I I I I I I 
a 277 VGVVAVATVFGAAAGLDVGGKPRLGAECAQTGGGMGCAGTDFHVEGLDDGAAFVCPECLQ 
190 200 210 220 230 240 



250 

m277.pep FEDDLLEGKHGLX 
! i I I I i I I I I I I I 
a27 7 FEDDLLEGKHGLX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 139>: 

g278.seq (partial) 

1 ttgcgtgcaa tcacgcccgg tgcgattttt tcgacagggg cggtcaaagt 

51 tgtattaatc ggacctttgc cgtcgatagg ccgacccaat gcatcgacga 

101 cgcgtccgac caattcgcgt ccgaccggca cttctaaaat acggccggta 

151 caggtaaccg tgtcgccttc tttaatatgt tcgtactcgc ccaacactac 

201 ggcaccgacg gagtcgcgct ccaggttcat cgccaagcct aaagtgttac 

251 ccgggaattc gagcatctca ccttgcattg catctgacaa accatggatg 

3 01 cgaacgatac cgtcagttac cgaaatcacc gtaccacggg tactcacttc 

351 ggcatttaca gacagatttt cgatcttggc tttaatcaga tcgctaattt 

401 cagcaggatt aagctgcatg aaaactctcc taattcgtca tagtcgtgta 

451 caaagcactc agtttgcctt gtacagacaa atccaaaacc tgatcaccca 

501 cttcaacttt ta. . . 

This corresponds to the amino acid sequence <SEQ ID 1 140; ORF 278.ng>: 

g278.pep (partial) 

1 LRAITPGAIF STGA VKWLI GPLPSIGRPN ASTTRPTNSR PTGTSKIRPV 

51 QVTVSPSLIC SYSPNTTAPT ESRSRFIAKP KVLPGNSSIS PCIASDKPWM 

101 RTIPSVTEIT VPRVLTSAFT DR FSILALIR SLISAGLSC M KTLLIRHSRV 

151 QSTQFALYRQ IQNLITHFNF . . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 14 1>: 

m278 . seq. . 

1 TTGCGCGCAA TCACGCCCGG TGCGATTTTT TCGATAGGGG CGGTCAAAGT 

51 TGTATTAATC GGGCCTTTGC CGTCGATAGG CCGACCCAAT GCATCAACGA 

101 CGCGTCCGAC CAGTTCGCGT CCGACCGGCA CTTCCAAGAT ACGACCGGTA 

151 CAGGTAACCG TGTCGCCTTC TTTAATGTGT TCGTACTCGC CCAACACTAC 

201 GGCGCCGACG GAGTCGCGCT CCAGGTTCAT CGCCAAGCCG AAAGTGTTAC 

251 CCGGGAATTC GAGCATCTCA CCTTGCATTG CATCTGACAA ACCATGGATG 

301 CGAACGATAC CGTCAGTTAC CGAAATTACC GTACCACAGG TACGCACTTC 

3 51 GGCATTTACA GACAGATTTT CGATCTTGGC TTTAATCAAA TCGCTAATTT 

4 01 CAGCAGGATT AAGCTGCATG AAAACTCTCC TAATTCGTCA TAGTCGTGTA 
451 CAAGGCACTC AATTTGCCTT GTACAGACAA ATCCAAAACC TGATCACCCA 
501 CTTCAACTTT TATGCCGCCA ATCAGCTCCG GTTCGATTTC GACAGAGATT 
551 TTCAGCTCGC TGTCGAAACG CTTATTCAGC ATTTGCACCA ACTCGCCGAC 
601 CTGTTTGTCG GTCAACGGAT AGGCACTGTA AATGACGGCA GATTTGATAT 
6 51 GGTTGAATGA 

This corresponds to the amino acid sequence <SEQ ID 1 142; ORF 278>: 

m278 .pep 

1 LRAITPGAIF SIGA VKWLI GPLPSIGRPN ASTTRPTSSR PTGTSKIRPV 

51 QVTVSPSLMC SYSPNTTAPT ESRSRFIAKP KVLPGNSSIS PCIASDKPWM 

101 RTIPSVTEIT VPQVRTSAFT DR FSILALIK SLISAGLSC M KTLLIRHSRV 

151 QGTQFALYRQ IQNLITHFNF YAANQLRFDF DRDFQLAVET LIQHLHQLAD 

201 LFVGQRIGTV NDGRFDMVE* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 278 shows 95.9% identity over a 170 aa overlap with a predicted ORF (ORF 278.ng) 
from N. gonorrhoeae: 

g278/m278 

10 20 30 40 50 60 

g278.pep LRAITPGAIFSTGAVKVVLIGPLPSIGRPNASTTRPTNSRPTGTSKIRPVQVTVSPSLIC 
I | I I I I ( I I I I I I I I I I I I I I I I I I I i I I I I I I I! I : I I I I I I I I I t I II I I I I I I I : I 
m 27 8 LRAITPGAIFSIGAVKVVLIGPLPSIGRPNASTTRPTSSRPTGTSKIRPVQVTVSPSLMC 

10 20 30 40 50 60 



70 80 90 100 110 120 

g278.pep SYSPNTTAPTESRSRFIAKPKVLPGNSSISPCIASDKPWMRTIPSVTEITVPRVLTSAFT 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 
m 278 SYSPNTTAPTESRSRFIAKPKVLPGNSSISPCIASDKPWMRTIPSVTEITVPQVRTSAFT 

70 80 90 100 110 120 

130 140 150 160 170 

g278.pep DRFSILALIRSLISAGLSCMKTLLIRHSRVQSTQFALYRQIQNLITHFNF 
I | I I I I I I I : I I I I I I I I I I i I ! I I I I I I I I : I I I i I I I II I I I II I I I I 
m 27 8 DRFSILALIKSLISAGLSCMKTLLIRHSRVQGTQFALYRQIQNLITHFNFYAANQLRFDF 

130 140 150 160 170 180 



ro278 



DRDFQLAVETLIQHLHQLADLFVGQRIGTVNDGRFDMVE* 
190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 143>: 
a278.seq 

TTGCGCGCAA TCACGCCCGG TGCGATTTTT TCGATAGGGG CGGTCAAAGT 
GGGCCTTTGC 
CAGTTCGCGT 

TGTCGCCTTC TTTAATATGT TCGTGCTCGC 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



TGTATTAATC 
CGCGTCCGAC 
CAGGTAACCG 
GGCGCCGACG 
CCGGGAATTC 
CGAACGATAC 
GGCATTTACA 



CGTCGATAGG CCGACCCAAT 
CCGACCGGCA CTTCCAAGAT 



GAGTCGCGCT CCAGGTTCAT CGCCAAGCCG 



GCATCAACGA 
ACGACCGGTA 
CCAACACTAC 
AAAGTGTTAC 



GAGCATCTCA CCTTGCATTG 
CGTCAGTTAC CGAAATCACC 



CATCTGACAA ACCATGGATG 
GTACCACGGG TACGCACTTC 



GACAGATTTT CGATCTTGGC TTTAATCAAA 



CAGCAGGATT AAGCTGCATG AAAACTCTCC TAATTCGTCA 



CAAGGCACTC 
CTTCAACTTT 
TTCAGCTCGC 
CTGTTTGTCG 
GGTTGAATGA 



AATTTGCCTT GTACAGACAA ATCCAAAACC 
TATGCCGCCA ATCAGCTCCG GTTCGATTTC 
TGTCGAAACG CTTATTCAGC ATTTGCGCCA 
GTCAACGGAT AGGCACTGTA AATGACGGCA 



TCGCTAATTT 
TAGTCGTGTA 
TGATCACCCA 
GACAGAGATT 
ACTCGCCGAC 
GATTTGATAT 



This corresponds to the amino acid sequence <SEQ ID 1 144; ORF 278.a>: 



a278, 



pep 
1 
51 
101 
151 
201 



LRA ITPGAIF SIGAVKVVLI 
QVTVSPSLIC SCSPNTTAPT 
RTIPSVTEIT VPRVRTSAFT 
QGTQFALYRQ IQNLITHFNF 
LFVGQRIGTV NDGRFDMVE* 



GPLPSIGRPN 
ESRSRFIAKP 
DRFSILALIK 



ASTTRPTSSR 
KVLPGNSSIS 
SLISAGLSCM 



YAANQLRFDF DRDFQLAVET 



PTGTSKIRPV 
PCIASDKPWM 
KTLLIRHSRV 
LIQHLRQLAD 



m278/a278 98.2% identity in 219 aa overlap 

10 20 30 40 50 60 

.pep LRAITPGAIFSIGAVKVVLIGPLPSIGRPNASTTRPTSSRPTGTSKIRPVQVTVSPSLMC 



m278. 



a278 



m278 .pep 
a278 



I | | | I I I I I I I II I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I : I 
LRAITPGAIFSIGAVKVVLIGPLPSIGRPNASTTRPTSSRPTGTSKIRPVQVTVSPSLIC 
10 20 30 40 50 60 

70 80 90 100 110 120 

SYSPNTTAPTESRSRFIAKPKVLPGNSSISPCIASDKPWMRTIPSVTEITVPQVRTSAFT 
I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I II I I I I I I I I I I : I I I I I I I 
SCSPNTTAPTESRSRFIAKPKVLPGNSSISPCIASDKPWMRTIPSVTEITVPRVRTSAFT 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m27 8.pep DRFSILALIKSLISAGLSCMKTLLIRHSRVQGTQFALYRQIQNLITHFNFYAANQLRFDF 
I I! I I I I I I I I I I II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I II I I 
a278 DRFSILALIKSLISAGLSCMKTLLIRHSRVQGTQFALYRQIQNLITHFNFYAANQLRFDF 

130 140 150 160 170 180 

190 200 210 220 

m27 8.pep DRDFQLAVETLIQHLHQLADLFVGQRIGTVNDGRFDMVEX 
I I I I I I ] I I I I I I I t : I I I I I I I I II I I I I I I I I I I I I I I 
a278 DRDFQLAVETLIQHLRQLADLFVGQRIGTVNDGRFDMVEX 
190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 145>: 

g279 . seq 

1 atgacgcgga tttgcggctg cttgatttca acggttttga gtgtttcggc 

51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata 

101 ccggcagcgg cagggcgcgt ttggctccgg cttctttggc ggcagccatg 

151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 

201 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 

251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct 

3 01 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag 

3 51 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt 
401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 

4 51 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 1 146; ORF 279. ng>: 

g279.pep 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 VRPTAAA LPA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT 
151 SK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 147>: 

m279 . seq 

1 ATAACG CGGA TTTGCGGCTG CTTGATTTCA ACGGTTTTCA GGGCTTCGGC 

51 AAGTTTGTCG GCGGCGGGTT TCATCAGGCT GCAATGGGAA GGTACGGACA 

101 CGGGCAGCGG CAGGGCGCGT TTGGCACCGG CTTCTTTGGC GGCAGCCATG 

151 GCGCGTCCGA CGGCGGCGGC GTTGCCTGCA ATCACGATTT GTCCGGGTGA 

2 01 GTTGAAGTTG ACGGCTTCGA CCACTTCGCT TTGGGCGGCT TCGGCACAAA 

251 TGGCTTTAAC CTGCTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA GGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT TCAATGCGCC GGCGGCAACG AGTGCGGTGT 

401 ATTCGCCGAG GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

451 TCTAAATAG 

This corresponds to the amino acid sequence <SEQ ID 11 48; ORF 279>: 

ni279 .pep 

1 ITRICGCLIS TVFRASASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 ARPTAAA LPA ITICPGELKL TASTTSLWAA SAQMALTCSS SKPRIAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKFNAPAAT SAVYSPRLCP ATAAGVLPPA 
151 SK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N.gonorrhoeae 

ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

m2 79.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

: I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I : I I II I I I I I 
g279 MTRICGCLISTVLSVSASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMVRPTAAALPA 

10 20 30 40 50 60 

70 80 90 100 110 120 
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m279 . pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
II IIIIIIIIMIII I llh I I I M M h :| I I h I I II I I I II I ! I I I M ! N i 
g2 79 ITTCPGELKLTASTTSPCADSAQICLTCSSSKPKMAAIAPTPCGTADCISSARRRTSLTA 

70 80 90 100 110 120 



130 140 150 

m2 79 . pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX 

III II III IIIIIIIIMIII IIIMh III 
g2 79 SAKSNASAATSAVYSPRLCPATAAGVLPPTSKX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 149>: 

a279. seq 

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTNNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCGGGTT TCATGAGGCT GCAATGGGAA GGTACNGACA 

101 CNGGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

201 GTTGAAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 

251 TTTGTTTTAC CTGTTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

401 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

4 51 TCCGAATAG 

This corresponds to the amino acid sequence <SEQ ID 1 150; ORF 279.a>: 

a279.pep 

1 MTXICGCLIS TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAASI 
51 ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIAP 
101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 
151 SE* 

m279/a279 88.2% identity in 152 aa overlap 

10 20 30 40 . 50 60 

m27 9.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 
: I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I II I I I I I I II I I II : : I I I II II II 
a27 9 MTXICGCLISTVXRASASLSAAGFMRLQWEGTDTGSGRARLAPASLAASIARSTAAALPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m27 9.pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
I I I I I I I I I I I I I I I I Ml: : I I I I II I I I I I I I I I I I I I I I I I I I I | II ; I II 
a279 ITTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADCIS SARXRTSLTA 

70 80 90 100 110 120 



130 140 150 

m27 9 . pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX 
III I I I I I I I I 1 I I I I I I I I I I I I I I I II : I 
a27 9 SAKSNAPAATSAVYSPXLCPATAAGVLPPASEX 

130 140 150 



Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
ORF 279 was cloned in pET and pGex vectors and expressed in E.coli as above-described. 
The products of protein expression and purification were analyzed by SDS-PAGE. Figure 2A 
shows the results of affinity purification and Figure 2B shows the expression in E.coli. 
Purified GST- fusion protein was used to immunize mice whose sera were used for ELIS A 
(positive result), FACS analysis (Figure 2C), western blot (Figure 2D). These experiments 
confirm that 279 is a surface-exposed protein and that it is a useful immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 279 are provided in 
Figure 6. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 279 and the amino 
acid sequence encoded thereby is provided herein. 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 151>: 

g280.seq 

1 atgaaacacc tcaaacttac ccttattgcc gcattgctgg ccaccgccgc 

51 aactgccgca ccccttccgg ttgtaaccag tttcagcatt ttaggcgacg 

101 tagccaaaca aatcggcggt gagcgcgtag ccgtacaaag cctcgtcgga 

151 gccaaccaag atactcatgc ctatcacatg accagtggcg acattaaaaa 

201 aatccgcagt gcaaaactcg tcctgctcaa cggcttggga cttgaagccg 

251 ccgacatcca acgcgccgtc aaacagagca aagtatccta tgccgaagcg 

301 accaaaggca tccaacccct caaagccgaa gaagaaggcg gacaccatca 

351 cgaccaccat cacgaccacg atcatgacca cgaaggacac caccacgacc 

401 acggcgaata tgacccccac gtctggaacg accctgttct tatgtccgac 

451 tatgcccaaa acgtcgctga aaccctgata aaggccgatc ccgaaggcaa 

501 agtttattat caacaacgct tgggcaacta ccaaatgcag cttaaaaaac 

551 tgcacagcga cgcacaagcc gcatttaatg ccgtccctgc cgccaaacgc 

601 aaagtcctga ccgggcacga cgcattttcc tacatgggca accgctacaa 

651 catcagcttc atcgccccgc aaggcgtgag cagcgaagcc gagccgtccg 

701 ccaaacaagt cgccgccatc atccggcaaa tcaaacgcga aggcatcaaa 

751 gccgtattta ccgaaaatat caaagacacc cgcatggttg accgcatcgc 

801 caaagaaacc ggcgtcaacg tcagcggcaa actgtattcc gacgcactcg 

851 gcaacgcgcc cgcagacacc tacatcggca tgtaccgcca caacgtcgaa 

901 gccttgacca acgcgatgaa gcaataa 

This corresponds to the amino acid sequence <SEQ ID 1 152; ORF 280.ng>: 

g280.pep 

1 MKHLKLTLIA ALLATAATA A PLPWTSFSI LGDVAKQIGG ERVAVQSLVG 

51 ANQDTHAYHM TSGDIKKIRS AKLVLLNGLG LEAADIQRAV KQSKVSYAEA 

101 TKGIQPLKAE EEGGHHHDHH HDHDHDHEGH HHDHGEYDPH VWNDPVLMSD 

151 YAQNVAETLI KADPEGKVYY QQRLGNYQMQ LKKLHSDAQA AFNAVPAAKR 

201 KVLTGHDAFS YMGNRYNISF IAPQGVSSEA EPSAKQVAAI IRQIKREGIK 

251 AVFTENIKDT RMVDRIAKET GVNVSGKLYS DALGNAPADT YIGMYRHNVE 

3 01 ALTNAMKQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 153>: 

m2 80.seq 

1 ATGAAACACC TCAAACTCAC CCTTATTGCC GCATTGCTGA CCGCCTCCGC 

51 AACTGCCGCC CCCCTGCCGG TTGTAACCAG CTTCAGCATT TTAGGCGACG 

101 TAGCCAAACA AATCGGCGGA GAGCGCGTAT CCATACAAAG TTTGGTCGGA 

151 GCCAACCAAG ATACGCACGC CTATCATATG ACCAGTGGCG ACATTAAAAA 

2 01 AATCCGCAGT GCAAAACTCG TCCTGCTCAA CGGCTTAGGA CTTGAAGCTG 
251 CCGATGTGCA ACGCGCCGTC AAACAAAGCA AAGTATCCTA TACCGAAGCG 

3 01 ACCAAAGGCA TCCAACCCCT CAAAGCCGAA GAAGAAGGCG GACACCATCA 
' 351 CGACCACGAT CATGACCACG AAGGACACCA CCATGACCAC GGCGAATATG 

4 01 ACCCGCACGT CTGGAACGAC CCCGTCCTTA TGTCCGCCTA TGCCCAAAAC 
451 GTTGCCAAAG CCCTGATAAA GGCCGATCCC GAAGGCAAAG TTTATTATCA 
501 ACAACGCTTG GGCAACTACC AAATGCAGCT CAAAAAACTG CACAGCGACG 
551 CACAAGCCGC ATTTAATGCC GTCCCTGCTG CCAAACGCAA AGTCCTGACC 
601 GGGCACGATG CCTTTTCCTA TATGGGCAAA CGTTACCATA TCGAATTCAT 
651 CGCCCCGCAA GGCGTGAGCA GCGAAGCCGA GCCTTCGGCC AAACAAGTCG 
701 CCGCCATCAT CCGACAAATC AAACGCGAAG GCATCAAAGC CGTCTTTACC 
751 GAAAACATCA AGGACACCCG TATGGTTGAC CGTATCGCCA AAGAAACCGG 
801 TGTCAACGTC AGCGGCAAAC TGTATTCCGA CGCACTCGGC AACGCGCCCG 
851 CAGACACCTA CATCGGAATG TACCGCCACA ACATCAAAGC CTTGACCAAC 
901 GCGATGAAGC AATAA 

This corresponds to the amino acid sequence <SEQ ID 1 154; ORF 280>: 

m2 80 .pep 

1 MKHLKLTLIA ALLTASATA A PLPWTSFSI LGDVAKQIGG ERVSIQSLVG 

51 ANQDTHAYHM TSGDIKKIRS AKLVLLNGLG LEAADVQRAV KQSKVSYTEA 

101 TKGIQPLKAE EEGGHHHDHD HDHEGHHHDH GEYDPHVWND PVLMSAYAQN 

151 VAKALIKADP EGKVYYQQRL GNYQMQLKKL HSDAQAAFNA VPAAKRKVLT 
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201 GHDAFSYMGK RYHIEFIAPQ GVSSEAEPSA KQVAAIIRQI KREGIKAVFT 
251 ENIKDTRMVD RIAKETGVNV SGKLYSDALG NAPADTYIGM YRHNI KALTN 
301 AMKQ* 

r Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 280 shows 93.8% identity over a 308 aa overlap with a predicted ORF (ORF 280.ng) 
from N. gonorrhoeae: 

m280/g280 

10 20 30 40 50 60 

MKHLKLTLIAALLTASATAAPLPWTSFSILGDVAKQIGGERVSIQSLVGANQDTHAYHM 

! I I I I i I I I I I I h •• M I M M I I I M I I I II M I II I M I ll-l I I I I M ! | | | M 1 1 
MKHLKLTLIAALLATAATAAPLPWTSFSILGDVAKQIGGERVAVQSLVGANQDTHAYHM 
10 20 30 40 50 60 

70 80 90 100 110 119 

TSGDIKKIRSAKLVLLNGLGLEAADVQRAVKQSKVSYTEATKGIQPLKAEEEGGHHHDH- 

MIMI Mill MM MM IIIMhIIIMIIIIIhl IMMMII III 1 1 II Mil 

TSGDIKKIRSAKLVLLNGLGLEAADIQRAVKQSKVSYAEATKGIQPLKAEEEGGHHHDHH 
70 80 90 100 110 120 

120 130 140 150 160 170 
DHDHEGHHHDHGEYDPHVWNDPVLMSAYAQNVAKALIKADPEGKVYYQQRLGNYQMQ 

M I III 1 1 II 1 1 1 M I M I III II 1 1 II III MM I M I M M I M 1 1 MM M M 

HDHDHDHEGHHHDHGEYDPHVWNDPVLMSDYAQNVAETLIKADPEGKVYYQQRLGNYQMQ 
130 140 150 160 170 180 

180 190 200 210 220 230 

LKKLHSDAQAAFNAVPAAKRKVLTGHDAFSYMGKRYHIEFIAPQGVSSEAEPSAKQVAAI 

II MM MIMI II 1 1 1 MMMMMIMIMM Ml M M M M I M M M M 1 1 1 1 

LKKLHSDAQAAFNAVPAAKRKVLTGHDAFSYMGNRYNISFIAPQGVSSEAEPSAKQVAAI 
190 200 210 220 230 240 

240 250 260 270 280 290 

IRQIKREGIKAVFTENIKDTRMVDRIAKETGVNVSGKLYSDALGNAPADTYIGMYRHNIK 

IMMMII II 1 1 1 MM Mill MMIIIMM Mill II II IMMM 1 1 MM I- 

IRQIKREGIKAVFTENIKDTRMVDRIAKETGVNVSGKLYSDALGNAPADTYIGMYRHNVE 
250 260 270 280 290 300 

3 00 
ALTNAMKQX 

MIMMM 

ALTNAMKQX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 155>: 

a280.seq 

1 ATGAAACACC CCAAACTCAC CCTTATCGCC GCATTGCTGA CCACTGCCGC 

51 AACTGCCGCC CCCCTGCCGG TTGTAACCAG CTTCAGCATT TTAGGCGACG 

101 TAGCCAAACA AATCGGCGGA GAGCGCGTAT CCATACAAAG TTTGGTCGGA 

151 GCCAACCAAG ATACGCACGC CTATCATATG ACCAGCGGCG AC AT T AAAAA 

201 AATCCGCAGT GCAAAACTCG TCCTGATTAA CGGCTTAGGA CTTGAAGCTG 

251 CCGACATCCA ACGTGCCGTC AAACAGAGCA AAGTATCCTA TGCCGAAGCG 

301 ACCAAAGGCA TCCAACCCCT CAAAGCCGAA GAAGAAGGCG GACACCATCA 

351 CGACCACGAT CATGACCACG ACCATGACCA CGAAGGACAC CACCACGACC 

4 01 ACGGCGAATA TGACCCCCAC GTCTGGAACG ACCCCGTCCT TATGTCCGCC 

4 51 TATGCCCAAA ACGTCGCCGA AGCCCTGATA AAGGCCGACC CCGAAGGCAA 

501 AGTTTATTAT CAACAACGCT TGGGCAACTA CCAAATGCAG CTCAAAAAAC 

551 TGCACAGTGA CGCACAAGCC GCATTTAATG CCGTCCCTGC CGCCAAACGC 

601 AAAGTCCTGA CCGGGCACGA TGCCTTTTCC TATATGGGCA AACGTTACCA 

651 TATCGAATTC ATCGCCCCAC AAGGTGTGAG CAGCGAAGCC GAGCCTTCAG 

701 CCAAACAAGT CGCCGCCATC ATCCGACAAA TCAAACGCGA AGGCATCAAA 

751 GCCGTATTTA CCGAAAATAT CAAAGACACC CGCATGGTTG ACCGCATCGC 



m280 .pep 
g280 

m2 80 .pep 
g280 

m280.pep 
g280 

m2 80 .pep 
g280 

m2 80 . pep 
g280 

m2 80 .pep 
g280 
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801 CAAAGAAACC GGTGTCAACG TCAGCGGCAA ACTGTATTCC GACGCACTCG 
851 GCAACGCACC CGCAGACACC TACATCGGCA TGTACCGCCA CAACATCAAA 
901 GCCTTAACCA ACGCGATGAA GCAATAA 

This corresponds to the amino acid sequence <SEQ ID 1 156; ORF 280.a>: 

a280.pep 

1 MKHPKLTLIA ALLTTAATA A PLPVVTSFSI LGDVAKQIGG ERVSIQSLVG 

51 ANQDTHAYHM TSGDIKKIRS AKLVLINGLG LEAADIQRAV KQSKVSYAEA 

101 TKGIQPLKAE EEGGHHHDHD HDHDHDHEGH HHDHGEYDPH VWNDPVLMSA 

151 YAQNVAEALI KADPEGKVYY QQRLGNYQMQ LKKLHSDAQA AFNAVPAAKR 

201 KVLTGHDAFS YMGKRYHIEF IAPQGVSSEA EPSAKQVAAI IRQIKREGIK 

251 AVFTENIKDT RMVDRIAKET GVNVSGKLYS DALGNAPADT YIGMYRHNIK 

301 ALTNAMKQ* 

m280/a280 96.4% identity in 308 aa overlap 

10 20 30 40 50 60 

m280 . pep MKHLKLTLIAALLTASATAAPLPVVTSFSILGDVAKQIGGERVSIQSLVGANQDTHAYHM 
IN i I I I i ! I M I :: I I I I I I ! I I I I I I I I I | | ! t I I I I I I I I I | | ! | I | | | | | ) I t I I 
a280 MKHPKLTLIAALLTTAATAAPLPVVTSFSILGDVAKQIGGERVSIQSLVGANQDTHAYHM 

10 20 30 40 50 60 

70 80 90 100 110 120 

m280 . pep TSGDIKKIRSAKLVLLNGLGLEAADVQRAVKQSKVSYTEATKGIQPLKAEEEGGHHHDHD 
I ! I t I I I ! I II I I I I : I i I I ! I I I I: I i | | | | | | | | | : | | | | | | | | | | | | | j | | i | | | | | 
a280 TSGDIKKIRSAKLVLINGLGLEAADIQRAVKQSKVSYAEATKGIQPLKAEEEGGHHHDHD 

70 80 90 100 110 120 

130 140 150 160 170 

m280 . pep HDH EGHHHDHGEYDPHVWNDPVLMSAYAQNVAKALIKADPEGKVYYQQRLGNYQMQ 

I M I I I I I M I M M I I I I I I I I I I I I I I I I I : t I I I I I I I I I I I I | | | | | | I | 1 | 

a280 HDHDHDHEGHHHDHGEYDPHVWNDPVLMSAYAQNVAEALIKADPEGKVYYQQRLGNYQMQ 

130 140 150 160 170 180 

180 190 200 210 220 230 

m280 . pep LKKLHSDAQAAFNAVPAAKRKVLTGHDAFSYMGKRYHIEFIAPQGVSSEAEPSAKQVAAI 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | j | | | | | | 
a280 LKKLHSDAQAAFNAVPAAKRKVLTGHDAFSYMGKRYHIEFIAPQGVSSEAEPSAKQVAAI 

190 200 210 220 230 240 

240 250 260 270 280 290 

m280 . pep IRQIKREGIKAVFTENIKDTRMVDRIAKETGVNVSGKLYSDALGNAPADTYIGMYRHNIK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I | | | | | | | | | | | | 
a280 IRQIKREGIKAVFTENIKDTRMVDRIAKETGVNVSGKLYSDALGNAPADTYIGMYRHNIK 

250 260 270 280 290 300 

300 

m2 8 0 . pep ALTNAMKQX 
I I I I I I I I I 
a280 ALTNAMKQX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 157>: 

g281.seq 

1 atgcactacg ccctcgcatc cgtcttctgc ctgtccctca gcgccgcacc 

51 cgtcggcgta ttcctcgtca tgcgccgtat gagcctgata ggcgacgcat 

101 tgagccacgc cgtcctgccc ggtgccgccg tcggctacat gtttgccggc 

151 ttgagcctgc ccgctatggg tgtgggcggg tttgccgccg gtatgctgat 

201 ggcgctgctt gccggactcg tcagccgctt taccaccctg aaagaagatg 

251 ccaactttgc cgccttttac ctgagcagcc tcgccatcgg cgtaatcctc 

301 atcagcaaaa acggcagcag cgtcgattta ctccacctcc ttttcggatc 

351 tgtgcttgcc gtcgatattc ccgcactgca actcatcgcc gccgtctccg 

401 gcctcacgct cattaccctt gccgtcatct accgccccct ggtgctagaa 

451 agcatagacc cccttttcct caagtccgtc aacggcaaag gcgggctttg 
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501 gcacgtcatt ttcctcatcc tcgtcgttat gaacctcgta tccggcttcc 

551 aagctctcgg catcctgatg tcggtcggaa ttatgatgct gcccgccatt 

601 accgcccgtt tatgggcaag aaatatgggg acgctcattc tgttgtccgt 

651 cctcatcgcc cttttttgcg gtttgatcgg gctgctcatt tcctaccaca 

701 tcgaaatccc ttccggcccc gccatcatcc tctgttgcag cgtcctttat 

751 cttttttccg tcatactcgg caaagaaggc ggcatcttgc ccaaatggtt 

801 caaaaaccac cgccaccaca ccacctga 

This corresponds to the amino acid sequence <SEQ ID 1 158; ORF 281.ng>: 

g281 -pep 

1 MHYALASVFC LSLSAA PVGV FLVMRRMSLI GDALSHAVLP GAAVGYMFAG 

51 LSLPAMG VGG FAAGMLMALL AGLV SRFTTL KEDAN FAAFY L5SLAIGVIL 

101 ISKNGSSVD L LHLLFGSVLA VDIPAL Q LIA AVSGLTLITL AVIY RPLVLE 

151 SIDPLFLKSV NGKGGLW HVI FLILWMNLV SGFQALGILM SVGIMMLPAI 

201 TARLWARNMG TLI LLSVLIA LFCGLIGLLI SYHIEIPSGP AIILCCSVLY 

251 LFSVILGKEG GILPKWFKNH RHHTT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 159>: 



m281 . seq 


(partial) 




1 


ATGCGCTACG 


CCCTCGCATC 


51 


CGTCGGCGTA 


TTCCTCGTCA 


101 


TGAGCCACGC 


CGTCCTGCCC 


151 


TTGAGCCTGC 


CCGCCATGGG 


201 


GGCACTGCTT 


GCCGGACTCG 


251 


CCAACTTTGC 


CGCCTTTTAT 


301 


GTCAGCAAAA 


ACGGGAGCAG 


351 


TGTACTTGCC 


GTCGATATTC 


401 


GCCTCACGCT 


CATTACCCTT 


451 


AGCATCGACC 


CCCTGTTTCT 


501 


GCACGTCCTC 


TTTCTCGTCC 


551 


AAGCCCTCGG 


CACACTCATG 


601 


ACCGCCCGCC 


TGTGGGCGAA 


651 


TCTGACAGCC 


CTGCTGTGCG 


701 


TCGAAATTCC 


TTCCGGTCCC 


751 


CTCTTTTCCG 


TCATACTCGG 


This corresponds to the amino acid 



m281.pep (partial) 

1 MRYALASVFC LSLSA APVGV FLVMRRMSLI GDALSHAVLP GAAVGYMFAG 

51 LSLPAMG LGG VAAGMLMALL AGLV SRFTTL KEDAN FAAFY LSSLAIGWL 

101 VSKNGSSVD L LHLLFGSVLA VDIPAL Q LIA AVSSLTLITL AVIY RPLVLE 

151 SIDPLFLKSV GGKGGLW HVL FLVLWMNLV SGFQALGTLM SVGLMMLPAI 

201 TARLWAKHMG ALI LLSVLTA LLCGLSGLLI SYHIEIPSGP AIILCCSVLY 

251 LFSVILGKEG GILT. . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from gonorrhoeae 

ORF 281 shows 93.5% identity over a 263 aa overlap with a predicted ORF (ORF 281.ng) 
from N. gonorrhoeae: 

m281/g281 

10 20 30 40 50 60 

m281 . pep MRYALASVFCLSLSAAPVGVFLVMRRMSLIGDALSHAVLPGAAVGYMFAGLSLPAMGLGG 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I = I I 
g2 8 1 MHYALASVFCLSLSAAPVGVFLVMRRMSLIGDALSHAVLPGAAVGYMFAGLSLPAMGVGG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m281.pep VAAGMIJ^LAGLVSRFTTLKEDANFAAFYLSSLAIGVVLVSKNGSSVDLLHLLFGSVLA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : I : I I I I I I I I I I I I I I I I I I I 
g281 F AAGMLMALLAGLVSRFTTLKEDANFAAF YLS SLAIGVILIS KNGS S VDLLHLLFGS VLA 

70 80 90 100 110 120 



m2 81 .pep 



130 140 150 160 170 180 

VDIPALQLIAAVSSLTLITLAVIYRPLVLESIDPLFLKSVGGKGGLWHVLFLVLWMNLV 
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g 2 8 1 VDI PALQLI AAVSGLTL I TLAVI YRPLVLES I DPLFLKS VNGKGGLWHVI FLI LWMNLV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 81.pep SGFQALGTLMSVGLMMLPAITARLWAKHMGALILLSVLTALLCGLSGLLISYHIEIPSGP 

I II Ml I IMIhMIIMIMIIhHhIIIIIII Ihlll II II Ml Illllll 

g2 81 SGFQALGILMSVGIMMLPAITARLWARNMGTLILLSVLIALFCGLIGLLISYHIEIPSGP 

190 200 210 220 230 240 

250 260 
AI I LCCSVLYLFS VI LGKEGGI LT 

f lllllilllMIIIIIIIMIII 

AI I LCCSVLYLFS VI LGKEGG I LPKWFKNHRHHTTX 
250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 161 >: 

a281. seq 

1 ATGCGCTACG CCCTCGCATC CGTCTTCTGC CTGTCCCTCA GTGCCGCACC 

51 CGTCGGCGTA TTCCTCGTCA TGCGCCGTAT GAGCCTGATA GGCGACGCAT 

101 TGAGCCACGC CGTCCTGCCC GGTGCCGCCG TCGGCTACAT GTTTGCCGGC 

151 TTAAGCCTGC CCGCCATGGG TTTGGGCGGC GTAGCCGCAG GTATGCTGAT 

201 GGCACTGCTT GCCGGACTCG TCAGCCGCTT CACCACCCTG AAAGAAGATG 

251 CCAACTTTGC CGCCTTTTAT CTCAGCAGCC TCGCCATCGG TGTAGTCCTC 

301 GTCAGCAAAA ACGGCAGCAG CGTCGATTTG CTCCACCTCC TTTTCGGCTC 

351 CGTACTTGCC GTCGATATTC CTGCCCTGCA ACTCATCGCC GCCGTATCCA 

4 01 CCCTCACACT GCTTACCCTT GCCGTCATCT ACCGCCCGCT CGTACTCGAA 

4 51 AGCATCGACC CCCTGTTTCT CAAATCTGTC GGCGGCAAAG GCGGGCTTTG 

501 GCACGTCCTC TTTCTCGTCC TGGTCGTCAT GAACCTCGTA TCCGGCTTTC 

551 AAGCCCTCGG CACACTCATG TCCGTCGGAC TTATGATGCT GCCAGCCATT 

601 ACCGCCCGCC TATGGGCGAA GCACATGGGC GCACTCATCC TCCTATCCGT 

651 TCTGACAGCC CTGCTGTGCG GCTTGAGCGG ACTGCTCATT TCCTACCACA 

701 TCGAAATTCC TTCCGGTCCC GCCATCATCC TCTGTTGCAG CGTCCTTTAT 

751 CTCTTTTCCG TCATACTCGG CAAAGAAGGC GGCATTCTGA CCAAATGGCT 

801 CAAAAACCAC CGCCACCACA CCACCTGA 

This corresponds to the amino acid sequence <SEQ ID 1 162; ORF 281.a>: 

a281 .pep 

1 MRYALASVFC LSLSA APVGV FLVMRRMSLI GDALSHAVLP GAAVGYMFAG 

51 LSLPAMG LGG VAAGMLMALL AGLV SRFTTL KEDAN FAAFY LSSLAIGWL 

101 VSKNGSSVD L LHLLFGSVLA VDIPAL Q LIA AVSTLTLLTL AVIY RPLVLE 

151 SIDPLFLKSV GGKGGLWH VL FLVLVVMNLV SGFQALGTLM SVGLMMLPAI 

201 TARLWAKH MG ALILLSVLTA LLCGL SGLLI SYHIEIPSGP AIILCCSVLY 

251 LFSVILGKEG GILTKWLKNH RHHTT* 

m281/a281 99.2% identity in 264 aa overlap 

10 20 30 40 50 60 

m281 . pep MRYALASVFCLSLSAAPVGVFLVMRRMSLIGDALSHAVLPGAAVGYMFAGLSLPAMGLGG 
I I I I I I I I I I I I I I I I I I i 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 1 I I I 
a281 MRYALASVFCLSLSAAPVGVFLVMRRMSLIGDALSHAVLPGAAVGYMFAGLSLPAMGLGG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m281 . pep VAAGMLMALLAGLVSRFTTLKEDANFAAFYLSSLAIGVVLVSKNGSSVDLLHLLFGSVLA 
I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I 
a281 VAAGMLMALLAGLVSRFTTLKEDANFAAFYLSSLAIGVVLVSKNGSSVDLLHLLFGSVLA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m281.pep VDI PALQLIAAVSSLTLITLAVI YRPLVLES I DPLFLKS VGGKGGLWHVLFLVLVVMNLV 

I I I I I I I I I I I I I : I I I : I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I [ I I 
a281 VDI PALQLIAAVSTLTLLTLAVIYRPLVLES I DPLFLKS VGGKGGLWHVLFLVLVVMNLV 

130 140 150 160 170 180 



ra2 81 .pep 
g281 
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190. 200 210 220 230 240 

m281 . pep SGFQALGTLMSVGLMMLPAITARLWAKHMGALILLSVLTALLCGLSGLLISYHIEIPSGP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | 
a 281 SGFQALGTLMSVGLMMLPAITARLWAKHMGALILLSVLTALLCGLSGLLISYHIEIPSGP 

190 200 210 220 230 240 

250 260 
m281.pep AIILCCSVLYLFSVILGKEGGILT 
I I I ! I I I I I I I I I I I I I I I I I I I I 
a281 AIILCCSVLYLFSVILGKEGGILTKWLKNHRHHTTX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 163>: 

g282.seq 

1 atgggattgg gtatggaaat cggcaagctg attgtggctc ttttggtgct 

51 gatcaatccg tttagcgcgt tgtcgcttta ccttgacctg accaacggac 

101 acagcacgaa ggagcgcagg aaggtcgcgc ggacggccgc cgtcgccgtg 

151 tttgccgtga ttgcggtatt tgcgctgatc ggcggtgcgc tattgaaggt 

201 tttgggcatc agcgtcggtt cgtttcaggt cggcggcggg attttggtgc 

251 tgctgatcgc catttcgatg atgaacggca acgacaatcc cgccaagcag 

3 01 aatctcggcg cgcagccgga aacggggcaa gcgcgccccg cccgcaatgc 

351 aggggcgatt gccgtcgtgc ccatcgccat accgatcacc atcggtccgg 

401 gcggtatttc gactgtgatt atttatgctt cggcagccaa aacgtacagc 

451 gatatcgcgc tgattatcgc ggccggtttg gtggtcagtg cgatttgtta 

501 tgccatttta atcgttgccg ggaaggtcag ccgcctgctg ggcgcgacgg 

551 ggctgacgat tttaaaccgc attatgggta tgatgctggc ggcggtatcg 

601 gtggagatta ttgtgtcggg actgaaaacg atattcccgc aactggcagg 

651 ttga 

This corresponds to the amino acid sequence <SEQ ID 1 164; ORF 282.ng>: 

g282 .pep 

1 MGLGMEIGKL IVALLVLINP FSALS LYLDL TNGHST KERR KVARTAAVAV 

51 FAVIAVFALI GGAL LKVLGI SVGS FQVGGG ILVLLIAISM M NGNDNPAKQ 

101 NLGAQPETGQ ARPARNAG AI AWPIAIPIT IGPGGI STVI IYASAAKTYS 

151 DIAIi IIAAGL WSAICYAIL IV AGKVSRLL GATGLTILNR IMGMMLAAVS 

2 01 VEIIV5G LKT IFPQLAG* 

The following partial DNA sequence was identified in ^meningitidis <SEQ ID 1 165>: 



m282 . seq 












1 


ATGGGATTGG 


GCATGGAAAT 


CGGCAAGCTG 


ATTGTGGCTT 


TTTTGGTGCT 


51 


GATTAATCCG 


TTTAGCGCGT 


TGTCGCTTTA 


CCTTGACCTG 


ACCAACGGGC 


101 


ACAGCACGAA 


GGAGCGCAGG 


AAGGTCGCGC 


GGACGGCCGC 


CGTTGCCGTG 


151 


TTTGCCGTGA 


TTGCGGTATT 


TGCGCTGATC 


GGCGGTACGC 


TGCTGAAGGT 


201 


TTTGGGCATC 


AGCGTCGGTT 


CGTTTCAGGT 


CGGCGGCGGG 


ATTTTGGTGC 


251 


TGCTGATCGC 


CATTTCGATG 


ATGAACGGCA 


ACGACAATCC 


CGCCAAGCAG 


301 


AATCTCGGCG 


CGCAGCCGGA 


AACGGGGCAG 


GCGCGCCCCG 


CCCGCAATGC 


351 


CGGAGCGATT 


GCCGTCGTGC 


CCATCGCCAT 


ACCGATCACC 


ATCGGCCCGG 


401 


GCGGTATTTC 


GACCGTGATT 


ATTTACGCTT 


CGGCGGCTAA 


AACATACGGC 


451 


GACATCGCGT 


TGATTATCGC 


GGCCGGTTTG 


GTGGTCAGTG 


CGATTTGTTA 


501 


TGCCATTTTA 


ATCGTTGCCG 


GGAAGGTCAG 


CCGCCTGCTG 


GGCGCGACGG 


551 


GGCTGACGAT 


TTTAAACCGC 


ATTATGGGTA 


TGATGCTGGC 


GGCGGTATCG 


601 


GTGGAGATTA 


TTGTGTCGGG 


ACTGAAAACG 


ATATTCCCGC 


AACTGGCAGG 


651 


TTGA 











This corresponds to the amino acid sequence <SEQ ID 1 166; ORF 282.ng>: 

m2 82 -pep 

1 MGLGMEIGKL IVAFLVLINP FSALS LYLDL TNGHST KERR KVARTAAVAV 

51 FAVIAVFALI GGTL LKVLGI SVGS FQVGGG ILVLLIAISM M NGNDNPAKQ 

101 NLGAQPETGQ ARPARNAG AI AWPIAIPIT IGPGGI STVI IYASAAKTYG 

151 DIA LIIAAGL WSAICYAIL I VAGKVSRLL GATGLTILNR IMGMMLAAVS 

201 VEIIVSG LKT IFPQLAG* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 282 shows 98.6 % identity over a 217 aa overlap with a predicted ORF (ORF 282.ng) 
from N. gonorrhoeae: 

m282/g282 

10 20 30 40 50 60 

MGLGMEIGKLIVAFLVLINPFSALSLYLDIjTNGHSTKERRKVARTAAVAVFAVIAVFALI 

I II Mill! I M hi I II I IIIMIIIIMMI llllll 1 1 Mil 1 1 1 II I ! 1 1 1 III II 

MGLGMEIGKLIVALLVLINPFSALSLYLDLTNGHSTKERRKVARTAAVAVFAVIAVFALI 
10 20 30 40 50 60 

70 80 90 100 110 120 

GGTLLKVLG I S VGS FQVGGGI LVLLI AI SMMNGNDNPAKQNLGAQPETGQARPARNAGAI 

1 1 : ; I M 1 1 h II 1 1 1 II I ! I II I II ! II II I M 1 1 1 1 1 1 1 1 1 1 i 1 1 1 : 1 1 1 1 M . M ; I 

GGALLKVLG I S VGS FQVGGGI LVLLI AI SMMNGNDNPAKQNLGAQPETGQARPARNAGAI 
70 80 90 100 110 120 

130 140 150 160 170 180 

AWPIAIPITIGPGGISTVIIYASAAKTYGDIALIIAAGLWSAICYAILIVAGKVSRLL 

! I MM II li i 1 1 II MIMI IIIIIMhll MM I MM 1 1 III II II II I II Ml !l 

AWPIAIPITIGPGGISTVII YASAAKTYSDIALIIAAGLWSAICYAILIVAGKVSRLL 
130 140 150 160 170 180 

190 200 210 

GATGLT I LNRIMGMMLAAVSVE 1 1 VSGLKTI FPQLAGX 

1 1 M II 1 1 II 1 1 M 1 1 1 M II II II I Ml MM I MM I 

GATGLT I LNRIMGMMLAAVSVE 1 1 VSGLKTI FPQLAGX 
190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 167>: 

a282.seq 

1 ATGGGATTGG 

51 GATTAATCCG 

101 ACAGCACGAA 

151 TTTGCCGTGA 

201 TTTGGGCATC 

251 TGCTGATTGC 

301 AATCTCGGCG 

351 CGGAGCGATT 

4 01 GCGGTATTTC 

4 51 GACATCGCGT 

501 TGCCATTTTA 

551 GGCTGACGAT 

601 GTGGAGATTA 

651 TTGA 

This corresponds to the amino acid sequence <SEQ ID 1 168; ORF 282. a>: 
a282 .pep 

1 MGLGMEIGKL IVAFLVLINP FSALS LYLDL TNGHSTKERR KVARTAAVAV 

51 FAVIAVFALI GG TLLKVLGI S VGS FQVGGG ILVLLIAISM M NGNDNPAKQ 

101 NLGAQPETGQ VRPARNAGA I AVVPIAIPIT IGPGGI STVI IYASAAKTYG 

151 DIAL I IAAGL VVSAICYAIL I VAGKVSRLL GATGLTILNR IMGMMLAAVS 

201 VEIIVSG LKM IFPQLAG* 

m282/a282 99. 1 % identity in 2 1 7 aa overlap 

10 20 30 40 50 60 

m282 . pep MGLGMEIGKLIVAFLVLINPFSALSLYLDLTNGHSTKERRKVARTAAVAVFAVIAVFALI 
I I I t I I I \ I I I I I I I I I II I I I I I I I I I I ! I I I I I I I I I I I i I I I I I (I I I I I I I I I I I I 
a282 MGLGMEIGKLIVAFLVLINPFSALSLYLDLTNGHSTKERRKVARTAAVAVFAVIAVFALI 

10 20 30 40 50 60 



m2 82 .pep 
g282 

m282 .pep 
g282 

m2 82 -pep 
g282 

m2 82 -pep 
g282 



GCATGGAAAT CGGCAAGCTG 
TTTAGCGCGT TGTCGCTTTA 
GGAGCGCAGG AAGGTCGCGC 
TTGCGGTATT TGCGCTGATC 
AGCGTCGGTT CGTTTCAGGT 
CATTTCGATG ATGAACGGCA 
CGCAGCCGGA AACGGGGCAG 
GCCGTCGTGC CCATCGCCAT 
GACCGTGATT ATTTACGCTT 
TGATTATCGC GGCCGGTTTG 
ATCGTTGCCG GGAAGGTCAG 
TTTAAACCGT ATCATGGGTA 
TTGTGTCGGG ACTGAAAATG 



ATTGTGGCTT TTTTGGTGCT 
CCTTGACCTG ACCAACGGGC 
GGACGGCCGC CGTTGCCGTG 
GGCGGTACGC TGCTGAAGGT 
CGGCGGCGGA ATTTTGGTGT 
ACGACAATCC CGCCAAGCAG 
GTGCGCCCCG CCCGCAATGC 
ACCGATCACC ATCGGCCCGG 
CGGCGGCTAA AACATACGGC 
GTGGTCAGTG CGATTTGTTA 
CCGCCTGCTG GGTGCGACGG 
TGATGCTGGC GGCGGTATCG 
ATATTCCCGC AACTGGCAGG 
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70 80 90 100 110 120 

m282 . pep GGTLLKVLGISVGSFQVGGGILVLLIAISMMNGNDNPAKQNLGAQPETGQARPARNAGAI 
I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I II I I I ( I I I I I I I : I I I | 1 I | | | 
a282 GGTLLKVLGISVGSFQVGGGILVLLIAISMMNGNDNPAKQNLGAQPETGQVRPARNAGAI 
70 80 90 100 110 120 

130 140 150 160 170 180 

m282 .pep AVVPI AIPITIGPGGISTVI I YASAAKTYGDIALI IAAGLVVSAICYAILIVAGKVSRLL 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I | | I | | | 
a282 AVVPIAIPITIGPGGISTVI I YASAAKTYGDIALI IAAGLVVSAICYAILIVAGKVSRLL 

130 140 150 160 170 180 

190 200 210 

m282 .pep GATGLTILNRIMGMMLAAVSVEIIVSGLKTIFPQLAGX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a282 GATGLTILNRIMGMMLAAVSVEIIVSGLKMIFPQLAGX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 169>: 

g283.seq 

1 atgaactttg ctttatccgt catcacattt accctcgcct ctttcctgcc 

51 cgtcccgcct gccggaaccg ccgtctttac ttggaaagac ggcggcggca 

101 acagctattc ggatgtgccg aaacagcttc atcccgacca gagccaaatc 

151 ctcaacctgc ggacgctcca aaccaaaccg gcggtcaagc ccaaacctgc 

201 cgtcgatacg aatgcggaca gtgcgaagga aaacgaaaag gatatcgccg 

251 agaaaaacgg gcagcttgag gaagaaaaga aaaaaattgc cgaaaccgaa 

301 cggcagaaca aagaagaaaa ctgccggatt tcaaaaatga acctgaaggc 

351 ggtgggaaac tcaaatgcga aaaacaagga tgatttgatc cgtaaataca 

401 ataacgccgt aaacaaatac tgccgttaa 

This corresponds to the amino acid sequence <SEQ ID 1 170; ORF 283.ng>: 

g283.pep 

1 MNFALSVITF TLASFLPVPP AGT AVFTWKD GGGNSYSDVP KQLHPDQSQI 

51 LNLRTLQTKP AVKPKPAVDT NADSAKENEK DIAEKNGQLE EEKKKIAETE 

101 RQNKEENCRI SKMNLKAVGN SNAKNKDDLI RKYNNAVNKY CR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 171 >: 

m283 . seq 

1 ATGAACTTTG CTTTATCCGT CATTATGTTG ACCCTCGCCT CTTTCCTGCC 

51 CGTCCCGCCT GCCGGAGCCG CCGTCTTTAC TTGGAAGGAC GGCGGCGGCA 

101 ACAGCTATTC GGATGTACCG AAACAGCTTC ATCCCGACCA AAGCCAAATC 

151 TTAAACCTGC GGACGCGCCA AACCAAACCG GCGGTCAAAC CCGCCCAAGC 

201 CGACGCAGGG AAGCGCACAG ACGGCGCGGC ACAGGAAAAC AATCCCGACA 

251 CTGCCGAGAA AAACCGGCAG CTTGAGGAAG AAAAGAAAAG AATTGCCGAA 

301 ACCGAACGGC AGAACAAAGA AGAAAACTGC CGGATTTCAA AAATGAACCT 

351 GAAGGCGGTG GGAAATTCAA ATGCAAAAAA CAAGGATGAT TTGATTCGGA 

4 01 AATACAATAA CGCCGTAAAC AAATACTGCC GTTAA 

This corresponds to the amino acid sequence <SEQ ID 1 172; ORF 283>: 

m283.pep 

1 MNFALSVIML TLASFLPVPP AGA AVFTWKD GGGNSYSDVP KQLHPDQSQI 

51 LNLRTRQTKP AVKPAQADAG KRTDGAAQEN NPDTAEKNRQ LEEEKKRIAE 

101 TERQNKEENC RISKMNLKAV GNSNAKNKDD LIRKYNNAVN KYCR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m283/g283 86.1% identity in 144 aa overlap 

10 20 30 40 50 60 

m283.pep MNFALSVIMLTLASFLPVPPAGAAVFTWKDGGGNSYSDVPKQLHPDQSQI LNLRTRQTKP 

I I I I I I I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I II I I I I I I I ! I | I I | j till 
g283 MNFALSVITFTLASFLPVPPAGTAVFTWKDGGGNSYSDVPKQLHPDQSQILNLRTLQTKP 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m283 .pep AVKPAQADAGKRTDGAAQENNPDTAEKNRQLEEEKKRIAETERQNKEENCRISKMNLKAV 
I I I I I : : I : I : I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I II I I 
g283 AVKPKPA-VDTNAD-SAKENEKDIAEKNGQLEEEKKKIAETERQNKEENCRISKMNLKAV 

70 80 90 100 110 

130 140 
m283 . pep GNSNAKNKDDLIRKYNNAVNKYCRX 
I I I I M I I I I I I I I I I I I I I I 1 I I I 
g283 GNSNAKNKDDLIRKYNNAVNKYCRX 
120 130 140 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 173>: 

a283 . seq 

1 ATGAACTTTG CTTTATCCGT CATTATGTTG ACCCTCGCCT CTTTCCTGCC 

51 CGTCCCGCCT GCCGGAGCCG CCGTCTTTAC TTGGAAGGAC GGCGGCGGCA 

101 ACAGCTATTC GGATGTACCG AAACAGCTTC ATCCCGACCA AAGCCAAATC 

151 TTAAACCTGC GGACGCGCCA AACCAAACCG GCGGTCAAAC CCGCCCAAGC 

201 CGACGCAGGG AAGCGCACAG ACGGCGCGGC ACAGGAAAAC AATCCCGACA 

251 CTGCCGAGAA AAACCGGCAG CTTGAGGAAG AAAAGAAAAG AATTGCCGAA 

301 ACCGAACGGC AGAACAAAGA AGAAAACTGC CGGATTTCAA AAATGAACCT 

351 GAAAGCGGTG GGAAATTCAA ATGCAAAAAA CAAGGATGAT TTGATTCGGA 

4 01 AATACAATAA CGCCGTAAAC AAATACTGCC GTTAA 

This corresponds to the amino acid sequence <SEQ ID 1 174; ORF 283. a>: 

a283 .pep 

1 MNFALSVIML TLASFLPVPP AGA AVFTWKD GGGNSYSDVP KQLHPDQSQI 

51 LNLRTRQTKP AVKPAQADAG KRTDGAAQEN NPDTAEKNRQ LEEEKKRIAE 

101 TERQNKEENC RISKMNLKAV GNSNAKNKDD LIRKYNNAVN KYCR* 

m283/a283 100.0% identity in 144 aa overlap 

10 20 30 40 50 60 

m283.pep MNFALSVIMLTLASFLPVPPAGAAVFTWKDGGGNSYSDVPKQLHPDQSQI LNLRTRQTKP 

I- 1 I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I 

a283 MNFALSVIMLTLASFLPVPPAGAAVFTWKDGGGNSYSDVPKQLHPDQSQILNLRTRQTKP 

10 20 30 40 50 60 

70 80 90 100 110 120 

m283.pep AVKPAQADAGKRTDGAAQENNPDTAEKNRQLEEEKKRIAETERQNKEENCRISKMNLKAV 
I I i I I I I I I I I I I I I I I I I I I I i I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a283 AVKPAQADAGKRTDGAAQENNPDTAEKNRQLEEEKKRIAETERQNKEENCRISKMNLKAV 

70 80 90 100 110 120 

130 140 
m283 . pep GNSNAKNKDDLIRKYNNAVNKYCRX 
I I i I I I I I I I I I I I I I I I I I I I I I I 
a 2 8 3 GNSNAKNKDDL I RKYNNAVNK YCRX 

130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 175>: 

g284 .seq. 

1 atgccgtctg aaactcgaaa tcggtttcag acggcattgg tttacgcggc 

51 aggttggggc ttagcggtct ttgtaacggc attcgctttt gcctgcaaaa 

101 gagtcgccgg ctttgcgttt gcctttgaag ccttcgccgg tttttttgaa 

151 actgtctttc ttaaagcctt ctttcttgaa accttcgccg cgcgttttgc 

201 cgccgaagcc ttctttgccc ggtttatgat cgccgcgccg gccgccggat 

251 ttcctatcgc cccagccgcc tttgcctttc ggcttgccgc ctgcggattt 

301 gcgtttgcgg gccggctcca tgccttcgat ggtcagttcg ggcagtttgc 

351 ggttaatgta tttttcgatt ttgtggactt tgacgtattc gttcacttcg 

401 gcaaacgtaa tcgcaatacc cgtgcggcct gcgcggccgg tgcgcccgat 

451 gcggtggacg tagtcttccg cctgtttcgg caggtcgtag tttatgacgt 
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501 gggtaatggt cggtacgtca ataccgcgtg cggcaacgtc ggtggcaacc 

551 aaaattttgc agcggccttt acgcaaatcc gtcagcgtgc ggttgcgcca 

601 gccctgcggc atatcgccgt gcaggcagtt ggcggcgaaa cctttttcgt 

651 acaattcatc cgcgatgact tcggtcatcg ctttggtgga cgtgaaaatc 

701 acacattggt cgatgttggc atcgcgcagg atgtggtcga gcaggcggtt 

751 tttgtggcgc atatcgtcgc agtacaacaa ctgctcttcg attttgcctt 

801 ggccgtccac gcgttcgact tcgataattt cagagtcttt ggtcagtttg 

851 cgcgccagtt tgccgactgc gccgtcccaa gtggcggaga acaataa 

This corresponds to the amino acid sequence <SEQ ID 1 176; ORF 284.ng>: 

g284 .pep 

1 MPSETRNRFQ TALVYAAGWG LAVFVTAFA F ACKRVAG FAF AFEAFAGFFE 

51 TVFLKAFFLE TFAARFAAEA FFARFMIAAP AAGFPIAPAA FAFRLAACGF 

101 AFAGRLHAFD GQFGQFAVNV FFDFVDFDVF VHFGKRNRNT RAACAAGAPD 

151 AVDVVFRLFR QVVVYDVGNG RYVNTACGNV GGNQNFAAAF TQIRQRAVAP 

201 ALRHIAVQAV GGETFFVQFI RDDFGHRFGG RENHTLVDVG IAQDWEQAV 

251 FVAHIVAVQQ LLFDFALAVH AFDFDNFRVF GQFARQFADC AVPSGGEQ* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 177>: 

m284 . seq. . 

1 ATGCCGTCTG AAACTCGAAA TCGGTTTCAG ACGGCATTGG TTTATGCGGC 

51 AGGTTGGGGC TTAGCGGTCT TTGTAACGGC GTTCGCCTTT GCCTGCAAAA 

101 GAATCGCCGG CTTTGCGTTT GCCTTTGAAG CCTTCGCCGG TTTTTTTGAA 

151 ACCGTCTCTC TTAAAGCCTT CTTTCTTGAA ACCTTCGCCG CGCGTTTTGC 

201 CGCCGAAGCC TTCTTTGCTC GGTTTATGAT CGCCGCGCCA ACCGCCGGAT 

251 TTACGATCG'C CCCAGCCGCC TTTGCCTTTC GGCTTGCCGC CTGCGGATTT 

301 GCGTTTGCGG GTCGGTTCCA TGCCTTCGAT GGTCAGTTCG GGCAGTTTTC 

351 GGTTAATGTA TTTTTCGATT TTGTGGACTT TGACGTATTC GTTCACTTCG 

401 GCAAACGTAA TCGCAATACC CGTGCGGCCT GCGCGGCCGG TGCGCCCGAT 

451 GCGGTGGACG TAGTCTTCCG CCTGTTTCGG CAGGTCGTAG TTGATAACGT 

501 GGGTAATGGT CGGTACGTCG ATACCGCGTG CGGCAACATC GGTGGCAACC 

551 AAAATTTTGC AGCGGCCTTT ACGCAAATCC ATCAGCGTGC GGTTGCGCCA 

601 GCCTTGCGGC ATATCGCCGT GCAGGCAGTT TGCGGCGAAA CCTTTTTCGT 

651 ACAGTTCATC CGCAATGACT TCGGTCATGG CTTTGGTGGA CGTGAAAATC 

701 ACGCATTGAT CGATATTGGC ATCGCGCAAG ATATGATCGA GCAGGCGGTT 

7 51 TTTGTGGCGC ATATCGTCGC AGTACAGCAG TTGTTCTTCG ATTTTGCCTT 

801 GATCGTCCAC GCGTTCGACT TCGATGATTT CAGGGTCTTT GGTCAGTTTG 

851 CGCGCCAGTT TGCCGACCGC GCCGTCCCAA GTGGCGGAGA ACAACAAAGT 

901 CTGACGGTCG CTCGGCGTTG CTTCCACGAT GGTTTCGATG TCGTCGATAA 

951 AGCCCATATC CAACATACGG TCGGCTTCGT CCAAAATCAG CACTTCCAAA 

1001 CGTTCAAAAT CAACTTTGCC GCTTTGCATC AGGTCCATCA GACGGCCCGG 

1051 CGTGGCGACA ATCAGATCGA CCGGTTTGCT CAGGGCACGG GTTTGGTAGC 

1101 CGAAAGACGC GCCGCCGACG ATGCTGACGG TGCGGAACCA ACGCATATTT 

1151 TTGGCATACG CCAGCGCGTT TTTCTCGACT TGAGCCGCCA GTTCGCGGGT 

1201 CGGGGTCAAC ACCAAAGCAC GCGGGCCTTT GCCCGGTTTT TCGCTGCGTT 

1251 TGGTCAGTTT TTGCAAAGTC GGTAA 



This corresponds to the amino acid sequence <SEQ ID 1 178; ORF 284>: 

m284 .pep 

1 MPSETRNRFQ TALVYAAGWG LAVFVTAFA F ACKRIAGFAF AFEAFAGFFE 

51 TVSLKAFFLE TFAARFAAEA FFARFMIAAP TAGFTIAPAA FAFRLAACGF 

101 AFAGRFHAFQ GQFGQFSVNV FFDFVDFDVF VHFGKRNRNT RAACAAGAPD 

151 AVDVVFRLFR QVVVDNVGNG RYVDTACGNI GGNQNFAAAF TQIHQRAVAP 

201 ALRHIAVQAV CGETFFVQFI RNDFGHGFGG RENHALIDIG IAQDMIEQAV 

251 FVAHIVAVQQ LFFDFALIVH AFDFDDFRVF GQFARQFADR AVPSGGEQQS 

301 LTVARRCFHD GFDVVDKAHI QHTVGFVQNQ HFQTFKINFA ALHQVHQTAR 

351 RGDNQIDRFA QGTGLVAERR AADDADGAEP THIFGIRQRV FLDLSRQFAG 

401 RGQHQSTRAF ARFFAAFGQF LQSR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m284/g284 92.3% identity in 298 aa overlap 
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10 20 30 40 50 60 

m284 . pep MPSETRNRFQTALVYAAGWGLAVFVTAFAFACKRIAGFAFAFEAFAGFFETVSLKAFFLE 
I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I | | | | | | | | | j | | 
g284 MPSETRNRFQTALVYAAGWGLAVFVTAFAFACKRVAGFAFAFEAFAGFFETVFLKAFFLE 

10 20 30 40 50 60 



70 80 90 100 110 120 

m284 . pep TFAARFAAEAFFARFMIAAPTAGFTIAPAAFAFRLAACGFAFAGRFHAFDGQFGQFSVNV 
I I I I I I I I I I I I I I I I I I M : I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I | | | : | | | 
g284 TFAARFAAEAFFARFMIAAPAAGFPIAPAAFAFRLAACGFAFAGRLHAFDGQFGQFAVNV 

70 80 90 100 110 120 



130 140 150 160 170 180 

m284 . pep FFDFVDFDVFVHFGKRNRNTRAACAAGAPDAVDWFRLFRQVVVDNVGNGRYVDTACGNI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I : | I | | | : 
g2 8 4 FFDFVDFDVFVHFGKRNRNTRAACAAGAPDAVDVVFRLFRQVVVYDVGNGRYVNTACGNV 

130 140 150 160 170 180 



190 200 210 220 230 240 

m284 . pep GGNQNFAAAFTQIHQRAVAPALRHIAVQAVCGETFFVQFIRNDFGHGFGGRENHALIDIG 
I I I I ! I I I I I I I I : I I I I I I I I I I I I I I II I I I I I I II I I : I I I I I I I 1 ! I I : I : t : I 
g284 GGNQNFAAAFTQIRQRAVAPALRHIAVQAVGGETFFVQFIRDDFGHRFGGRENHTLVDVG 

190 200 210 220 230 240 



250 260 270 280 290 300 

m2 8 4 . pep IAQDMIEQAVFVAHIVAVQQLFFDFALIVHAFDFDDFRVFGQFARQFADRAVPSGGEQQS 

I I I I :: I I I I I I i I I I I I I I I : I I i I i I I I I I I I : I I I I I I I t I I I I I I I I I I I I I 
g284 IAQDVVEQAVFVAHIVAVQQLLFDFALAVHAFDFDNFRVFGQFARQFADCAVPSGGEQX 

250 260 270 280 290 

310 320 330 340 350 360 

m284 . pep LTVARRCFHDGFDVVDKAHIQHTVGFVQNQHFQTFKINFAALHQVHQTARRGDNQIDRFA 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 179>: 

a284 . seq 

1 ATGCCGTCTG AAACTCGAAA TCGGTTTCAG ACGGCATTGG TTTATGCGGC 

51 AGGTTGGGGC TTAGCGGTCT TTGTAACGGC GTTCGCCTTT GCCTGCAAAA 

101 GAATCGCCGG CTTTGCGTTT GCCTTTGAAG CCTTCGCCGG TTTTTTTGAA 

151 ACCGTCTCTC TTAAAGCCTT CTTTCTTGAA ACCTTCGCCG CGCGTTTTGC 

201 CGCCGAAGCC TTCTTTGCTC GGTTTATGAT CGCCGCGCCA ACCGCCGGAT 

251 TTACGATCGC CCCAGCCGCC TTTGCCTTTC GGCTTGCCGC CTGCGGATTT 

301 GCGTTTGCGG GTCGGTTCCA TGCCTTCGAT GGTCAGTTCG GGCAGTTTTC 

351 GGTTAATGTA TTTTTCGATT TTGTGGACTT TGACGTATTC GTTCACTTCG 

4 01 GCAAACGTAA TCGCAATACC CGTGCGGCCT GCGCGGCCGG TGCGCCCGAT 

451 GCGGTGGACG TAGTCTTCCG CCTGTTTCGG CAGGTCGTAG TTGATAACGT 

501 GGGTAATGGT CGGTACGTCG ATACCGCGTG CGGCAACGTC GGTGGCAACC 

551 AAAATTTTGC AGCGGCCTTT GCGCAAATCC ATCAGCGTGC GGTTGCGCCA 

601 GCCTTGCGGC ATATCGCCGT GCAGGCAGTT GGCGGCGAAA CCTTTTTCGT 

651 ACAATTCATC CGCGATGACT TCGGTCATGG CTTTGGTGGA CGTGAAAATC 

7 01 ACGCATTGAT CGATGTCGGC ATCGCGCAAG AT AT GAT CG A GCAGGCGGTT 

751 TTTGTGGCGC ATATCGTCGC AGTACAGCAG TTGTTCTTCG ATTTTGCCTT 

801 GGTCGTCCAC GCGTTCGACT TCGATGATTT CAGGGTCTTT GGTCAGTTTG 

851 CGCGCCAGTT TGCCGACCGC GCCGTCCCAA GTGGCGGAGA ACAACAAAGT 

901 CTGACGGTCT TCCGGCGTGG CTTCGACGAT GGTTTCGATG TCGTCGATAA 

951 AGCCCATATC CAACATACGG TCGGCTTCGT CCAAAATCAG CACTTCCAAG 

1001 CGGGCGAAAT CGACTTTGCC GCTTTGCATC AAGTCCATCA GACGGCCCGG 

1051 CGTGGCGACA ATCAGATCGA CCGGTTTGCT CAGGGCGCGG GTTTGGTAGC 

1101 CGAACGATGC ACCACCGACG ATGCTGACGG TACGGAACCA ACGCATATTT 

1151 TTGGCATACG CCAGCGCGTT TTTCTCGACT TGAGCCGCCA ATTCGCGGGT 

1201 CGGCGTCAAC ACCAACGCGC GCGGGCCTTT GCCCGGTTTT TCGCTGCGTT 

1251 TGGTCAGTCG CTGCAAAGTC GGTAA 



This corresponds to the amino acid sequence <SEQ ID 1 180; ORF 284.a>: 

a284 .pep 

1 MPSETRNRFQ TALVYAAGWG LAVFVTAFA F ACKRIAGFAF AFEAFAGFFE 
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51 TVSLKAFFLE TFAARFAAEA FFARFMIAAP TAGFT IAPAA FAFRLAACGF 

101 AFAGRFHAFD GQFGQFSVNV FFDFVDFDVF VHFGKRNRNT RAACAAGAPD 

151 AVDVVFRLFR QVVVDNVGNG RYVDTACGNV GGNQNFAAAF AQIHQRAVAP 

201 ALRHIAVQAV GGETFFVQFI RDDFGHGFGG RENHALIDVG IAQDMIEQAV 

251 FV AHIVAVQQ LFFDFALVV H AFDFDDFRVF GQFARQFADR AVPSGGEQQS 

301 LTVFRRGFDD GFDVVDKAHI QHTVGFVQNQ HFQAGEIDFA ALHQVHQTAR 

351 RGDNQIDRFA QGAGLVAERC TTDDADGTEP THIFGIRQRV FLDLSRQFAG 

4 01 RRQHQRARAF ARFFAAFGQS LQSR* 



m284/a284 94.8% identity in 424 aa overlap 



10 20 30 40 50 60 

m284 .pep MPSETRNRFQTALVYAAGWGLAVFVTAFAFACKRIAG FAFAFEAFAGFFETVSLKAFFLE 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a284 MPSETRNRFQTALVYAAGWGLAVFVTAFAFACKR I AG FAFAFEAFAGFFETVSLKAFFLE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m28 4 . pep TFAARFAAEA FFARFMIAAP TAG FTIAPAAFAFRLAACGFAFAGRFHAFDGQFGQFSVNV 

I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I 
a284 TFAARFAAEAFFARFMIAAPTAGFTIAPAAFAFRLAACGFAFAGRFHAFDGQFGQFSVNV 

70 80 90 100 110 120 



130 140 150 160 170 180 

m28 4 . pep FFDFVDFDVFVHFGKRNRNTRAACAAGAPDAVDWFRLFRQVVVDNVGNGRYVDTACGNI 

I I I I I I I I I I I I I II II I I I I I I I I I I I II I I I I M I I I ! I I M I I I I I I I I I I I I M I : ft 
a2 8 4 FFDFVDFDVFVHFGKRNRNTRAACAAGAPDAVDWFRLFRQVWDNVGNGRYVDTACGNV 

130 140 150 160 170 180 - 

190 200 210 220 230 240 

m284 . pep GGNQNFAAAFTQIHQRAVAPALRHIAVQAVCGETFFVQFIRNDFGHGFGGRENHALIDIG 
I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I : I I I I I I I I I I I I I I I I : I 
a284 GGNQNFAAAFAQIHQRAVAPALRHIAVQAVGGETFFVQFIRDDFGHGFGGRENHALIDVG 

190 200 210 220 230 240 ; » 

250 260 270 280 290 300 

m284 . pep IAQDMIEQAVFVAHIVAVQQLFFDFALIVHAFDFDDFRVFGQFARQFADRAVPSGGEQQS 
I I I I I I I I I I I I I I I I I I I I I I II I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a284 IAQDMIEQAVFVAHIVAVQQLFFDFALVVHAFDFDDFRVFGQFARQFADRAVPSGGEQQS *- 

250 260 270 280 290 300 



310 320 330 340 350 360 

m284 . pep LTVARRCFH DG FDVVDKAH IQHTVGFVQNQH FQT FKIN FAALHQVHQTARRGDNQI DRFA 

III I I I I I I i I I I I I I I 1 I I I I I M 1 I I I I : : I : I I I I I I I I I I II I I I I I I I I I I 
a28 4 LTV FRRGFDDG FDVVDKAH IQHTVGFVQNQHFQAGE ID FAALHQVHQTARRGDNQI DRFA 

310 320 330 340 350 360 



370 380 390 400 410 420 

m284 . pep QGTGLVAERRAADDADGAEPTHIFGIRQRVFLDLSRQFAGRGQHQSTRAFARFFAAFGQF 

11:111111 :: I I I I I : I I I I I I I I I I I I I I I II I II I I I III : I I I I I I I I I I I I 
a284 QGAGLVAERCTTDDADGTEPTHIFGIRQRVFLDLSRQFAGRRQHQRARAFARFFAAFGQS 

370 380 390 400 410 420 



m284.pep »LQSRX 
I I I I I 

a284 LQSRX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 181>: 

g285 . seq 

1 atgaccgata ccacaccgac agataccgat ccgaccgaaa acggcacgcg 

51 caaaatgccg tctgaacacc gccccgcccc gccggcaaaa aaacgccgcc 

101 cgctgctgaa gctgtcggcg gcactgctgt ctgtcctgat tttggcagta 
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151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3151 
3201 
3251 
3301 
3351 
3401 



tgtttcctcg 
gtaccaaatc 
aaggcacact 
gagggggcag 
cgaactgatg 
tcgccatcgt 
ggcctgcccg 
cgagacgggc 
atctcgaacg 
ctcgacctga 
ctcggtcggc 
aaggcggatt 
agcctgaagg 
cctctcggga 
cattggaaga 
gtgccttccc 
gtcgttttca 
ccaaagccgg 
ggcggctttg 
cgccgccctg 
ccgaaaaaga 
gaagacgtgc 
catcggcggc 
gcacggcacg 
gaacagcgga 
cagcctgacc 
agctggacat 
tttccggcag 
aaaagagaaa 
gcgtgccgat 
ccgcgcgccg 
cggcggcttc 
ccgatttatc 
ggacaccttt 
cctttccggc 
gttcgctcga 
cgcgccgata 
tgtcgatacc 
tccgcacaca 
ttggacgctt 
catcggcatc 
gtatgacgct 
tggcaggcaa 
gaaaaccggc 
agttgcacaa 
ggcgactggg 
cagccggcaa 
tgaacgcatt 
ctgcttgacg 
cggcaacgcc 
ttacagcctc 
gccgcgcaaa 
acgggtaggc 
acgggaaaat 
accgcacctt 
attccgcaac 
ccgccgtaac 
atcaacggcg 
ggacaacggc 
acagcctgaa 
agcatggaaa 
ataccgcatc 
cccgcctgcg 
aaaactgatc 
cggcgacgat 
cgctccccgt 



gctggatcgc 
ccgtcctggt 
gctcgacggc 
accttaaaat 
cgccgcagcc 
aaccaaaccg 
acagcataga 
aaaatcagca 
cctcaacgcg 
aggccgccga 
ttgaaaaaac 
cgaaggcgaa 
atgtgcgcgc 
aaatccgtca 
agtactggtc 
tgcccgatgc 
gacggcatcg 
ctttgccgac 
tcatccggca 
ctcggacggg 
catccttgat 
tgcaaaccgc 
acgaccgcct 
cacggacggc 
aactggtgtt 
gcgcaaggct 
ccgttcccgc 
gcaatatcaa 
tttacgggca 
tgccggcagc 
ccgtcgattt 
ggcaaaaaag 
ccgtttcggt 
ccggcgattt 
acggcgcgca 
ttttaccctc 
tcaagggcgg 
gccggcctga 
cgccgccatg 
caggcggcat 
ctcgacatcg 
cgaagccggt 
tgggcggcag 
atatcggcaa 
tttcttcaaa 
atgtcgccta 
agcggcgatg 
ttccctgaaa 
gcggcgcgcg 
ttcggcggca 
ccttcccgac 
acattaccgg 
tctccgtccg 
caacggcaat 
tgggcggcag 
ttcctaccgg 
cctcggcggc 
acaagctcta 
tcgctgcgtt 
attccggcac 
acagcgtgcc 
ctgtcccgcc 
ctattcgccg 
aggggctgtt 
gtcgtcgtat 
caatatgaac 



cggtacggaa 
tcggcgtaaa 
ttcgacggcg 
cagccgcttc 
tgcacatcac 
actccgccta 
cctgcccgcc 
tgggcaaaac 
gcataccgtt 
cacgccgtgg 
cgtttgccct 
accatacaca 
cgaactgacg 
tccacccgtt 
aaaggattca 
cgggctgaat 
cgctggaagg 
cgcaacggca 
ggacggcacg 
gcggcatcag 
ttaaatatag 
gttcaaaggc 
cgcccaaaat 
agcctcccca 
cgacaccgtc 
atctcgagct 
gcattcgacc 
cggttcgatt 
aaatgcgttt 
gccgacattg 
gcggttgggg 
gcgaccggct 
ttcggactcg 
ggacggcggc 
acttacacat 
aaaggctcac 
ccgcctttcc 
cgctggaagg 
acgctggacg 
caacagggaa 
gcggcgcatt 
gcggaacacg 
cctcaacctg 
aaggcggcgc 
ccgcccttcg 
cgggcacaac 
ccgtattgcc 
acgcgctttc 
tttcggacgg 
atatggcaaa 
ttgggcgcat 
cagcctgaat 
tcaatgccgc 
atcaccgtcg 
gctcaacctg 
tcggacaaac 
agcatcgccg 
ttaccgcaac 
cgcatattgc 
gaagggacgg 
cgatgtcgat 
ccaaccgccg 
caaaaaggca 
cggttcgcaa 
tgggcgaagt 
ctgactttag 



gcaggtttgc 
catttcctcc 
acaactggtc 
cgcttcgcgt 
cgacatctcc 
aagaagaacg 
gctgtctatc 
ctttgacaaa 
acgaccgtaa 
agcagttcgt 
cgataccgcc 
gtacggcgcg 
atcgacggcg 
tgccgaatca 
acatcaatcc 
ttcgacctga 
ctcgctcgat 
tccccgtccg 
gtgcatatcg 
gctgtcgggc 
gcatcaactc 
aggttggacg 
ctcttggcaa 
tcgcaagcga 
aacatctccg 
gtttaaagac 
cttcgcgcat 
catcttgccg 
tttgcccggt 
tttacgagtc 
cggaacatcg 
taacctcaat 
cggggtcttt 
atccgaacct 
cggcaaagcg 
ccggcacaag 
ctgtcgggcg 
tacgggcgcg 
gcaaaccgtt 
cttacccgat 
caacctcaag 
tggcggcaag 
caacactttt 
acgcggcctg 
aacacaatct 
gcgcgcggct 
cggcgggcag 
aaaacgaccg 
attaacgccg 
tacaccgctc 
tgaagccctt 
gcctccgcgc 
cgtcaacggt 
ggcaaagccg 
accgttgccg 
cgtcaaaggc 
acccgcactt 
caaacccaag 
aggcaggaaa 
cggaactctc 
atcggcgcgg 
cctgacggtt 
tatccgttac 
aaatcctcga 
caagaaagag 
acctcaatga 



gcttcgggct 
caaaacctca 
gatagaaacc 
ggaaaccgtc 
gccggcgaca 
cccgcctcaa 
tcgaccgctt 
caaaccgtct 
agggcaccgc 
cggggtcagc 
atttacacca 
gctgagcggc 
gcaatatccg 
ttggataaaa 
gtccgccttc 
ccgccatccc 
ttggaaaaca 
tcaggttttg 
gcaatacgtc 
aaaatcgaca 
cgtcggcgcg 
gcagcatcgg 
ctcggcaccg 
ccccgcaaac 
ccggggaagg 
cgcctgctca 
cgatccgcaa 
gtgaactggc 
acgttcaacg 
ccgccacctt 
tcaaaacaga 
atcaccgcac 
aaatgtacgc 
ttgaaaccga 
gcagacatcc 
ccgcccgatg 
gcgcggcggt 
cagcaccgca 
caaactcgat 
ggaaaggcag 
ctgcaaaacc 
tgcggcaaat 
cttgggacag 
cacatcgccg 
ggttttaaac 
acctcaatat 
gctttgggtt 
catcggaatc 
atttgggcat 
ggcggcagga 
tctgcccgcc 
aaatcggcgg 
agcagcaact 
ctccttcgat 
atgccgaagc 
agcctgaatg 
gggcggcagt 
gcatcatctt 
tgggtaatcg 
cggcacggtc 
tgttcgacaa 
tccggcaaca 
cggtatgatt 
tgccgtccgt 
gcggcggcat 
cggcatccgc 
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3451 
3501 
3551 
3601 
3651 
3701 
3751 
3801 
3851 
3901 
3951 
4001 
4051 
4101 
4151 



ttctccggct 
cgcgcaaccg 
aagggcgtta 
gtctcctttg 
acgccgcctt 
acagcccgcg 
• aagctctcct 
caatgccgcc 
acgaccgcat 
cgcaacgcgc 
cggcaaacaa 
ccagcgcgga 
caggcggttg 
caccatacgt 
gaaacggcaa 



acggcgcgga 
ggcggaaatg 
caaagcatac 
tcggcccgct 
tcccccgtcg 
cattacgctg 
ggctcatcct 
ctgtccgcag 
cgggctggtg 
aaaccggcga 
ctgaccggca 
acagtccgtc 
cccgtatcgg 
ttcgaccgcc 
agggaaataa 



cgttaccata 
tgcgtggggt 
gggcaggatt 
caacgacccc 
gtgcgggcgt 
acggcaaacg 
caaccgtgcc 
ccgcaggcgc 
gatgatttgg 
actcaacccc 
aactctacat 
aaactgattt 
cagccgttcg 
tcttcggttc 



ggcggcaaac 
gggcacggtc 
tagacattac 
aacctgaaca 
ggaaatattg 
aaccgatgag 
ggcagcggca 
gctgcttgcc 
gctttaccag 
gccgaacagg 
cggctacgaa 
accggctgac 
tcgggcggcg 
ggacaaaaaa 



tgaccctgac 
cgcgtcatca 
caaaggcaca 
tccgcgccga 
ggcagcctca 
tgaaaaagac 
gcagcggcga 
gggcaaatca 
caagcgcagc 
tgctgaccgt 
tacggcatct 
ccgcgccata 
agctgacata 
gactccgcag 



This corresponds to the amino acid sequence <SEQ ID 1 182; ORF 285.ng>: 

g285.pep 
1 
51 



MTDTTPTDTD PTENGTRKMP SEHRPAPPAK KRRPLLKLS A ALLSVLILAV 
CFLGWIAGTE AGLRFGLYQI PSWFGVNISS QNLKGTLLDG FDGDNWSIET 



101 EGADLKISRF RFAWKPSELM RRSLHITDIS AGDIAIVTKP TPPKEERPPQ 

151 GLPDSIDLPA AVYLDRFETG KISMGKTFDK QTVYLERLNA AYRYDRKGHR 

201 LDLKAADTPW SSSSGSASVG LKKPFALDTA IYTKGGFEGE TIHSTARLSG 

251 SLKDVRAELT IDGGNIRLSG KSVIHPFAES LDKTLEEVLV KGFNINPSAF 

301 VPSLPDAGLN FDLTAIPSFS DGIALEGSLD LENTKAGFAD RNGIPVRQVL 

351 GGFVIRQDGT VHIGNTSAAL LGRGGIRLSG KIDTEKDILD LNIGINSVGA 

401 EDVLQTAFKG RLDGSIGIGG TTASPKISWQ LGTGTARTDG SLPIASDPAN 

4 51 EQRKLVFDTV NISAGEGSLT AQGYLELFKD RLLKLDIRSR AFDPSRIDPQ 

501 FPAGNINGSI HLAGELAKEK FTGKMRFLPG TFNGVPIAGS ADIVYESRHL 

551 PRAAVDLRLG RNIVKTDGGF GKKGDRLNLN ITAPDLSRFG FGLAGSLNVR 

601 GHLSGDLDGG IRTFETDLSG TARNLHIGKA ADIRSLDFTL KGSPGTSRPM 

651 RADIKGGRLS LSGGAAVVDT AGLTLEGTGA QHRIRTHAAM TLDGKPFKLD 

7 01 LDASGGINRE LTRWKGSIGI LDIGGAFNLK LQNRMTLEAG AEHVAASAAN 

7 51 WQAMGGSLNL QHFSWDRKTG ISAKGGARGL HIAELHNFFK PPFEHNLVLN 

801 GDWDVAYGHN ARGYLNISRQ SGDAVLPGGQ ALGLNAFSLK TRFQNDRIGI 

851 LLDGGARFGR INADLGIGNA FGGNMANTPL GGRITASLPD LGALKPFLPA 

901 AAQNITGSLN ASAQIGGRVG SPSVNAAVNG SSNYGKINGN ITVGQSRSFD 

951 TAPLGGRLNL TVADAEAFRN FLPVGQTVKG SLNAAVTLGG SIADPHLGGS 

1001 INGDKLYYRN QTQGIILDNG SLRSHIAGRK WVIDSLKFRH EGTAELSGTV 

1051 SMENSVPDVD IGAVFDKYRI LSRPNRRLTV SGNTRLRYSP QKGISVTGMI 

1101 KTDQGLFGSQ KSSMPSVGDD VVVLGEVKKE AAASLPVNMN LTLDLNDGIR 

1151 FSGYGADVTI GGKLTLTAQP GGNVRGVGTV RVIKGRYKAY GQDLDITKGT 

1201 VSFVGPLNDP NLNIRAERRL SPVGAGVEIL GSLNSPRITL TANEPMSEKD 

1251 KLSWLILNRA GSGSSGDNAA LSAAAGALLA GQINDRIGLV DDLG FTSKRS 

1301 RNAQTGELNP AEQVLTVGKQ LTGKLYIGYE YGISSAEQSV KLIYRLTRAI 

1351 QAVARIGSRS SGGELTYTIR FDRLFGSDKK DSAGNGKGK* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 183>: 

m285 . seq 

1 ATGACCGATA CCGCACCGAC AGATACCGAT CCGACCGAAA ACGGCACGCG 

51 CAAAATGCCG TCTGAACACC GCCCTACCCC GCCGGCAAAA AAACGCCGCC 

101 CGTTGCTGAA GCTGTCGGCG GCACTGCTGT CTGTCCTGAT TTTGGCAGTA 

151 TGTTTCCTCG GCTGGCTCGC CGGTACGGAA GCAGGTTTGC GCTTCGGGCT 

201 GTACCAAATC CCGTCTTGGT TCGGCGTAAA CATTTCCTCC CAAAACCTCA 

251 AAGGCACGCT GCTCGACGGC TTCGACGGCG ACAACTGGTC GATAGAAACC 

301 GAGGGGGCAG ACCTTAAAAT CAGCCGCTTC CGCTTCGCGT GGAAACCGTC 

351 CGAACTGATG CGCCGCAGCC TGCACATTAC CGAAATTTCC GCCGGCGACA 

4 01 TCGCCATCGT TACCAAACCG ACTCCGCCTA AAGAAGAACG CCCGCCGCTC 

4 51 AGCCTTCCCG ACAGCATAGA CCTGCCTGCC GCCGTCTATC TCGACCGCTT 

501 CGAGACGGGC AAAATCAGCA TGGGCAAAGC CTTTGACAAA CAAACCGTCT 

551 ATCTCGAACG GCTGGATGCT TCATACCGTT ACGACCGCAA AGGACACCGC 

601 CTTGACCTGA AGGCCGCCGA CACGCCGTGG AGCAGTTCGT CGGGGGCGGC 

651 CTCGGTCGGC TTGAAAAAAC CGTTTGCCCT CGATACCGCC ATTTACACCA 

7 01 AAGGCGGACT CGAAGGCAAA ACCATACACA GTACGGCTCG GCTGAGCGGC 

7 51 AGCCTGAAGG ATGTGCGCGC CGAACTGGCG ATCGACGGCG GCAATATCCG 
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801 CCTCTCGGGA AAATCCGTCA TCCACCCGTT TGCCGAATCA TTGGATAAAA 
851 CATTGGAAGA AGTACTGGTC AAAGGGTTCA ACATCAATCC GGCCGCCTTC 
901 GTGCCTTCCC TGCCCGATGC CGGACTGAAT TTCGACCTGA CCGCCATCCC 
951 GTCGTTTTCA GACGGCATCG CGCTGGAAGG TTCGCTCGAT TTGGAAAACA 
1001 CCAAAGCCGG CTTTGCCGAC CGCAACGGCA TCCCCGTCCG TCAGGTTTTA 
1051 GGCGGCTTTG TCATCCGGCA GGACGGCACG GTGCATATCG GCAATACGTC 
1101 CGCCGCCCTG CTCGGACGGG GCGGCATCAG GCTGTCGGGC AAAATCGACA 
1151 CCGAAAAAGA CATCCTCGAT TTAAATATAG GCATCAACTC CGTCGGCGCG 
1201 GAAGACGTAC TGCAAACCGC GTTCAAAGGC AGGTTGGACG GCAGCATCGG 
1251 CATCGGTGGC ACGACCGCCT CGCCCAAAAT CTCTTGGCAA CTCGGCATCG 
1301 GCACGGCGCG CACGGACGGC AGCCTCGCCA TTGCAAGCGA CCCAGCAAAC 
.1351 GGACAGCGGA AACTGGTGCT CGACACCGTC AACATCGCCG CCGGGCAAGG 
14 01 CAGCCTGACC GCGCAAGGCT ATCTCGAGCT GTTTAAAGAC CGCCTGCTCA 

14 51 AGCTGGACAT CCGTTCCCGC GCATTCGACC CTTCGCGCAT CGATCCGCAA 
1501 CTTCCGGCAG GCAATATCAA CGGCTCAATA AACCTTGCCG GCGAACTGGC 

1551 AAAAGAGAAA TTCACAGGCA AAATGCGGTT TTTACCCGGC ACGTTCAACG 

1601 GCGTACCGAT TGCCGGCAGT GCCGACATTG TTTACGAGTC CCGCCACCTT 

1651 CCGCGTGCCG CCGTCGATTT GCGGCTGGGG CGGAACATTA TTAAAACAGA 

1701 CGGCGGCTTC GGCAAAAAAG GCGACCGGCT TAACCTCAAT ATCACCGCAC 

1751 CCGATTTATC CCGTTTCGGT TTCGGACTCG CGGGGTCTTT AAATGTACGC 

1801 GGACACCTTT CCGGTGATTT GGACGGCGGC ATCCGAACCT TTGAAACCGA 

1851 CCTTTCCGGC GCGGCGCGCA ACCTGCACAT CGGCAAGGCG GCAGACATCC 

1901 GTTCGCTCGA TTTCACGCTC AAAGGTTCGC CCGACACAAG CCGCCCGATA 

1951 CGCGCCGACA TCAAAGGCAG CCGCCTTTCG CTGTCGGGCG GAGCGGCGGT 

2001 TGTCGATACC GCCGACCTGA TGCTGGACGG CACGGGCGTG CAGCACCGCA 

2051 TCCGCACACA CGCCGCCATG ACGCTGGATG GCAAACCGTT CAAATTCGAT 

2101 TTGGACGCTT CAGGCGGCAT CAACAGGGAA CTTACCCGAT GGAAAGGCAG 

2151 CATCGGCATC CTCGACATCG GCGGCGCATT CAACCTCAAG CTGCAAAACC 

2201 GTATGACGCT CGAAGCCGGT GCGGAACGCG TGGCGGCAAG TGCGGCAAAT 

2251 TGGCAGGCAA TGGGCGGCAG CCTCAACCTG CAACACTTTT CTTGGGATAA 

2301 AAAAACCGGC ATATCGGCAA AAGGCGGCGC ACACGGTCTG CATATCGCCG 

2351 AGTTGCACAA TTTCTTCAAA CCGCCCTTCG AACACAATCT GGTTTTAAAC 

24 01 GGCGACTGGG ATGTCGCCTA CGGGCGCAAC GCGCGCGGCT ACCTCAATAT 

24 51 CAGCCGGCAA AGCGGCGATG CCGTATTGCC CGGCGGGCAG GCTTTGGGTT 

2501 TGAACGCATT TTCCCTGAAA ACGCGCTTTC AAAACGACCG CATCGGAATC 

2551 CTGCTTGACG GCGGCGCGCG TTTCGGGCGG ATTAACGCCG ATTTGGGCAT 

2 601 CGCCAACGCC TTCGGCGGCA ATATGGCAAA TGCACCGCTC GGCGGCAGGA 

2 651 TTACCGCCTC CCTTCCCGAC TTGGGCGCAT TGAAGCCCTT TCTGCCCGCC 

27 01 GCCGCGCAAA ACATTACCGG CAGCCTGAAT GCCGCCGCGC AAATCGGCGG 

27 51 ACGGGTAGGC TCTCCGTCCG TCAATGCCGC CGTCAACGGC AGCAGCAACT 

2801 ACGGGAAAAT CAACGGCAAC ATCACCGTCG GGCAAAGCCG CTCTTTCGAT 

2851 ACCGCGCCTT TGGGCGGCAG GCTCAACCTG ACCGTTGCCG ATGCCGAAGT 

2 901 ATTCCGCAAC TTCCTACCGG TCGGACAAAC CGTCAAAGGC AGCCTGAATG 

2 951 CCGCCGTAAC CCTCGGCGGC AGCATCGCCG ATCCGCACTT GGGCGGCAGC 

3001 ATCAACGGCG ACAAACTCTA TTACCGCAAC CAAACCCAAG GCATCATCTT 

3051 GGACAACGGC TCGCTGCGTT CGCATATCGC GGGCAGGAAA TGGGTAATCG 

3101 ACAGCCTGAA ATTCCGGCAC GAAGGGACGG CGGAACTCTC CGGTACGGTC 

3151 GGTATGGAAA ACAGCGGACC CGATGTCGAT ATCGGCGCGG TGTTCGACAA 

3201 ATACCGCATC CTGTCCCGCC CCAACCGCCG CCTGACGGTT TCCGGCAACA 

3251 CCCGCCTGCG CTATTCGCCG CAAAAAGGCA TATCCGTTAC CGGGATGATT 

3301 AAAACGGATC AGGGGCTGTT CGGTTCGCAA AAATCCTCGA TGCCGTCCGT 

3351 CGGCGACGAT GTCGTCGTAT TAGGCGAAGT CAAAAAAGAG GCGGCGGCAC 

34 01 CGCTCCCCGT CAATATGAAC CTGACTTTAG ACCTCAATGA CGGCATCCGC 

34 51 TTCGCCGGCT ACGGCGCGGA CGTTACCATA GGCGGCAAAC TGACCCTGAC 

3501 CGCCCAATCG GGCGGAAGCG TACGGGGCGT GGGCACGGTC CGCGTCATCA 

3551 AAGGGCGTTA TAAGGCATAC GGGCAGGATT TGGACATTAC CAAAGGCACG 

3601 GTCTCCTTTG TCGGCCCGCT CAACGATCCC AACCTCAACA TCCGCGCCGA 

3651 ACGCCGCCTT TCCCCCGTCG GTGCGGGCGT GGAAATATTG GGCAGCCTCA 

3701 ACAGCCCGCG CATTACGCTG ACGGCAAACG AACCGATGAG TGAAAAAGAC 

3751 AAGCTCTCTT GGCTCATCCT CAACCGCGCC GGCAGCGGCA GCAGCGGCGA 

3801 CAATGCCGCC CTGTCTGCAG CCGCAGGTGC GCTGCTTGCC GGGCAAATCA 

3851 ACGACCGCAT CGGGCTGGTG GATGATTTGG GCTTTACCAG CAAGCGCAGC 

3901 CGCAACGCGC AAACCGGCGA ACTCAACCCC GCCGAACAGG TGCTGACCGT 

3951 CGGCAAACAA CTGACCGGCA AACTCTACAT CGGCTACGAA TACAGCATCT 

4 001 CCAGCGCGGA ACAGTCCGTC AAACTGATTT ACCGGCTGAC CCGCGCCATA 

4 051 CAGGCGGTTG CCCGTATCGG CAGCCGTTCG TCGGGCGGCG AGCTGACATA 
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4101 CACCATACGT TTCGACCGCT TCTCCGGTTC GGACAAAAAA GACTCCGCCG 
4151 GAAACGGCAA AGGAAAATAA 

This corresponds to the amino acid sequence <SEQ ID 1 1 84; ORP 285>: 

m285 .pep 

1 MTDTAPTDTD PTENGTRKMP SEHRPTPPAK KRRPLLKLS A ALLS VLI LAV 

51 CFLGWL AGTE AGLRFGLYQI PSWFGVNISS QNLKGTLLDG FDGDNWSIET 

101 EGADLKISRF RFAWKPSELM RRSLHITEIS AGDIAIVTKP TPPKEERPPL 

151 SLPDSIDLPA AVYLDRFETG KISMGKAFDK QTVYLERLDA SYRYDRKGHR 

201 LDLKAADTPW SSSSGAASVG LKKPFALDTA IYTKGGLEGK TIHSTARLSG 

251 SLKDVRAELA IDGGNIRLSG KSVIHPFAES LDKTLEEVLV KGFNINPAAF 

301 VPSLPDAGLN FDLTAIPSFS DGIALEGSLD LENTKAGFAD RNGIPVRQVL 

351 GGFVIRQDGT VHIGNTSAAL LGRGGIRLSG KIDTEKDILD LNIGINSVGA 

4 01 EDVLQTAFKG RLDGSIGIGG TTASPKISWQ LGIGTARTDG SLAIASDPAN 

4 51 GQRKLVLDTV NIAAGQGSLT AQGYLELFKD RLLKLDIRSR AFDPSRIDPQ 

501 LPAGNINGSI NLAGELAKEK FTGKMRFLPG TFNGVPIAGS ADIVYESRHL 

551 PRAAVDLRLG RNIIKTDGGF GKKGDRLNLN ITAPDLSRFG FGLAGSLNVR 

601 GHLSGDLDGG IRTFETDLSG AARNLHIGKA ADIRSLDFTL KGSPDTSRPI 

651 RADIKGSRLS LSGGAAWDT ADLMLDGTGV QHRIRTHAAM TLDGKPFKFD 

701 LDASGGINRE LTRWKGSIGI LDIGGAFNLK LQNRMTLEAG AERVAASAAN 

751 WQAMGGSLNL QHFSWDKKTG ISAKGGAHGL HIAELHNFFK PPFEHNLVLN 

801 GDWDVAYGRN ARGYLNISRQ SGDAVLPGGQ ALGLNAFSLK TRFQNDRIGI 

851 LLDGGARFGR INADLGIANA FGGNMANAPL GGRITASLPD LGALKPFLPA 

901 AAQNITGSLN AAAQIGGRVG SPSVNAAVNG SSNYGKINGN ITVGQSRSFD 

951 TAPLGGRLNL TVADAEVFRN FLPVGQTVKG SLNAAVTLGG SIADPHLGGS 

1001 INGDKLYYRN QTQGIILDNG SLRSHIAGRK WVIDSLKFRH EGTAELSGTV 

1051 GMENSGPDVD IGAVFDKYRI LSRPNRRLTV SGNTRLRYSP QKGISVTGMI 

1101 KTDQGLFGSQ KSSMPSVGDD VVVLGEVKKE AAAPLPVNMN LTLDLNDGIR 

1151 FAGYGADVTI GGKLTLTAQS GGSVRGVGTV RVIKGRYKAY GQDLDITKGT 

1201 VSFVGPLNDP NLNIRAERRL SPVGAGVEIL GSLNSPRITL TANEPMSEKD 

1251 KLSWLILNRA GSGSSGDNAA LSAAAGALLA GQINDRIGLV DDLG FTSKRS 

1301 RNAQTGELNP AEQVLTVGKQ LTGKLYIGYE YSISSAEQSV KLIYRLTRAI 

1351 QAVARIGSRS SGGELTYTIR FDRFSGSDKK DSAGNGKGK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m285/g2B5 96.5% identity in 1389 aa overlap 

10 20 30 40 50 60 

MTDTAPTDTDPTENGTRKMPSEHRPTPPAKKRRPLLKLSAALLSVLILAVCFLGWLAGTE 
I I I I : I I I I I I I I I I S ! I I 1 I I I I I : I I I I I i I I I ! I i 1 I I I I I I ! I I I I ! I I II : I I I ! 
MTDTTPTDTDPTENGTRKMPSEHRPAPPAKKRRPLLKLSAALLSVLILAVCFLGWIAGTE 
10 20 30 40 50 60 

70 80 90 100 110 120 

AGLRFGLYQI PSWFGVNISSQNLKGTLLDGFDGDNWSIETEGADLKISRFRFAWKPSELM 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I || I I | | | | | | | M I I 
AGLRFGLYQI PSWFGVN I SSQNLKGTLLDGFDGDNWS I ETEGADLKISRFRFAWKPSELM 
70 80 90 100 110 120 

130 140 150 160 170 180 

RRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDSIDLPAAVYLDRFETGKISMGKAFDK 
I I I I I I I : I I II I I I I I I II I I I I I I I I I : I I II M I I I I I I I I I I I I I I I I I I I : I I I 
RRSLHITDISAGDIAIVTKPTPPKEERPPQGLPDSIDLPAAVYLDRFETGKISMGKTFDK 
130 140 150 160 170 180 

190 200 210 220 230 240 

QTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSGAASVGLKKPFALDTAIYTKGGLEGK 
I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I (: I I I I I I I I I I I I I I I I I I I I : I I : 
QTVYLERLNAAYRYDRKGHRLDLKAADTPWSSSSGSASVGLKKPFALDTAIYTKGGFEGE 
190 200 210 220 230 240 
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I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I : | | 
g285 TIHSTARLSGSLKDVRAELTIDGGNIRLSGKSVIHPFAESLDKTLEEVLVKGFNINPSAF 

250 260 270 280 290 300 

310 320 330 340 350 360 

m285.pep VPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTKAGFADRNGIPVRQVLGGFVIRQDGT 
I I I I I I I I I II I I I I I I I I I 1 1 I I I ! I I I I I I I I I I I I I I I I I I I I I I I | | I | M I I I I I 

g285 VPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTKAGFADRNGIPVRQVLGGFVIRQDGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m285 . pep VHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGINSVGAE DVLQTAFKGRLDGSIGIGG 

I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I II I I I I I I I I I | || | | 
g285 VHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIG INS VGAE DVLQTAFKGRLDGSIGIGG 

370 380 390 400 410 420 

430 440 450 460 470 480 

m285.pep TTAS PKI SWQLGIGTARTDGSLAIASDPANGQRKLVLDTVN I AAGQGSLTAQGYLELFKD 

II I I I I I 1 I I I I I I I I I I I I I I M I I I I 11 I f I : I I I I I : I I : I I I I I I II I I I I I I 
g285 TTAS PKI SWQLGTGTARTDGSLPIASDPANEQRKLVFDTVNISAGEGSLTAQGYLELFKD 

430 440 450 460 470 480 

490 500 510 520 530 540 

m285 . pep RLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGELAKEKFTGKMRFLPGTFNGVPIAGS 
I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I : I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
g285 RLLKLDIRSRAFDPSRIDPQFPAGNINGSIHLAGELAKEKFTGKMRFLPGTFNGVPIAGS 

490 500 510 520 530 540 

550 560 570 580 590 600 

m285.pep ADIVYESRHLPRAAVDLRLGRNIIKTDGGFGKKGDRLNLNITAPDLSRFGFGLAGSLNVR 
I I I M I I I I I I I I I I I I I I I I I I : I I i I I I I I I I I I I I I I i I I I I I I I I I ! I I I I I I I I I 
g285 ADIVYESRHLPRAAVDLRLGRNIVKTDGGFGKKGDRLNLNITAPDLSRFG FGLAGSLNVR 

550 560 570 580 590 600 

610 620 630 640 650 660 

m285.pep GHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRSLDFTLKGSPDTSRPIRADIKGSRLS 
. I I I I I II I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I i I I : I I I 
g285 GHLSGDLDGGIRTFETDLSGTARNLHIGKAADIRSLDFTLKGSPGTSRPMRADIKGGRLS 

610 620 630 640 650 660 

670 680 690 700 710 720 

m285.pep LSGGAAVVDTADLMLDGTGVQHRIRTHAAMTLDGKPFKFDLDASGGINRELTRWKGSIGI 
I I II I I I I I I I I I : I I I : I I I I I I I I I I I II I I I I I : I I I I I I I I I I I I I II I I I I I I 
g285 LSGGAAVVDTAGLTLEGTGAQHRIRTHAAMTLDGKPFKLDLDASGGINRELTRWKGSIGI 

670 680 690 700 710 720 

730 740 750 760 770 780 

m285.pep LDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMGGSLNLQHFSWDKKTGISAKGGAHGL 
I I I I I I I II I I I I I I I I I I I I I: I I I I t I 1 I I I I I I I I I I I I I I I I : I 11 I I I I I I I : I I 
g285 LDIGGAFNLKLQNRMTLEAGAEHVAASAANWQAMGGSLNLQHFSWDRKTGISAKGGARGL 

730 740 750 760 770 780 

790 800 810 820 830 840 

m285 . pep HIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYLNISRQSGDAVLPGGQALGLNAFSLK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I II II I I I I I I I I I I I II I I I I I I I I 
g285 HIAELHNFFKPPFEHNLVLNGDWDVAYGHNARGYLNISRQSGDAVLPGGQALGLNAFSLK 

790 800 810 820 830 840 

850 860 870 880 890 900 

m285 . pep TRFQNDRIGILLDGGARFGRINADLGIANAFGGNMANAPLGGRITASLPDLGALKPFLPA 
I I I I I M I I I i I I I I I I I I I I I I I I I I : I I I I I I I I I : I I I I I I I I I I I I I I I I [ I I M I 
g285 TRFQNDRIGILLDGGARFGRINADLGIGNAFGGNMANTPLGGRITASLPDLGALKPFLPA 

850 860 870 880 890 900 



m285.pep 
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AAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 
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I I I I I I II I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I | | | | J I | | | | | 
g285 AAQNITGSLNASAQIGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 

910 920 930 940 950 960 



970 980 990 1000 1010 1020 

m285 . pep TVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADPHLGGSINGDKLYYRNQTQGIILDNG 
I I II I I : I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I | | 
g285 TVADAEAFRNFLPVGQTVKGSLNAAVTLGGSIADPHLGGSINGDKLYYRNQTQGIILDNG 

970 980 990 1000 1010 1020 

1030 1040 1050 1060 1070 1080 

m285.pep SLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENSGPDVDIGAVFDKYRILSRPNRRLTV 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I II I I I I I I I I I I 
g285 SLRSHIAGRKWVIDSLKFRHEGTAELSGTVSMENSVPDVDIGAVFDKYRILSRPNRRLTV 

1030 1040 1050 1060 1070 1080 

1090 1100 1110 1120 1130 1140 

m285.pep SGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMPSVGDDVVVLGEVKKEAAAPLPVNMN 
I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I MINI 
g285 SGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMPSVGDDVVVLGEVKKEAAASLPVNMN 

1090 1100 1110 1120 1130 1140 



1150 1160 1170 1180 1190 1200 

m285 . pep LTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVRGVGTVRVIKGRYKAYGQDLDITKGT 
I I II II I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I 
g285 LTLDLNDGIRFSGYGADVTIGGKLTLTAQPGGNVRGVGTVRVIKGRYKAYGQDLDITKGT 

1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

m285.pep . VSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNSPRITLTANEPMSEKDKLSWLILNRA 

I I I I I 1 I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I 

g285 VS FVGPLNDPNLN IRAERRLS PVGAGVE I LGSLNS PRI TLTANEPMSEKDKLS WL I LNRA 

1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

m285 . pep GSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGFTSKRSRNAQTGELNPAEQVLTVGKQ 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I II I I I i I I I I I 
g285 GSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGFTSKRSRNAQTGELN PAEQVLTVGKQ 

1270 1280 1290 1300 1310 1320 



1330 1340 1350 1360 1370 1380 

m285.pep LTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRFSGSDKK 
. I I I I I I I I I I I : I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
g285 LTGKLYIGYEYGISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRLFGSDKK 

1330 1340 1350 1360 1370 1380 



1390 

m2 8 5 . pep DSAGNGKGKX 
I I I I I I I I I I 
g285 DSAGNGKGKX 



The following partial DNA sequence 



a285 . seq 






1 


ATGACCGATA 


CCGCACCGAC 


51 


CAAAATGCCG 


TCTGAACACC 


101 


CGCTGCTGAA 


GCTGTCGGCG 


151 


TGTTTCCTCG 


GCTGGCTCGC 


201 


GTACCAAATC 


CCGTCTTGGT 


251 


AAGGCACGCT 


GCTCGACGGC 


301 


GAGGGGGCAG 


ACCTTAAAAT 


351 


CGAACTGATG 


CGCCGCAGCC 


401 


TCGCCATCGT 


TACCAAACCG 


451 


AGCCTTCCCG 


ACAGCATAGA 


501 


CGAGACGGGC 


AAAATCAGCA 


551 


ATCTCGAACG 


GCTGGATGCT 


601 


CTCGACCTGA 


AGGCTGCCGA 



identified in N. meningitidis <SEQ ID 1 185>: 

AGATACCGAT CCGACCGAAA ACGGCACGCG 
GCCCTACCCC GCCGGCAAAA AAACGCCGCC 
GCACTGCTGT CTGTTCTGAT TTTGGCAGTA 
CGGCACGGAA GCGGGTTTGC GCTTCGGGCT 
TCGGCGTAAA CATTTCCTCC CAAAACCTCA 
TTCGACGGCG ACAACTGGTC GATAGAAACC 
CAGCCGCTTC CGCTTCGCGT GGAAACCGTC 
TGCACATTAC CGAAATTTCC GCCGGCGACA 
ACTCCGCCTA AAGAAGAACG CCCGCCGCTC 
CCTGCCTGCC GCCGTCTATC TCGACCGCTT 
TGGGCAAAGC CTTTGACAAA CAAACCGTCT 
TCATACCGTT ACGACCGCAA AGGACACCGC 
CACGCCGTGG AGCAGTTCGT CGGGGTCAGC 



WO 99/57280 



PCT/US99/09346 



651 CTCGGTCGGC TTGAAAAAAC 

7 01 AAGGCGGACT CGAAGGCAAA 

751 AGCCTGAAGG ATGTGCGCGC 

801 CCTCTCGGGA AAATCCGTCA 

851 CATTGGAAGA AGTACTGGTC 

901 GTGCCTTCCC TGCCCGATGC 

951 GTCGTTTTCA GACGGCATCG 

1001 CCAAAGCCGG CTTTGCCGAC 

1051 GGCAGCTTTG TCATCCGGCA 

1101 CGTCGCCCTG CTCGGACGGG 

1151 CCGAAAAAGA CATCCTCGAT 

1201 GAAGACGTAC TGCAAACCGC 

1251 CATCGGTGGC ACGACCGCCT 

1301 GCACGGCGCG CACGGACGGC 

1351 GGACAGCGGA AACTGGTGCT 

14 01 CAGCCTGACC GCGCAAGGCT 

14 51 AGCTGGACAT CCGTTCCCGC 

1501 CTTCCGGCAG GCAATATCAA 

1551 AAAAGAGAAA TTCACAGGCA 

1601 GCGTACCGAT TGCCGGCAGT 

1651 CCGCGTGCCG CCGTCGATTT 

1701 CGGCGGCTTC GGCAAAAAAG 

1751 CCGATTTATC CCGTTTCGGT 

1801 GGACACCTTT CCGGCGATTT 

1851 CCTTTCCGGC GCGGCGCGCA 

1901 GTTCGCTCGA TTTCACGCTC 

1951 CGCGCCGACA TCAAAGGCAG 

2001 TGTCGATACC GCCGACCTGA 

2051 TCCGCACACA CGCCGCCATG • 

2101 TTGGACGCTT CAGGCGGCAT 

2151 CATCGGCATC CTCGACATCG 

2201 GTATGACGCT CGAAGCCGGT 

2251 TGGCAGGCAA TGGGCGGCAG 

2301 AAAAACCGGC ATATCGGCAA 

2351 AGTTGCACAA TTTCTTCAAA 

2 401 GGCGACTGGG ATGTCGCCTA 

2 4 51 CAGCCGGCAA AGCGGCGATG 

2501 TGAACGCATT TTCCCTGAAA 

2551 CTGCTTGACG GCGGCGCGCG 

2 601 CGGCAACGCC TTCGGCGGCA 

2 651 TTACCGCCTC CCTTCCCGAC 

2701 GCCGCGCAAA ACATTACCGG 

27 51 ACGGGTCGGC TCTCCGTCCG 

2801 ACGGGAAAAT CAACGGCAAC 

2851 ACCGCGCCTT TGGGCGGCAG 

2901 ATTCCGCAAC TTCCTACCGG 

2 951 CCGCCGTAAC CCTCGGCGGC 
3001 ATCAACGGCG ACAAACTCTA 
3051 GGACAACGGC TCGCTGCGTT 
3101 ACAGCCTGAA ATTCCGGCAC 
3151 GGTATGGAAA ACAGCGGACC 
3201 ATACCGCATC CTGTCCCGCC 
3251 CCCGCCTGCG CTATTCGCCG 
3301 AAAACGGATC AGGGGCTGTT 
3351 CGGCGACGAT GTCGTCGTAT 
3401 CGCTCCCCGT CAATATGAAC 
34 51 TTCGCCGGCT ACGGCGCGGA 
3501 CGCCCAATCG GGCGGAAGCG 
3551 AAGGGCGTTA TAAGGCATAC 

3 601 GTCTCCTTTG TCGGCCCGCT 
3 651 ACGCCGCCTT TCCCCCGTCG 
3701 ACAGTCCGCG CATTACGCTG 
3751 AAGCTCTCCT GGCTCATCCT 
3801 CAATGCCGCC CTGTCCGCAG 
3851 ACGACCGCAT CGGGCTGGTG 
3 901 CGCAACGCGC AAACCGGCGA 
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CGTTTGCCCT CGATACCGCC ATTTACACCA 
ACCATACACA GTACGGCTCG GCTGAGCGGC 
CGAACTGGCG ATCGACGGCG GCAATATCCG 
TCCACCCGTT TGCCGAATCA TTGGATAAAA 
AAAGGGTTCA ACATCAATCC GTCCGCCTTC 
CGGGCTGAAT TTCGACCTGA CCGCCATCCC 
CGCTGGAAGG CTCGCTCGAT TTGGAAAACA 
CGCAACGGCA TCCCCGTCCG TCAGGTTTTA 
GGACGGCACG GTGCATATCG GCAATACGTC 
GCGGCATCAG GCTGTCGGGC AAAATCGACA 
TTAAATATAG GCATCAACTC CGTCGGCGCG 
GTTCAAAGGC AGGTTGGACG GCAGCATCGG 
CGCCCAAAAT CTCTTGGCAA CTCGGCATCG 
AGCCTCGCCA TTGCAAGCGA CCCCGCAAAC 
CGACACCGTC AACATCGCCG CCGGGCAAGG 
ATCTCGAGCT GTTTAAAGAC CGCCTGCTCA 
GCATTCGACC CTTCGCGCAT CGATCCGCAA 
CGGCTCAATA AACCTTGCCG GCGAACTGGC 
AAATGCGGTT TTTACCCGGC ACGTTCAACG 
GCCGACATTG TTTACGAGTC CCGCCACCTT 
GCGGCTGGGG CGGAACATTA TTAAAACAGA 
GCGACCGGCT TAACCTCAAT ATCACCGCAC 
TTCGGACTCG CGGGGTCTTT AAATGTACGC 
GGACGGTGGC ATCCGAACCT TTGAAACCGA 
ACCTGCACAT CGGCAAGGCG GCAGACATCC 
AAAGGTTCGC CCGACACAAG CCGCCCGATA 
CCGCCTTTCG CTGTCGGGCG GAGCGGAGGT 
TGCTGGACGG CACGGGCGTG CAGCACCGCA 
ACGCTGGATG GCAAACCGTT CAAATTCGAT 
CAACAGGGAA CTTACCCGAT GGAAAGGCAG 
GCGGCGCATT CAACCTCAAG CTGCAAAACC 
GCGGAACGCG TGGCGGCAAG TGCGGCAAAT 
CCTCAACCTG CAACACTTTT CTTGGGATAA 
AAGGCGGCGC ACACGGTCTG CATATCGCCG 
CCGCCCTTCG AACACAATCT GGTTTTAAAC 
CGGGCGAAAC GCGCGCGGCT ACCTCAATAT 
CCGTATTGCC CGGCGGGCAG GCTTTGGGTT 
ACGCGCTTTC AAAACGACCG TATCGGAATC 
TTTCGGGCGG ATTAACGCCG ATTTGGACAT 
ATATGGCAAA TGCACCGCTC GGCGGCAGGA 
TTGGGCACAT TGAAGCCCTT TCTGCCCGCC 
CAGCCTGAAT GCCGCCGCGC AAATCGGCGG 
TCAATGCCGC CGTCAACGGC AGCAGCAACT 
ATCACCGTCG GGCAAAGCCG CTCTTTCGAT 
GCTCAACCTG ACCGTTGCCG ATGCCGAAGT 
TCGGACAAAC CGTCAAAGGC AGCCTGAATG 
AGCATCGCCG ATCCGCACTT GGGCGGCAGC 
TTACCGCAAC CAAACCCAAG GCATCATCTT 
CGCATATCGC GGGCAGGAAA TGGGTAATCG 
GAAGGGACGG CGGAACTCTC CGGTACGGTC 
CGATGTCGAT ATCGGCGCGG TGTTCGACAA 
CCAACCGCCG CCTGACGGTT TCCGGCAACA 
CAAAAAGGCA TATCCGTTAC CGGGATGATT 
CGGTTCGCAA AAATCCTCGA TGCCGTCCGT 
TAGGCGAAGT CAAAAAAGAG GCGGCGGCAC 
CTGACTTTAG ACCTCAATGA CGGCATCCGC 
CGTTACCATA GGCGGCAAAC TGACCCTGAC 
TGCGGGGCGT GGGCACGGTC CGCGTCATCA 
GGGCAGGATT TGGACATTAC CAAAGGCACG 
CAACGACCCC AACCTCAACA TCCGCGCCGA 
GTGCGGGCGT GGAAATATTG GGCAGCCTCA 
ACGGCAAACG AACCGATGAG TGAAAAAGAC 
CAACCGCGCC GGCAGTGGCA GCAGCGGCGA 
CCGCCGGCGC GCTGCTTGCC GGGCAAATCA 
GATGATTTGG GCTTTACCAG CAAGCGCAGC 
ACTCAACCCC GCCGAACAGG TGCTGACCGT 
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3951 CGGCAAACAA CTGACCGGCA AACTCTACAT CGGCTACGAA TACAGCATCT 

4 001 CCAGCGCGGA ACAGTCCGTC AAACTGATTT ACCGGCTGAC CCGCGCCATA 

4 051 CAGGCGGTTG CCCGTATCGG CAGCCGTTCG TCGGGCGGCG AGCTGACATA 

4101 CACCATACGT TTCGACCGCT TCTCCGGTTC GGACAAAAAA GACTCCGCCG 

4151 GAAACAGCAA AGGAAAATAA 



This corresponds to the amino acid sequence <SEQ ID 1 1 86; ORF 285. a>: 

a285.pep 

MTDTAPTDTD PTENGTRKMP SEHRPTPPAK 
CFLGWLAGTE AGLRFGLYQI PSWFGVNISS 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 



KRRPLLKLSA ALLSVLILAV 



EGADLKISRF RFAWKPSELM 
SLPDSIDLPA AVYLDRFETG 



LDLKAADTPW 
SLKDVRAELA 
VPSLPDAGLN 



SSSSGSASVG 
IDGGNIRLSG 
FDLTAIPSFS 



GSFVIRQDGT VHIGNTSVAL 



QNLKGTLLDG FDGDNWSIET 
RRSLHITEIS AGDIAIVTKP TPPKEERPPL 
KISMGKAFDK QTVYLERLDA SYRYDRKGHR 
LKKPFALDTA IYTKGGLEGK TIHSTARLSG 
KSVIHPFAES LDKTLEEVLV KGFNINPSAF 
DGIALEGSLD LENTKAG FAD RNGIPVRQVL 
LGRGGIRLSG KIDTEKDILD LNIGINSVGA 



EDVLQTAFKG RLDGSIGIGG TTASPKISWQ LGIGTARTDG 
GQRKLVLDTV NIAAGQGSLT AQGYLELFKD RLLKLDIRSR 



LPAGNINGSI 
PRAAVDLRLG 
GHLSGDLDGG 
RADIKGSRLS 
LDASGGINRE 
WQAMGGSLNL 
GDWDVAYGRN 
LLDGGARFGR 
AAQNITGSLN 
TAPLGGRLNL 



N LAG E LAKE K FTGKMRFLPG 
RNIIKTDGGF GKKGDRLNLN 



TFNGVPIAGS 
ITAPDLSRFG 
IRT FETDLSG AARNLHIGKA ADIRSLDFTL 



SLAIASDPAN 
AFDPSRIDPQ 
ADIVYESRHL 
FGLAGSLNVR 
KGSPDTSRPI 



LSGGAEWDT ADLMLDGTGV QHRIRTHAAM TLDGKPFKFD 



LTRWKGSIGI 
QHFSWDKKTG 
ARGYLNISRQ 
INADLDIGNA 
AAAQIGGRVG 
TVADAEVFRN 



LDIGGAFNLK LQNRMTLEAG 
ISAKGGAHGL HIAELHNFFK 



AERVAASAAN 
PPFEHNLVLN 
SGDAVLPGGQ ALGLNAFSLK TRFQNDRIGI 
FGGNMANAPL GGRITASLPD LGTLKPFLPA 
SPSVNAAVNG SSNYGKINGN 
FLPVGQTVKG SLNAAVTLGG 



INGDKLYYRN QTQGIILDNG 
GMENSGPDVD IGAVFDKYRI 



SLRSHIAGRK WVIDSLKFRH 
LSRPNRRLTV SGNTRLRYSP 



KSSMPSVGDD WVLGEVKKE 
GGKLTLTAQS GGSVRGVGTV 



KTDQGLFGSQ 
FAGYGADVTI 
VSFVGPLNDP 
KLSWLILNRA GSGSSGDNAA 
RNAQTGELNP AEQVLTVGKQ 



ITVGQSRSFD 
SIADPHLGGS 
EGTAELSGTV 
QKGISVTGMI 
LTLDLNDGIR 



AAAPLPVNMN 

RVIKGRYKAY GQDLDITKGT 
NLNIRAERRL SPVGAGVEIL GSLNSPRITL TANEPMSEKD 
LSAAAGALLA GQINDRIGLV DDLGFTSKRS 
LTGKLYIGYE YSISSAEQSV KLIYRLTRAI 
QAVARIGSRS SGGELTYTIR FDRFSGSDKK DSAGNSKGK* 



m285/a285 99.4% identity in 1389 aa overlap 

10 20 30 40 50 60 

m285.pep MTDTAPTDTDPTENGTRKMPSEHRPTPPAKKRRPLLKLST^ALLSVLILAVCFLGWLAGTE 

I I I I M I I I I I I I I I I I I I I I I I I I I I I M t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a285 MTDTAPTDTDPTENGTRKMPSEHRPTPPAKKRRPLLKLSAALLSVLILAVCFLGWLAGTE 

10 20 30 40 50 60 



70 80 90 100 110 120 

m285.pep AGLRFGLYQI PSWFGVNISSQNLKGTLLDGFDGDNWSIETEGADLKISRFRFAWKPSELM 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 
a 285 AGLRFGLYQI PSWFGVNISSQNLKGTLLDGFDGDNWSIETEGADLKISRFRFAWKPSELM 

70 80 90 100 110 120 

. 130 140 150 160 170 180 

m285 . pep RRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDSIDLPAAVYLDRFETGKISMGKAFDK 

I I I I I i I I ! I I I I i I I I I I I I I II i I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

a285 RRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDSIDLPAAVYLDRFETGKISMGKAFDK 

130 140 150 160 170 180 



190 200 210 220 230 240 

m285 . pep QTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSGAASVGLKKPFALDTAIYTKGGLEGK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I : I I I I I I I I I I I I I I I I I I I I I I I I 
a285 QTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSGSASVGLKKPFALDTAIYTKGGLEGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m285.pep TIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIHPFAESLDKTLEEVLVKGFNINPAAF 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I : I I 
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a285 TIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIHPFAESLDKTLEEVLVKGFNINPSAF 

250 260 270 280 290 300 

310 320 330 340 350 360 

m285.pep VPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTKAGFADRNGIPVRQVLGGFVIRQDGT 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I i I I I I I II I I 1 I I I I : I I I I I I I I 
a285 VPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTKAG FADRNGIPVRQVLGSFVIRQDGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m285 . pep VHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGINSVGAE DVLQTAFKGRLDGSIGIGG 

I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
a285 VHIGNTSVALLGRGGIRLSGKIDTEKDILDLNIGINSVGAEDVLQTAFKGRLDGSIGIGG 

370 380 390 400 410 420 

430 440 450 460 470 480 

ra285 . pep TTASPKISWQLGIGTARTDGSLAIASDPANGQRKLVLDTVNIAAGQGSLTAQGYLELFKD 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a28 5 TTASPKISWQLGIGTARTDGSLAIASDPANGQRKLVLDTVNIAAGQGSLTAQGYLELFKD 

430 440 450 460 470 480 

490 500 510 520 530 540 

m285 . pep RLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGELAKEKFTGKMRFLPGTFNGVPIAGS 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I 1 I II I I I I I II I I I I I I I I I 
a285 RLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGELAKEKFTGKMRFLPGTFNGVPIAGS 

490 500 510 520 530 540 

550 560 570 580 590 600 

m285.pep ADIVYESRHLPRAAVDLRLGRNIIKTDGGFGKKGDRLNLNITAPDLSRFGFGLAGSLNVR 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 i 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 i I 

a285 ADIVYESRHLPRAAVDLRLGRNIIKTDGGFGKKGDRLNLNITAPDLSRFG FGLAGSLNVR 

550 560 570 580 590 600 

610 620 630 640 650 660 

m285 . pep GHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRSLDFTLKGSPDTSRPIRADIKGSRLS 
I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I 
a285 GHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRSLDFTLKGSPDTSRPIRADIKGSRLS 

610 620 630 640 650 660 

670 680 690 700 710 720 

m285 . pep LSGGAAWDTADLMLDGTGVQHRIRTHAAMTLDGKPFKFDLDASGGINRELTRWKGSIGI 
I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I II I I I I II I I 
a285 LSGGAEVVDTADLMLDGTGVQHRIRTHAAMTLDGKPFKFDLDASGGINRELTRWKGSIGI 

670 680 690 700 710 720 

730 740 750 760 770 780 

m285 . pep LDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMGGSLNLQHFSWDKKTGISAKGGAHGL 

I I I I I I I I I I i I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
a285 LDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMGGSLNLQHFSWDKKTGISAKGGAHGL 

730 740 750 760 ' 770 780 

790 800 810 820 830 840 

m285 . pep HIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYLNISRQSGDAVLPGGQALGLNAFSLK 

II II I I II I I ! I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I | I I I | M | 
a285 HIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYLNISRQSGDAVLPGGQALGLNAFSLK 

790 800 810 820 830 840 

850 860 870 880 890 900 

m285 . pep TRFQNDRIGILLDGGARFGRINADLGIANAFGGNMANAPLGGRITASLPDLGALKPFLPA 
I I I I I II I I I I I I I I I I I I I I II I I I : I I I I I I I I! I I I I I I I I I ! I I I I I : I I I I I I I 
a285 TRFQNDRIGILLDGGARFGRINADLDIGNAFGGNMANAPLGGRITASLPDLGTLKPFLPA 

850 860 870 880 890 900 

910 920 930 940 950 960 

m285 . pep AAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 
I I I i I I I I I I I I I II I I I I ! I I I I I I I I I I I I I I II II I I I I | | | | | | | | | | | | | | | | | | 
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a285 AAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 

910 920 930 940 950 960 

970 980 990 1000 1010 1020 

m285.pep TVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADPHLGGSINGDKLYYRNQTQGIILDNG 
I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I M I I I I I I M I II I I I I I I I I I I I i I I I I I I I 

a2 8 5 T VADAE V FRN FL PVGQTVKG S LN AAVT LGG S I AD PH LGG S INGDKLYYRNQTQGI I LDNG 

970 980 990 1000 1010 1020 

1030 1040 1050 1060 1070 1080 

m285.pep SLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENSGPDVDIGAVFDKYRILSRPNRRLTV 
I I I t I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a285 SLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENSGPDVDIGAVFDKYRILSRPNRRLTV 

1030 1040 1050 1060 , 1070 1080 

1090 1100 1110 1120 1130 1140 

m285 . pep SGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMPSVGDDVVVLGEVKKEAAAPLPVNMN 
I I I I I I I I I I I I I I I I I I I II I I I I I I I f I I II I I I I I I I I I I I I I I I I I I I I I I I I M | 
a285 SGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMPSVGDDVVVLGEVKKEAAAPLPVNMN 

1090 1100 1110 1120 1130 1140 

1150 1160 1170 1180 1190 1200 

m285.pep LTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVRGVGTVRVIKGRYKAYGQDLDITKGT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a285 LTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVRGVGTVRVIKGRYKAYGQDLDITKGT 

1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

m285 . pep VSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNSPRITLTANEPMSEKDKLSWLILNRA 
I II I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

a285 VSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNSPRITLTANEPMSEKDKLSWLILNRA 
1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

m285 . pep GSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGFTSKRSRNAQTGELN PAEQVLTVGKQ 

I I I I I I I I I I I I I 1 I I I I I II I I I I I I I I M I ! I I I I I I I I I I I M I I I II I I I I i I I I I 

a285 GSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGFTSKRSRNAQTGELN PAEQVLTVGKQ 

1270 1280 1290 1300 1310 1320 

1330 1340 1350 1360 1370 1380 

m285 . pep LTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRFSGSDKK 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
a285 LTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRFSGSDKK 

1330 1340 1350 1360 1370 1380 

1390 

m285.pep DSAGNGKGKX 
I I I I I : I I I I 
a285 DSAGNSKGKX 
1390 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 187>: 

g285-l . 3eq 

1 CTGAAGCTGT CGGCGGCACT GCTGTCTGTC CTGATTTTGG CAGTATGTTT 

51 CCTCGGCTGG ATCGCCGGTA CGGAAGCAGG TTTGCGCTTC GGGCTGTACC 

101 AAATCCCGTC CTGGTTCGGC GTAAACATTT CCTCCCAAAA CCTCAAAGGC 

151 ACACTGCTCG ACGGCTTCGA CGGCGACAAC TGGTCGATAG AAACCGAGGG 

201 GGCAGACCTT AAAATCAGCC GCTTCCGCTT CGCGTGGAAA CCGTCCGAAC 

251 TGATGCGCCG CAGCCTGCAC ATCACCGACA TCTCCGCCGG CGACATCGCC 

301 ATCGTAACCA AACCGACTCC GCCTAAAGAA GAACGCCCGC CTCAAGGCCT 

351 GCCCGACAGC ATAGACCTGC CCGCCGCCGT CTATCTCGAC CGCTTCGAGA 

401 CGGGCAAAAT CAGCATGGGC AAAACCTTTG ACAAACAAAC CGTCTATCTC 

4 51 GAACGCCTCA ACGCGGCATA CCGTTACGAC CGTAAAGGGC ACCGCCTCGA 

501 CCTGAAGGCC GCCGACACGC CGTGGAGCAG TTCGTCGGGG TCAGCCTCGG 
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551 TCGGCTTGAA AAAACCGTTT GCCCTCGATA CCGCCATTTA CACCAAAGGC 

601 GGATTCGAAG GCGAAACCAT ACACAGTACG GCGCGGCTGA GCGGCAGCCT 

651 GAAGGATGTG CGCGCCGAAC TGACGATCGA CGGCGGCAAT ATCCGCCTCT 

701 CGGGAAAATC CGTCATCCAC CCGTTTGCCG AATCATTGGA TAAAACATTG 

751 GAAGAAGTAC TGGTCAAAGG ATTCAACATC AATCCGTCCG CCTTCGTGCC 

801 TTCCCTGCCC GATGCCGGGC TGAATTTCGA CCTGACCGCC ATCCCGTCGT 

851 TTTCAGACGG CATCGCGCTG GAAGGCTCGC TCGATTTGGA AAACACCAAA 

901 GCCGGCTTTG CCGACCGCAA CGGCATCCCC GTCCGTCAGG TTTTGGGCGG 

951 CTTTGTCATC CGGCAGGACG GCACGGTGCA TATCGGCAAT ACGTCCGCCG 

1001 CCCTGCTCGG ACGGGGCGGC ATCAGGCTGT CGGGCAAAAT CGACACCGAA 

1051 AAAGACATCC TTGATTTAAA TATAGGCATC AACTCCGTCG GCGCGGAAGA 

1101 CGTGCTGCAA ACCGCGTTCA AAGGCAGGTT GGACGGCAGC ATCGGCATCG 

1151 GCGGCACGAC CGCCTCGCCC AAAATCTCTT GGCAACTCGG CACCGGCACG 

12 01 GCACGCACGG ACGGCAGCCt cgcCATCGCA AGCGAcCCCG CAAACGAACA 

1251 GCGGAAACTG GTGTTCGACA CCGTCAACAT CTCCGCCGGG GAAGGCAGCC 

1301 TGACCGCGCA AGGCTATCTC GAGCTGTTTA AAGACCGCCT GCTCAAGCTG 

1351 GACATCCGTT CCCGCGCATT CGACCCTTCG CGCATCGATC CGCAATTTCC 

1401 GGCAGGCgat atCAACGGTT CGATTCATCT TGCCGGTGAA CTGGCAAAAG 

14 51 AGAAATTTAC GGGCAAAATG CGTTTTTTGC CCGGTACGTT CAACGGCGTG 

1501 CCGATTGCCG GCAGCGCCGA CATTGTTTAC GAGTCCCGCC ACCTTCCGCG 

1551 CGCCGCCGTC GATTTGCGGT TGGGGCGGAA CATCGTCAAA ACAGACGGCG 

1601 GCTTCGGCAA AAAAGGCGAC CGGCTTAACC TCAATATCAC CGCACCCGAT 

1651 TTATCCCGTT TCGGTTTCGG ACTCGCGGGG TCTTTAAATG TACGCGGACA 

1701 CCTTTCCGGC GATTTGGACG GCGGCATCCG AACCTTTGAA ACCGACCTTT 

17 51 CCGGCACGGC GCGCAACTTA CACATCGGCA AAGCGGCAGA CATCCGTTCG 

1801 CTCGATTTTA CCCTCAAAGG CTCACCCGGC ACAAGCCGCC CGATGCGCGC 

1851 CGATATCAAG GGCGGCCGCC TTTCCCTGTC GGGCGGCGCG GCGGTTGTCG 

1901 ATACCGCCGG CCTGACGCTG GAAGGTACGG GCGCGCAGCA CCGCATCCGC 

1951 ACACACGCCG CCATGACGCT GGACGGCAAA CCGTTCAAAC TCGATTTGGA 

2001 CGCTTCAGGC GGCATCAACA GGGAACTTAC CCGATGGAAA GGCAGCATCG 

2051 GCATCCTCGA CATCGGCGGC GCATTCAACC TCAAGCTGCA AAACCGTATG 

2101 ACGCTCGAAG CCGGTGCGGA ACACGTGGCG GCAAGTGCGG CAAATTGGCA 

2151 GGCAATGGGC GGCAGCCTCA ACCTGCAACA CTTTTCTTGG GACAGGAAAA 

2201 CCGGCATATC GGCAAAAGGC GGCGCACGCG GCCTGCACAT CGCCGAGTTG 

2251 CACAATTTCT TCAAACCGCC CTTCGAACAC AATCTGGTTT TAAACGGCGA 

2301 CTGGGATGTC GCCTACGGGC ACAACGCGCG CGGCTACCTC AATATCAGCC 

2351 GGCAAAGCGG CGATGCCGTA TTGCCCGGCG GGCAGGCTTT GGGTTTGAAC 

2401 GCATTTTCCC TGAAAACGCG CTTTCAAAAC GACCGCATCG GAATCCTGCT 

24 51 TGACGGCGGC GCGCGTTTCG GACGGATTAA CGCCGATTTG GGCATCGGCA 

2501 ACGCCTTCGG CGGCAATATG GCAAATACAC CGCTCGGCGG CAGGATTACA 

2551 GCCTCCCTTC CCGACTTGGG CGCATTGAAG CCCTTTCTGC CCGCCGCCGC 

2601 GCAAAACATT ACCGGCAGCC TGAATGCCTC CGCGCAAATC GGCGGACGGG 

2 651 TAGGCTCTCC GTCCGTCAAT GCCGCCGTCA ACGGTAGCAG CAACTACGGG 

2701 AAAATCAACG GCAATATCAC CGTCGGGCAA AGCCGCTCCT TCGATACCGC 

2751 ACCTTTGGGC GGCAGGCTCA ACCTGACCGT TGCCGATGCC GAAGCATTCC 

2801 GCAACTTCCT ACCGGTCGGA CAAACCGTCA AAGGCAGCCT GAATGCCGCC 

2851 GTAACCCTCG GCGGCAGCAT CGCCGACCCG CACTTGGGCG GCAGTATCAA 

2 901 CGGCGACAAG CTCTATTACC GCAACCAAAC CCAAGGCATC ATCTTGGACA 

2 951 ACGGCTCGCT GCGTTCGCAT ATTGCAGGCA GGAAATGGGT AATCGACAGC 

3001 CTGAAATTCC GGCACGAAGG GACGGCGGAA CTCTCCGGCA CGGTCAGCAT 

3051 GGAAAACAGC GTGCCCGATG TCGATATCGG CGCGGTGTTC GACAAATACC 

3101 GCATCCTGTC CCGCCCCAAC CGCCGCCTGA CGGTTTCCGG CAACACCCGC 

3151 CTGCGCTATT CGCCGCAAAA AGGCATATCC GTTACCGGTA TGATTAAAAC 

3201 TGATCAGGGG CTGTTCGGTT CGCAAAAATC CTCGATGCCG TCCGTCGGCG 

3251 ACGATGTCGT CGTATTGGGC GAAGTCAAGA AAGAGGCGGC GGCATCGCTC 

3301 CCCGTCAATA TGAACCTGAC TTTAGACCTC AATGACGGCA TCCGCTTCTC 

3351 CGGCTACGGC GCGGACGTTA CCATAGGCGG CAAACTGACC CTGACCGCGC 

3401 AACCGGGCGG AAATGTGCGT GGGGTGGGCA CGGTCCGCGT CATCAAAGGG 

34 51 CGTTACAAAG CATACGGGCA GGATTTAGAC ATTACCAAAG GCACAGTCTC 
3501 CTTTGTCGGC CCGCTCAACG ACCCCAACCT GAACATCCGC GCCGAACGCC 

35 51 GCCTTTCCCC CGTCGGTGCG GGCGTGGAAA TATTGGGCAG CCTCAACAGC 
3601 CCGCGCATTA CGCTGACGGC AAACGAACCG ATGAGTGAAA AAGACAAGCT 
3651 CTCCTGGCTC ATCCTCAACC GTGCCGGCAG CGGCAGCAGC GGCGACAATG 
3701 CCGCCCTGTC CGCAGCCGCA GGCGCGCTGC TTGCCGGGCA AATCAACGAC 

37 51 CGCATCGGGC TGGTGGATGA TTTGGGCTTT ACCAGCAAGC GCAGCCGCAA 
3801 CGCGCAAACC GGCGAACTCA ACCCCGCCGA ACAGGTGCTG ACCGTCGGCA 

38 51 AACAACTGAC CGGCAAACTC TACATCGGCT ACGAATACGG CATCTCCAGC 
3901 GCGGAACAGT CCGTCAAACT GATTTACCGG CTGACCCGCG CCATACAGGC 
3951 GGTTGCCCGT ATCGGCAGCC GTTCGTCGGG CGGCGAGCTG ACATACACCA 
4 001 TACGTTTCGA CCGCCTCTTC GGTTCGGACA AAAAAGACTC CGCAGGAAAC 
4051 GGCAAAGGGA AATAA 



s corresponds to the amino acid sequence <SEQ ID 1 188; ORF 285-1. ng>: 

- 1 . pep 
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l LKLSAALLSV LILAVCFLGW IAGTEA GLRF GLYQIPSWFG VNISSQNLKG 

51 TLLDGFDGDN WSIETEGADL KISRFRFAWK PSELMRRSLH ITDISAGDIA 

101 IVTKPTPPKE ERPPQGLPDS IDLPAAVYLD RFETGKISMG KTFDKQTVYL 

151 ERLNAAYRYD RKGHRLDLKA ADTPWSSSSG SASVGLKKPF ALDTAIYTKG 

201 GFEGETIHST ARLSGSLKDV RAELTIDGGN IRLSGKSVIH PFAESLDKTL 

251 EEVLVKGFNI NPSAFVPSLP DAGLNFDLTA IPSFSDGIAL EGSLDLENTK 

301 AGFADRNGIP VRQVLGGFVI RQDGTVHIGN TSAALLGRGG IRLSGKIDTE 

351 KDILDLNIGI NSVGAEDVLQ TAFKGRLDGS IGIGGTTASP KISWQLGTGT 

4 01 ARTDGSLAIA SDPANEQRKL VFDTVNISAG EGSLTAQGYL ELFKDRLLKL 

451 DIRSRAFDPS RIDPQFPAGD INGSIHLAGE LAKEKFTGKM RFLPGTFNGV 

501 PIAGSADIVY ESRHLPRAAV DLRLGRNIVK TDGGFGKKGD RLNLNITAPD 

551 LSRFGFGLAG SLNVRGHLSG DLDGGIRTFE TDLSGTARNL HIGKAADIRS 

601 LDFTLKGSPG TSRPMRADIK GGRLSLSGGA AVVDTAGLTL EGTGAQHRIR 

651 THAAMTLDGK PFKLDLDASG GINRELTRWK GSIGILDIGG AFNLKLQNRM 

701 TLEAGAEHVA ASAANWQAMG GSLNLQHFSW DRKTGISAKG GARGLHIAEL 

751 HNFFKPPFEH NLVLNGDWDV AYGHNARGYL NISRQSGDAV LPGGQALGLN 

801 AFSLKTRFQN DRIGILLDGG ARFGRINADL GIGNAFGGNM ANTPLGGRIT 

851 ASLPDLGALK PFLPAAAQNI TGSLNASAQI GGRVGSPSVN AAVNGSSNYG 

901 KINGNITVGQ SRSFDTAPLG GRLNLTVADA EAFRNFLPVG QTVKGSLNAA 

951 VTLGGSIADP HLGGSINGDK LYYRNQTQGI ILDNGSLRSH IAGRKWVIDS 

1001 LKFRHEGTAE LSGTVSMENS VPDVDIGAVF DKYRILSRPN RRLTVSGNTR 

1051 LRYSPQKGIS VTGMIKTDQG LFGSQKSSMP SVGDDVVVLG EVKKEAAASL 

1101 PVNMNLTLDL NDGIRFSGYG ADVTIGGKLT LTAQPGGNVR GVGTVRVIKG 

1151 RYKAYGQDLD ITKGTVSFVG PLNDPNLNIR AERRLSPVGA GVEILGSLNS 

1201 PRITLTANEP MSEKDKLSWL ILNRAGSGSS GDNAALSAAA GALLAGQIND 

1251 RIGLVDDLGF TSKRSRNAQT GELNPAEQVL TVGKQLTGKL YIGYEYGISS 

1301 AEQSVKLIYR LTRAIQAVAR IGSRSSGGEL TYTIRFDRLF GSDKKDSAGN 

1351 GKGK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 189>: 

m285-l.seq 

1 CTGAAGCTGT CGGCGGCACT GCTGTCTGTC CTGATTTTGG CAGTATGTTT 

51 CCTCGGCTGG CTCGCCGGTA CGGAAGCAGG TTTGCGCTTC GGGCTGTACC 

101 AAATCCCGTC TTGGTTCGGC GTAAACATTT CCTCCCAAAA CCTCAAAGGC 

151 ACGCTGCTCG ACGGCTTCGA CGGCGACAAC TGGTCGATAG AAACCGAGGG 

201 GGCAGACCTT AAAATCAGCC GCTTCCGCTT CGCGTGGAAA CCGTCCGAAC 

251 TGATGCGCCG CAGCCTGCAC ATTACCGAAA TTTCCGCCGG CGACATCGCC 

301 ATCGTTACCA AACCGACTCC GCCTAAAGAA GAACGCCCGC CGCTCAGCCT 

351 TCCCGACAGC ATAGACCTGC CTGCCGCCGT CTATCTCGAC CGCTTCGAGA 

4 01 CGGGCAAAAT CAGCATGGGC AAAGCCTTTG ACAAACAAAC CGTCTATCTC 

451 GAACGGCTGG ATGCTTCATA CCGTTACGAC CGCAAAGGAC ACCGCCTTGA 

501 CCTGAAGGCC GCCGACACGC CGTGGAGCAG TTCGTCGGGG GCGGCCTCGG 

551 TCGGCTTGAA AAAACCGTTT GCCCTCGATA CCGCCATTTA CACCAAAGGC 

601 GGACTCGAAG GCAAAACCAT ACACAGTACG GCTCGGCTGA GCGGCAGCCT 

651 GAAGGATGTG CGCGCCGAAC TGGCGATCGA CGGCGGCAAT ATCCGCCTCT 

701 CGGGAAAATC CGTCATCCAC CCGTTTGCCG AATCATTGGA TAAAACATTG 

751 GAAGAAGTAC TGGTCAAAGG GTTCAACATC AATCCGGCCG CCTTCGTGCC 

801 TTCCCTGCCC GATGCCGGAC TGAATTTCGA CCTGACCGCC ATCCCGTCGT 

851 TTTCAGACGG CATCGCGCTG GAAGGTTCGC TCGATTTGGA AAACACCAAA 

901 GCCGGCTTTG CCGACCGCAA CGGCATCCCC GTCCGTCAGG TTTTAGGCGG 

951 CTTTGTCATC CGGCAGGACG GCACGGTGCA TATCGGCAAT ACGTCCGCCG 

1001 CCCTGCTCGG ACGGGGCGGC ATCAGGCTGT CGGGCAAAAT CGACACCGAA 

1051 AAAGACATCC TCGATTTAAA TATAGGCATC AACTCCGTCG GCGCGGAAGA 

1101 CGTACTGCAA ACCGCGTTCA AAGGCAGGTT GGACGGCAGC ATCGGCATCG 

1151 GTGGCACGAC CGCCTCGCCC AAAATCTCTT GGCAACTCGG CATCGGCACG 

1201 GCGCGCACGG ACGGCAGCCT CGCCATTGCA AGCGACCCAG CAAACGGACA 

12 51 GCGGAAACTG GTGCTCGACA CCGTCAACAT CGCCGCCGGG CAAGGCAGCC 

1301 TGACCGCGCA AGGCTATCTC GAGCTGTTTA AAGACCGCCT GCTCAAGCTG 

1351 GACATCCGTT CCCGCGCATT CGACCCTTCG CGCATCGATC CGCAACTTCC 

1401 GGCAGGCAAT ATCAACGGCT CAATAAACCT TGCCGGCGAA CTGGCAAAAG 

1451 AGAAATTCAC AGGCAAAATG CGGTTTTTAC CCGGCACGTT CAACGGCGTA 

1501 CCGATTGCCG GCAGTGCCGA CATTGTTTAC GAGTCCCGCC ACCTTCCGCG 

1551 TGCCGCCGTC GATTTGCGGC TGGGGCGGAA CATTATTAAA ACAGACGGCG 

1601 GCTTCGGCAA AAAAGGCGAC CGGCTTAACC TCAATATCAC CGCACCCGAT 

1651 TTATCCCGTT TCGGTTTCGG ACTCGCGGGG TCTTTAAATG TACGCGGACA 

1701 CCTTTCCGGT GATTTGGACG GCGGCATCCG AACCTTTGAA ACCGACCTTT 

17 51 CCGGCGCGGC GCGCAACCTG CACATCGGCA AGGCGGCAGA CATCCGTTCG , 

1801 CTCGATTTCA CGCTCAAAGG TTCGCCCGAC ACAAGCCGCC CGATACGCGC 

1851 CGACATCAAA GGCAGCCGCC TTTCGCTGTC GGGCGGAGCG GCGGTTGTCG 

1901 ATACCGCCGA CCTGATGCTG GACGGCACGG GCGTGCAGCA CCGCATCCGC 

1951 ACACACGCCG CCATGACGCT GGATGGCAAA CCGTTCAAAT TCGATTTGGA 

2001 CGCTTCAGGC GGCATCAACA GGGAACTTAC CCGATGGAAA GGCAGCATCG 

2051 GCATCCTCGA CATCGGCGGC GCATTCAACC TCAAGCTGCA AAACCGTATG 

2101 ACGCTCGAAG CCGGTGCGGA ACGCGTGGCG GCAAGTGCGG CAAATTGGCA 
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2151 GGCAATGGGC GGCAGCCTCA ACCTGCAACA CTTTTCTTGG GATAAAAAAA 

2201 CCGGCATATC GGCAAAAGGC GGCGCACACG GTCTGCATAT CGCCGAGTTG 

2251 CACAATTTCT TCAAACCGCC CTTCGAACAC AATCTGGTTT TAAACGGCGA 

2301 CTGGGATGTC GCCTACGGGC GCAACGCGCG CGGCTACCTC AATATCAGCC 

2351 GGCAAAGCGG CGATGCCGTA TTGCCCGGCG GGCAGGCTTT GGGTTTGAAC 

2 4 01 GCATTTTCCC TGAAAACGCG CTTTCAAAAC GACCGCATCG GAATCCTGCT 

2451 TGACGGCGGC GCGCGTTTCG GGCGGATTAA CGCCGATTTG GGCATCGCCA 

2501 ACGCCTTCGG CGGCAATATG GCAAATGCAC CGCTCGGCGG CAGGATTACC 

2551 GCCTCCCTTC CCGACTTGGG CGCATTGAAG CCCTTTCTGC CCGCCGCCGC 

2 601 GCAAAACATT ACCGGCAGCC TGAATGCCGC CGCGCAAATC GGCGGACGGG 

2 651 TAGGCTCTCC GTCCGTCAAT GCCGCCGTCA ACGGCAGCAG CAACTACGGG 
2701 AAAATCAACG GCAACATCAC CGTCGGGCAA AGCCGCTCTT TCGATACCGC 
2751 GCCTTTGGGC GGCAGGCTCA ACCTGACCGT TGCCGATGCC GAAGTATTCC 
2801 GCAACTTCCT ACCGGTCGGA CAAACCGTCA AAGGCAGCCT GAATGCCGCC 
2851 GTAACCCTCG GCGGCAGCAT CGCCGATCCG CACTTGGGCG GCAGCATCAA 
2901 CGGCGACAAA CTCTATTACC GCAACCAAAC CCAAGGCATC ATCTTGGACA 
2951 ACGGCTCGCT GCGTTCGCAT ATCGCGGGCA GGAAATGGGT AATCGACAGC 
3001 CTGAAATTCC GGCACGAAGG GACGGCGGAA CTCTCCGGTA CGGTCGGTAT 
3051 GGAAAACAGC GGACCCGATG TCGATATCGG CGCGGTGTTC GACAAATACC 
3101 GCATCCTGTC CCGCCCCAAC CGCCGCCTGA CGGTTTCCGG CAACACCCGC 
3151 CTGCGCTATT CGCCGCAAAA AGGCATATCC GTTACCGGGA TGATTAAAAC 
3201 GGATCAGGGG CTGTTCGGTT CGCAAAAATC CTCGATGCCG TCCGTCGGCG 
3251 ACGATGTCGT CGTATTAGGC GAAGTCAAAA AAGAGGCGGC GGCACCGCTC 
3301 CCCGTCAATA TGAACCTGAC TTTAGACCTC AATGACGGCA TCCGCTTCGC 
3351 CGGCTACGGC GCGGACGTTA CCATAGGCGG CAAACTGACC CTGACCGCCC 
3401 AATCGGGCGG AAGCGTACGG GGCGTGGGCA CGGTCCGCGT CATCAAAGGG 
34 51 CGTTATAAGG CATACGGGCA GGATTTGGAC ATTACCAAAG GCACGGTCTC 
3501 CTTTGTCGGC CCGCTCAACG ATCCCAACCT CAACATCCGC GCCGAACGCC 
3551 GCCTTTCCCC CGTCGGTGCG GGCGTGGAAA TATTGGGCAG CCTCAACAGC 

3 601 CCGCGCATTA CGCTGACGGC AAACGAACCG ATGAGTGAAA AAGACAAGCT 
3651 CTCTTGGCTC ATCCTCAACC GCGCCGGCAG CGGCAGCAGC GGCGACAATG 
3701 CCGCCCTGTC TGCAGCCGCA GGTGCGCTGC TTGCCGGGCA AATCAACGAC 
3751 CGCATCGGGC TGGTGGATGA TTTGGGCTTT ACCAGCAAGC GCAGCCGCAA 
3801 CGCGCAAACC GGCGAACTCA ACCCCGCCGA ACAGGTGCTG ACCGTCGGCA 
3851 AACAACTGAC CGGCAAACTC TACATCGGCT ACGAATACAG CATCTCCAGC 
3901 GCGGAACAGT CCGTCAAACT GATTTACCGG CTGACCCGCG CCATACAGGC 
3951 GGTTGCCCGT ATCGGCAGCC GTTCGTCGGG CGGCGAGCTG ACATACACCA 

4 001 TACGTTTCGA CCGCTTCTCC GGTTCGGACA AAAAAGACTC CGCCGGAAAC 
4051 GGCAAAGGAA AATAA 

This corresponds to the amino acid sequence <SEQ ID 1190; ORF 285-1 >: 

m285-l .pep 

1 LKLSAALLSV LILAVCFLGW LAGTEA GLRF GLYQIPSWFG VNISSQNLKG 

51 TLLDGFDGDN WSIETEGADL KISRFRFAWK PSELMRRSLH ITEISAGDIA 

101 IVTKPTPPKE ERPPLSLPDS IDLPAAVYLD RFETGKISMG KAFDKQTVYL 

151 ERLDASYRYD RKGHRLDLKA ADTPWSSSSG AASVGLKKPF ALDTAIYTKG 

201 GLEGKTIHST ARLSGSLKDV RAELAIDGGN IRLSGKSVIH PFAESLDKTL 

251 EEVLVKGFNI NPAAFVPSLP DAGLNFDLTA IPSFSDGIAL EGSLDLENTK 

301 AGFADRNGIP VRQVLGGFVI RQDGTVHIGN TSAALLGRGG IRLSGKIDTE 

351 KDILDLNIGI NSVGAEDVLQ TAFKGRLDGS IGIGGTTASP KISWQLGIGT 

4 01 ARTDGSLAIA SDPANGQRKL VLDTVNIAAG QGSLTAQGYL ELFKDRLLKL 

451 DIRSRAFDPS RIDPQLPAGN INGSINLAGE LAKEKFTGKM RFLPGTFNGV 

501 PIAGSADIVY ESRHLPRAAV DLRLGRNIIK TDGGFGKKGD RLNLNITAPD 

551 LSRFGFGLAG SLNVRGHLSG DLDGGIRTFE TDLSGAARNL HIGKAADIRS 

601 LDFTLKGSPD TSRPIRADIK GSRLSLSGGA AVVDTADLML DGTGVQHRIR 

651 THAAMTLDGK PFKFDLDASG GINRELTRWK GSIGILDIGG AFNLKLQNRM 

701 TLEAGAERVA ASAANWQAMG GSLNLQHFSW DKKTGISAKG GAHGLHIAEL 

751 HNFFKPPFEH NLVLNGDWDV AYGRNARGYL NISRQSGDAV LPGGQALGLN 

801 AFSLKTRFQN DRIGILLDGG ARFGRINADL GIANAFGGNM ANAPLGGRIT 

851 ASLPDLGALK PFLPAAAQNI TGSLNAAAQI GGRVGSPSVN AAVNGSSNYG 

901 KINGNITVGQ SRSFDTAPLG GRLNLTVADA EVFRNFLPVG QTVKGSLNAA 

951 VTLGGSIADP HLGGSINGDK LYYRNQTQGI ILDNGSLRSH IAGRKWVIDS 

1001 LKFRHEGTAE LSGTVGMENS GPDVDIGAVF DKYRILSRPN RRLTVSGNTR 

1051 LRYSPQKGIS VTGMIKTDQG LFGSQKSSMP SVGDDVVVLG EVKKEAAAPL 

1101 PVNMNLTLDL NDGIRFAGYG ADVTIGGKLT LTAQSGGSVR GVGTVRVIKG 

1151 RYKAYGQDLD ITKGTVSFVG PLNDPNLNIR AERRLSPVGA GVEILGSLNS 

1201 PRITLTANEP MSEKDKLSWL ILNRAGSGSS GDNAALSAAA GALLAGQIND 

1251 RIGLVDDLGF TSKRSRNAQT GELNPAEQVL TVGKQLTGKL YIGYEYSISS 

1301 AEQSVKLIYR LTRAIQAVAR IGSRSSGGEL TYTIRFDRFS GSDKKDSAGN 

1351 GKGK* 

g285-l/m285-l 96.5% identity in 1354 aa overlap 

10 20 30 40 50 60 
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g285-l.pep LKLSAALLSVLILAVCFLGWIAGTEAGLRFGLYQIPSWFGVNISSQNLKGTLLDGFDGDN 
I II I I I I I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I | | | | II | 
m2 8 5 - 1 LKLS AALLS VL ILAVC FLGWLAGTEAGLRFGLYQI PSWFGVN I S SQNLKGTLLDGFDGDN 

10 20 30 40 50 60 



70 80 90 100 110 120 

g285-l.pep WSIETEGADLKISRFRFAWKPSELMRRSLHITDISAGDIAIVTKPTPPKEERPPQGLPDS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II I I Mill 
m285-l WSIETEGADLKISRFRFAWKPSELMRRSLHITEISAGDIAI VTKPTPPKEERPPLSLPDS 

70 80 90 100 110 120 



130 140 150 160 170 180 

g285-l . pep IDLPAAVYLDRFETGKISMGKTFDKQTVYLERLNAAYRYDRKGHRLDLKAADTPWSSSSG 
I I I I I I I I I I I I I I I I M I I 1 : I I I I I I I I I I I : I : I I I I I I I I I I I I M ( I I I II t I 1 I 
m2 8 5 - 1 I DLPAAVYLDRFETGKI SMGKAFDKQTVYLERLDAS YRYDRKGHRLDLKAADTPWS SSSG 

130 140 150 160 170 180 



190 200 210 220 230 240 

g285-l.pep SASVGLKKPFALDTAI YTKGGFEGETIHSTARLSGSLKDVRAELTIDGGNIRLSGKSVIH 
: I I I I I I I I I I I I I I I I M I I : I I : I I I I I I I I I M I I I I I I M : I I I I I I I I I II I I I I 
m285-l AASVGLKKPFALDTAIYTKGGLEGKTIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIH 

190 200 210 220 230 240 

250 260 270 280 290 300 

g285-l.pep PFAESLDKTLEEVLVKGFNINPSAFVPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTK 
I I I I I 1 I I II I I I I I I I I I I I I : II I I I II I I I I I I I I I I I I I I I I I I I I M I I I M I I ! 
m2 8 5 - 1 PFAESLDKTLEEVLVKGFNINPAAFVPSLPDAGLNFDLTAI PSFSDGI ALEGSLDLENTK 

250 260 270 280 290 300 



310 320 330 340 350 360 

g285-l .pep AGFADRNGIPVRQVLGGFVIRQDGTVHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGI 
I I 1 I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I 
m285-l AGFADRNGIPVRQVLGGFVIRQDGTVHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGI 
310 320 330 340 350 360 



370 380 390 400 410 420 

g285-l.pep NSVGAEDVLQTAFKGRLDGSIGIGGTTASPKISWQLGTGTARTDGSLAIASDPANEQRKL 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m285-l NSVGAEDVLQTAFKGRLDGSIGIGGTTASPKISWQLGIGTARTDGSLAIASDPANGQRKL 
370 380 390 400 410 420 



430 440 450 460 470 480 

g285-l.pep VFDTVNISAGEGSLTAQGYLELFKDRLLKLDIRSRAFDPSRIDPQFPAGDINGSIHLAGE 
I : I I I I I : I I : I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I ! I I : I I I : I I I I I : | | | | 
m285-l VLDTVNIAAGQGSLTAQGYLELFKDRLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGE 
430 440 450 460 470 480 



490 500 510 520 530 540 

g285-l.pep LAKEKFTGKMRFLPGTFNGVPIAGSADIVYESRHLPRAAVDLRLGRNI VKTDGGFGKKGD 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I : I I I I I I I I | I I 
m2 85-l LAKEKFTGKMRFLPGTFNGVPIAGSADIVYESRHLPRAAVDLRLGRNIIKTDGGFGKKGD 
490 500 510 520 530 540 



550 560 570 580 590 600 

g285-l.pep RLNLNITAPDLSRFGFGLAGSLNVRGHLSGDLDGGIRTFETDLSGTARNLHIGFCAADIRS 
I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I : I I I I M I I I I I I I I 
m285-l RLNLNITAPDLSRFGFGLAGSLNVRGHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRS 

550 560 570 580 590 600 

610 620 630 640 650 660 

g2 85-1. pep LDFTLKGSPGTSRPMRADIKGGRLSLSGGAAVVDTAGLTLEGTGAQHRIRTHAAMTLDGK 

I MM II II 1111:111111: MINIMI I IMMMIM INI 

m2 8 5 - 1 LDFTLKGS PDTSRP I RADI KGS RLS LSGGAAVVDTADLMLDGTGVQHRI RTH AAMTLDGK 

610 620 630 640 650 660 

670 680 690 700 710 720 

g285-l.pep PFKLDLDASGGINRELTRWKGSIGILDIGGAFNLKLQNRMTLEAGAEHVAASAANWQAMG 

I I I : I I M I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I : I I I I I I I I I I II 

m285-l PFKFDLDASGGINRELTRWKGSIGILDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMG 
670 680 690 700 • 710 720 



730 740 750 760 770 780 

g285-l.pep GSLNLQHFSWDRKTGISAKGGARGLHIAELHNFFKPPFEHNLVLNGDWDVAYGHNARGYL 

I I I I I I I I I I I M I I I I Ml MMMIMMIMM MIMMIMMMMI II 

m285-l GSLNLQHFSWDKKTGISAKGGAHGLHIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYL 
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730 740 750 760 770 780 

790 800 810 820 830 840 

g2 85-1 .pep NISRQSGDAVLPGGQALGLNAFSLKTRFQNDRIGILLDGGARFGRINADLGIGNAFGGNM 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I : I | I I | | | 
m285-l NISRQSGDAVLPGGQALGLNAFSLKTRFQNDRIGILLDGGARFGRINADLGIANAFGGNM 

790 800 810 820 830 840 

850 860 870 880 890 900 

g285-l.pep ANTPLGGRITASLPDLGALKPFLPAAAQNITGSLNASAQIGGRVGSPSVNAAVNGSSNYG 
I I :! t I I I I I I I I I I I I I I i I I I I M I I I I I M I I f : I I i I I I I I I I I I I I I I I t I I I I I 
m285-l ANAPLGGRITASLPDLGALKPFLPAAAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYG 

850 860 870 880 890 900 

910 920 930 940 950 960 

g2 8 5-1 .pep KINGNITVGQSRSFDTAPLGGRLNLTVADAEAFRNFLPVGQTVKGSLNAAVTLGGSIADP 
I I M I I I I I I t I I I I I II I I I I 1 I I I I II I I : I I I I I I I I I I I I I I I I I I I | | | | | | | | | 
m285-l KINGNITVGQSRS FDTAPLGGRLNLTVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADP 

910 920 930 940 950 960 

970 980 990 1000 1010 1020 

g285-l.pep HLGGSINGDKLYYRNQTQGIILDNGSLRSHIAGRKWVIDSLKFRHEGTAELSGTVSMENS 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I : I I I I 
m28 5-l HLGGSINGDKLYYRNQTQGI ILDNGSLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENS 

970 980 990 1000 1010 1020 

1030 1040 1050 1060 1070 1080 

g285-l .pep VPDVDIGAVFDKYRILSRPNRRLTVSGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMP 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M 1 1 I I I I I I I I I I II I I I I I I I I I I I 

m285-l GPDVDIGAVFDKYRILSRPNRRLTVSGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMP 
1030 1040 1050 1060 1070 1080 

1090 1100 1110 1120 1130 1140 

g285~l.pep SVGDDVVVLGEVKKEAAASLPVNMNLTLDLNDGIRFSGYGADVTIGGKLTLTAQPGGNVR 
I 1 I I I ! I I I I I I I I I I I I I I I I I II I I I I I I I I II : I I I I I I I I I I I I I I I I | ||:|| 
m285-l SVGDDVVVLGEVKKEAAAPLPVNMNLTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVR 

1090 1100 1110 1120 1130 1140 

1150 1160 1170 1180 1190 1200 

g285-l .pep GVGTVRVIKGRYKAYGQDLDITKGTVSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNS 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I | I | | | | 
m285-l GVGTVRVIKGRYKAYGQDLDITKGTVSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNS 

1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

g285-l . pep PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGF 
I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I II I I I I I I I | I I | | I | I | | I | | | | | | 
m285-l PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGF 

1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

g285-l .pep TSKRSRNAQTGELNPAEQVLTVGKQLTGKLYIGYEYGISSAEQSVKLIYRLTRAIQAVAR 
I I I I ! I I M I I I II I I I I I I I I M I I I M I I I I I I I :| I I I I I I I I I I I I I I I I I I I I I I 
m28 5-l TSKRSRNAQTGELNPAEQVLTVGKQLTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVAR 
1270 1280 1290 1300 1310 1320 

1330 1340 1350 

g285-l .pep IGSRSSGGELTYTIRFDRLFGSDKKDSAGNGKGK 
I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I 
m285-l IGSRSSGGELTYTIRFDRFSGSDKKDSAGNGKGKX 

1330 1340 1350 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 1 191>: 

a285-l .seq 

1 CTGAAGCTGT CGGCGGCACT GCTGTCTGTT CTGATTTTGG CAGTATGTTT 

51 CCTCGGCTGG CTCGCCGGCA CGGAAGCGGG TTTGCGCTTC GGGCTGTACC 

101 AAATCCCGTC TTGGTTCGGC GTAAACATTT CCTCCCAAAA CCTCAAAGGC 

151 ACGCTGCTCG ACGGCTTCGA CGGCGACAAC TGGTCGATAG AAACCGAGGG 

201 GGCAGACCTT AAAATCAGCC GCTTCCGCTT CGCGTGGAAA CCGTCCGAAC 

2 51 TGATGCGCCG CAGCCTGCAC ATTACCGAAA TTTCCGCCGG CGACATCGCC 

301 ATCGTTACCA AACCGACTCC GCCTAAAGAA GAACGCCCGC CGCTCAGCCT 

351 TCCCGACAGC ATAGACCTGC CTGCCGCCGT CTATCTCGAC CGCTTCGAGA 

4 01 CGGGCAAAAT CAGCATGGGC AAAGCCTTTG ACAAACAAAC CGTCTATCTC 

4 51 GAACGGCTGG ATGCTTCATA CCGTTACGAC CGCAAAGGAC ACCGCCTCGA 
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501 CCTGAAGGCT GCCGACACGC CGTGGAGCAG TTCGTCGGGG TCAGCCTCGG 

551 TCGGCTTGAA AAAACCGTTT GCCCTCGATA CCGCCATTTA CACCAAAGGC 

601 GGACTCGAAG GCAAAACCAT ACACAGTACG GCTCGGCTGA GCGGCAGCCT 

651 GAAGGATGTG CGCGCCGAAC TGGCGATCGA CGGCGGCAAT ATCCGCCTCT 

701 CGGGAAAATC CGTCATCCAC CCGTTTGCCG AATCATTGGA TAAAACATTG 

7 51 GAAGAAGTAC TGGTCAAAGG GTTCAACATC AATCCGTCCG CCTTCGTGCC 

801 TTCCCTGCCC GATGCCGGGC TGAATTTCGA CCTGACCGCC ATCCCGTCGT 

851 TTTCAGACGG CATCGCGCTG GAAGGCTCGC TCGATTTGGA AAACACCAAA 

901 GCCGGCTTTG CCGACCGCAA CGGCATCCCC GTCCGTCAGG TTTTAGGCAG 

951 CTTTGTCATC CGGCAGGACG GCACGGTGCA TATCGGCAAT ACGTCCGTCG 

1001 CCCTGCTCGG ACGGGGCGGC ATCAGGCTGT CGGGCAAAAT CGACACCGAA 

1051 AAAGACATCC TCGATTTAAA TATAGGCATC AACTCCGTCG GCGCGGAAGA 

1101 CGTACTGCAA ACCGCGTTCA AAGGCAGGTT GGACGGCAGC ATCGGCATCG 

1151 GTGGCACGAC CGCCTCGCCC AAAATCTCTT GGCAACTCGG CATCGGCACG 

1201 GCGCGCACGG ACGGCAGCCT CGCCATTGCA AGCGACCCCG CAAACGGACA 

1251 GCGGAAACTG GTGCTCGACA CCGTCAACAT CGCCGCCGGG CAAGGCAGCC 

1301 TGACCGCGCA AGGCTATCTC GAGCTGTTTA AAGACCGCCT GCTCAAGCTG 

1351 GACATCCGTT CCCGCGCATT CGACCCTTCG CGCATCGATC CGCAACTTCC 

14 01 GGCAGGCAAT ATCAACGGCT CAATAAACCT TGCCGGCGAA CTGGCAAAAG 

14 51 AGAAATTCAC AGGCAAAATG CGGTTTTTAC CCGGCACGTT CAACGGCGTA 

1501 CCGATTGCCG GCAGTGCCGA CATTGTTTAC GAGTCCCGCC ACCTTCCGCG 

1551 TGCCGCCGTC GATTTGCGGC TGGGGCGGAA CATTATTAAA ACAGACGGCG 

1601. GCTTCGGCAA AAAAGGCGAC CGGCTTAACC TCAATATCAC CGCACCCGAT 

1651 TTATCCCGTT TCGGTTTCGG ACTCGCGGGG TCTTTAAATG TACGCGGACA 

1701 CCTTTCCGGC GATTTGGACG GTGGCATCCG AACCTTTGAA ACCGACCTTT 

1751 CCGGCGCGGC GCGCAACCTG CACATCGGCA AGGCGGCAGA CATCCGTTCG 

1801 CTCGATTTCA CGCTCAAAGG TTCGCCCGAC ACAAGCCGCC CGATACGCGC 

1851 CGACATCAAA GGCAGCCGCC TTTCGCTGTC GGGCGGAGCG GAGGTTGTCG 

1901 ATACCGCCGA CCTGATGCTG GACGGCACGG GCGTGCAGCA CCGCATCCGC 

1951 ACACACGCCG CCATGACGCT GGATGGCAAA CCGTTCAAAT TCGATTTGGA 

2001 CGCTTCAGGC GGCATCAACA GGGAACTTAC CCGATGGAAA GGCAGCATCG 

2051 GCATCCTCGA CATCGGCGGC GCATTCAACC TCAAGCTGCA AAACCGTATG 

2101 ACGCTCGAAG CCGGTGCGGA ACGCGTGGCG GCAAGTGCGG CAAATTGGCA 

2151 GGCAATGGGC GGCAGCCTCA ACCTGCAACA CTTTTCTTGG GATAAAAAAA 

2201 CCGGCATATC GGCAAAAGGC GGCGCACACG GTCTGCATAT CGCCGAGTTG 

2251 CACAATTTCT TCAAACCGCC CTTCGAACAC AATCTGGTTT TAAACGGCGA 

2301 CTGGGATGTC GCCTACGGGC GAAACGCGCG CGGCTACCTC AATATCAGCC 

2351 GGCAAAGCGG CGATGCCGTA TTGCCCGGCG GGCAGGCTTT GGGTTTGAAC 

2401 GCATTTTCCC TGAAAACGCG CTTTCAAAAC GACCGTATCG GAATCCTGCT 

24 51 TGACGGCGGC GCGCGTTTCG GGCGGATTAA CGCCGATTTG GACATCGGCA 

2501 ACGCCTTCGG CGGCAATATG GCAAATGCAC CGCTCGGCGG CAGGATTACC 

2551 GCCTCCCTTC CCGACTTGGG CACATTGAAG CCCTTTCTGC CCGCCGCCGC 

2601 GCAAAACATT ACCGGCAGCC TGAATGCCGC CGCGCAAATC GGCGGACGGG 

2 651 TCGGCTCTCC GTCCGTCAAT GCCGCCGTCA ACGGCAGCAG CAACTACGGG 
2701 AAAATCAACG GCAACATCAC CGTCGGGCAA AGCCGCTCTT TCGATACCGC 
2751 GCCTTTGGGC GGCAGGCTCA ACCTGACCGT TGCCGATGCC GAAGTATTCC 
2801 GCAACTTCCT ACCGGTCGGA CAAACCGTCA AAGGCAGCCT GAATGCCGCC 
2851 GTAACCCTCG GCGGCAGCAT CGCCGATCCG CACTTGGGCG GCAGCATCAA 
2901 CGGCGACAAA CTCTATTACC GCAACCAAAC CCAAGGCATC ATCTTGGACA 
2951 ACGGCTCGCT GCGTTCGCAT ATCGCGGGCA GGAAATGGGT AATCGACAGC 
3001 CTGAAATTCC GGCACGAAGG GACGGCGGAA CTCTCCGGTA CGGTCGGTAT 
3051 GGAAAACAGC GGACCCGATG TCGATATCGG CGCGGTGTTC GACAAATACC 
3101 GCATCCTGTC CCGCCCCAAC CGCCGCCTGA CGGTTTCCGG CAACACCCGC 
3151 CTGCGCTATT CGCCGCAAAA AGGCATATCC GTTACCGGGA TGATTAAAAC 
3201 GGATCAGGGG CTGTTCGGTT CGCAAAAATC CTCGATGCCG TCCGTCGGCG 
3251 ACGATGTCGT CGTATTAGGC GAAGTCAAAA AAGAGGCGGC GGCACCGCTC 
3301 CCCGTCAATA TGAACCTGAC TTTAGACCTC AATGACGGCA TCCGCTTCGC 
3351 CGGCTACGGC GCGGACGTTA CCATAGGCGG CAAACTGACC CTGACCGCCC 
3401 AATCGGGCGG AAGCGTGCGG GGCGTGGGCA CGGTCCGCGT CATCAAAGGG 
34 51 CGTTATAAGG CATACGGGCA GGATTTGGAC ATTACCAAAG GCACGGTCTC 
3501 CTTTGTCGGC CCGCTCAACG ACCCCAACCT CAACATCCGC GCCGAACGCC 

3 551 GCCTTTCCCC CGTCGGTGCG GGCGTGGAAA TATTGGGCAG CCTCAACAGT 
3601 CCGCGCATTA CGCTGACGGC AAACGAACCG ATGAGTGAAA AAGACAAGCT 
3 651 CTCCTGGCTC ATCCTCAACC GCGCCGGCAG TGGCAGCAGC GGCGACAATG 
3701 CCGCCCTGTC CGCAGCCGCC GGCGCGCTGC TTGCCGGGCA AATCAACGAC 
37 51 CGCATCGGGC TGGTGGATGA TTTGGGCTTT ACCAGCAAGC GCAGCCGCAA 
3801 CGCGCAAACC GGCGAACTCA ACCCCGCCGA ACAGGTGCTG ACCGTCGGCA 
3851 AACAACTGAC CGGCAAACTC TACATCGGCT ACGAATACAG CATCTCCAGC 

3 901 GCGGAACAGT CCGTCAAACT GATTTACCGG CTGACCCGCG CCATACAGGC 
3951 GGTTGCCCGT ATCGGCAGCC GTTCGTCGGG CGGCGAGCTG ACATACACCA 
4001 TACGTTTCGA CCGCTTCTCC GGTTCGGACA AAAAAGACTC CGCCGGAAAC 

4 051 AGCAAAGGAA AATAA 



This corresponds to the amino acid sequence <SEQ ID 1 192; ORF 285-1. a>: 
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a285-l .pep 

1 LKLSAALLSV LILAVCFLGW LAGTEA GLRF GLYQIPSWFG VNISSQNLKG 

51 TLLDGFDGDN WSIETEGADL KISRFRFAWK PSELMRRSLH ITEISAGDIA 

101 IVTKPTPPKE ERPPLSLPDS IDLPAAVYLD RFETGKISMG KAFDKQTVYL 

151 ERLDASYRYD RKGHRLDLKA ADTPWSSSSG SASVGLKKPF ALDTAIYTKG 

201 GLEGKTIHST ARLSGSLKDV RAELAIDGGN IRLSGKSVIH PFAESLDKTL 

251 EEVLVKGFNI NPSAFVPSLP DAGLNFDLTA IPSFSDGIAL EGSLDLENTK 

301 AGFADRNGIP VRQVLGSFVI RQDGTVHIGN TSVALLGRGG IRLSGKIDTE 

351 KDILDLNIGI NSVGAEDVLQ TAFKGRLDGS IGIGGTTASP KISWQLGIGT 

4 01 ARTDGSLAIA SDPANGQRKL VLDTVNIAAG QGSLTAQGYL ELFKDRLLKL 

451 DIRSRAFDPS RIDPQLPAGN INGSINLAGE LAKEKFTGKM RFLPGTFNGV 

501 PIAGSADIVY ESRHLPRAAV DLRLGRNIIK TDGGFGKKGD RLNLNITAPD 

551 LSRFGFGLAG SLNVRGHLSG DLDGGIRTFE TDLSGAARNL HIGKAADIRS 

601 LDFTLKGSPD TSRPIRADIK GSRLSLSGGA EVVDTADLML DGTGVQHRIR 

651 THAAMTLDGK PFKFDLDASG GINRELTRWK GSIGILDIGG AFNLKLQNRM 

701 TLEAGAERVA ASAANWQAMG GSLNLQHFSW DKKTGISAKG GAHGLHIAEL 

751 HNFFKPPFEH NLVLNGDWDV AYGRNARGYL NISRQSGDAV LPGGQALGLN 

801 AFSLKTRFQN DRIGILLDGG ARFGRINADL DIGNAFGGNM ANAPLGGRIT 

851 ASLPDLGTLK PFLPAAAQNI TGSLNAAAQI GGRVGSPSVN AAVNGSSNYG 

901 KINGNITVGQ SRSFDTAPLG GRLNLTVADA EVFRNFLPVG QTVKGSLNAA 

951 VTLGGSIADP HLGGSINGDK LYYRNQTQGI ILDNGSLRSH IAGRKWVIDS 

1001 LKFRHEGTAE LSGTVGMENS GPDVDIGAVF DKYRILSRPN RRLTVSGNTR 

1051 LRYSPQKGIS VTGMIKTDQG LFGSQKSSMP SVGDDVVVLG EVKKEAAAPL 

1101 PVNMNLTLDL NDGIRFAGYG ADVTIGGKLT LTAQSGGSVR GVGTVRVIKG 

1151 RYKAYGQDLD ITKGTVSFVG PLNDPNLNIR AERRLSPVGA GVEILGSLNS 

1201 PRITLTANEP MSEKDKLSWL ILNRAGSGSS GDNAALSAAA GALLAGQIND 

1251 RIGLVDDLGF TSKRSRNAQT GELNPAEQVL TVGKQLTGKL YIGYEYSISS 

1301 AEQSVKLIYR LTRAIQAVAR IGSRSSGGEL TYTIRFDRFS GSDKKDSAGN 

1351 SKGK* 

a285-l/m285-l 99.3% identity in 1354 aa overlap 

10 20 30 40 50 . 60 

a285-l.pep LKLSAALLSVLILAVCFLGWLAGTEAGLRFGLYQIPSWFGVNISSQNLKGTLLDGFDGDN 
I | I I I I I I I I I I I I I I I I I i I ! I I I I I I I 1 II I I I i I I I I I I I I I I I I I I I I I I I I I I II 

H1285-1 LKLSAALLSVLILAVCFLGWLAGTEAGLRFGLYQIPSWFGVNISSQNLKGTLLDGFDGDN 
10 20 30 40 50 60 

70 80 90 100 110 120 

a285-l.pep WSIETEGADLKISRFRFAWKPSELMRRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDS 
| | | | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II 
m285-l WS IETEGADLKI SRFRFAWKPSELMRRSLHITEI SAGDIAI VTKPTPPKEERPPLSLPDS 

70 80 90 100 110 120 

130 140 150 160 170 180 

a285-l.pep IDLPAAVYLDRFETGKI SMGKAFDKQTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSG 
| | | | | I | I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i 
m285-l IDLPAAVYLDRFETGKISMGKAFDKQTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSG 

130 140 150 160 170 180 

190 200 210 220 230 240 

a285-l .pep SASVGLKKPFALDTAIYTKGGLEGKTIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIH 
: | I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
m28 5-l AASVGLKKPFALDTAIYTKGGLEGKTIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIH 

190 200 210 220 230 240 

250 260 270 280 290 300 

a285-l.pep PFAESLDKTLEEVLVKGFNINPSAFVPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTK 
I I I I I I I I I I I I I I I I ! I I I I I : I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
m285-l PFAESLDKTLEEVLVKGFNINPAAFVPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTK 

250 260 270 280 290 300 

310 320 330 340 350 360 

a285-l .pep AGFADRNGIPVRQVLGSFVIRQDGTVHIGNTSVALLGRGGIRLSGKIDTEKDILDLNIGI 
I | | | | | | | | | | | I I I i : I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m285-l AGFADRNGIPVRQVLGGFVIRQDGTVHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGI 

310 320 330 340 350 360 

370 380 390 400 410 420 

a285-l.pep NSVGAEDVLQTAFKGRLDGSIGIGGTTASPKISWQLGIGTARTDGSLAIASDPANGQRKL 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i l ( 1 1 i i 1 1 1 1 1 1 1 i I 

m285-l NSVGAEDVLQTAFKGRLDGSIGIGGTTASPKISWQLGIGTARTDGSLAIASDPANGQRKL 
370 380 390 400 410 420 



430 440 450 460 470 480 
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a285-l .pep VLDTVNI AAGQGSLTAQGYLELFKDRLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGE 
I I I I I I II I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 
m28 5-l VLDTVNIAAGQGSLTAQGYLELFKDRLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGE 
430 440 450 460 470 480 



490 500 510 520 530 540 

a285-l.pep LAKEKFTGKMRFLPGTFNGVPIAGSADIVYESRHLPRAAVDLRLGRNIIKTDGGFGKKGD 
I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I II I I I I I f I I I I ! I I I I I I I I ( I 
m2 8 5 - 1 LAKEKFTGKMRFLPGTFNGVPI AGSADI VYESRHLPRAAVDLRLGRNI I KTDGGFGKKGD 

490 500 510 520 530 540 



550 560 570 580 590 600 

a285-l.pep RLNLNITAPDLSRFGFGLAGSLNVRGHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I II I I I I I 
m285-l RLNLNITAPDLSRFGFGLAGSLNVRGHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRS 

550 560 570 580 590 600 



610 620 630 640 650 660 

a285-l.pep LDFTLKGSPDTSRPIRADIKGSRLSLSGGAEVVDTADLMLDGTGVQHRIRTHAAMTLDGK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
m285~l LDFTLKGSPDTSRPIRADIKGSRLSLSGGAAVVDTADLMLDGTGVQHRIRTHAAMTLDGK 

610 620 630 640 650 660 



670 680 690 700 710 720 

a285-l .pep PFKFDLDASGGINRELTRWKGSIGILDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMG 
I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I i I I i i I I I I I I I I I I I I I I I 
m2 8 5-l PFKFDLDASGGINRELTRWKGSIGILDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMG 

670 680 690 700 710 720 



730 740 750 760 770 780 

a285-l.pep GSLNLQHFSWDKKTGI SAKGGAHGLHI AELHNFFKPPFEHNLVLNGDWDVAYGRNARGYL 
I I I I I I I I I I I I I I I II I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
m28 5-l GSLNLQHFSWDKKTGI SAKGGAHGLHI AELHNFFKPPFEHNLVLNGDWDVAYGRNARGYL 

730 740 750 760 770 780 



790 800 810 820 830 840 

a2 8 5 - 1 . pep NI SRQSGDAVLPGGQALGLNAFSLKTRFQNDRIGILLDGGARFGRINADLDIGNAFGGNM 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I II I I I I I I I Clllllll 
m285-l NISRQSGDAVLPGGQALGLNAFSLKTRFQNDRIGILLDGGARFGRINADLGIANAFGGNM 

790 800 810 820 830 840 



850 860 870 880 890 900 

a285-l.pep ANAPLGGRITASLPDLGTLKPFLPAAAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYG 
I I I I I I I I I I I 1 I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m285-l ANAPLGGRITASLPDLGALKPFLPAAAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYG 
850 860 870 880 890 900 



910 920 930 940 950 960 

a285-l.pep KINGNITVGQSRSFDTAPLGGRLNLTVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADP 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I i II I I 
n\285-l KINGNITVGQSRSFDTAPLGGRLNLTVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADP 

910 920 930 940 950 960 

970 980 990 1000 1010 1020 

a285-l .pep HLGGSINGDKLYYRNQTQGI ILDNGSLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENS 

II I I I I I I I II I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I 1 > I I 
m285-l HLGGSINGDKLYYRNQTQGI I LDNGSLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENS 

970 980 990 1000 1010 1020 



1030 1040 1050 1060 1070 1080 

a28 5-l .pep GPDVDIGAVFDKYRILSRPNRRLTVSGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II 
m285-l GPDVDIGAVFDKYRILSRPNRRLTVSGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMP 

1030 1040 1050 1060 1070 1080 



1090 1100 1110 1120 1130 1140 

a2 8 5-1 .pep SVGDDVVVLGEVKKEAAAPLPVNMNLTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVR 
I I I M I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I 
m28 5-l SVGDDVVVLGEVKKEAAAPLPVNMNLTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVR 

1090 1100 1110 1120 1130 1140 



1150 1160 1170 1180 1190 1200 

a285-l.pep GVGTVRVIKGRYKAYGQDLDITKGTVSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNS 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m285-l GVGTVRVIKGRYKAYGQDLDITKGTVSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNS 



WO 99/57280 



PCT/US99/09346 



666 



1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

a285-l . pep PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGF 
II I I ( I I I I I I I I I I I I I I 1 I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M 
m285-l PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGF 

1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

a285-l.pep TSKRSRNAQTGELNPAEQVLTVGKQLTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVAR 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
m285-l TSKRSRNAQTGELNPAEQVLTVGKQLTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVAR 

1270 1280 1290 1300 1310 1320 

1330 1340 1350 

a285-l.pep IGSRSSGGELTYTIRFDRFSGSDKKDSAGNSKGKX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I 
m28 5-l IGSRSSGGELTYTIRFDRFSGSDKKDSAGNGKGKX 
1330 1340 1350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 193>: 

g286 . seq 

1 atgcagaaca ccggtaccat gatgatcaaa ccgaccgccc tgctcctgcc 

51 ggctttattt ttctttccgc acgcatacgc gcctgccgcc gacctttccg 

101 aaaacaaggc ggcgggtttc gcattgttca aaagcaaaag ccccgacacc 

151 gaatcagtca aattaaaacc caaattcccc gtccgcatcg acacgcagga 

201 cagtgaaatc aaagatatgg tcgaagaaca cctgccgctc atcacgcagc 

251 agcaggaaga ggttttggat aaggaacaga cgggattcct tgccgaagaa 

301 gcaccggaca acgttaaaac aatgctccgc agcaaaggct atttcagcag 

351 caaggtcagc ctgacggaaa aagacggagc ttatacggtg cacatcacac 

401 cgggcccgcg caccaaaatc gccaacgtcg gcgtcgccat cctcggcgac 

451 atcctttcag acggcaacct cgccgaatac taccgcaacg cgctggaaaa 

501 ctggcagcag ccggtaggca gcgatttcga tcaggacagt tgggaaaaca 

551 gcaaaacttc cgtcctcggc gcggtaacgc gcaaaggcta cccgcttgcc 

601 aagctcggca acacccgggc ggccgtcaac cccgataccg ccaccgccga 

651 tttgaacgtc gtcgtggaca gcggccgccc cattgccttc ggcgactttg 

701 aaatcaccgg cacacagcgt taccccgaac aaaccgtctc cggcctggcg 

751 cgcttccaac cgggcacgcc ctacgacctc gacctgctgc tcgacttcca 

801 acaggcgctc gaacaaaacg ggcattattc cggcgcgtcc gtacaagccg 

851 acttcgaccg cctcccaagg ggaccgcgtc cccgtcaaag tcagcgtaac 

901 cgaggtcaaa cgccacaaac tcgaaaccgg catccgcctc gattcggaat 

951 acggtttggg cggcaaaatc gcctacgact attacaacct cttcaacaaa 

1001 ggctatatcg gctcggtcgt ctgggatatg gacaaatacg aaaccacgct 

1051 tgccgccggc atcagccagc cgcgcaacta tcggggcaac tactggacaa 

1101 gcaacgtttc ctacaaccgt tcgaccaccc aaaacctcga aaaacgcgcc 

1151 ttctccggcg gcatctggta tgtgcgcgac cgcgcgggca tcgatgccag 

1201 gctgggggcg gaatttctcg cagaaggccg gaaaatcccc ggctcggatg 

1251 tcgatttggg caacagccac gccacgatgc tgaccgcctc ttggaaacgc 

1301 cagctgctca acaacgtgct gcaccccgaa aacggccatt acctcgacgg 

1351 caaaatcggg acgactttgg gcacattcct gtcctccacc gcgctaatcc 

1401 gcacctctgc ccgcgcaggt tatttcttca cgcccgaaaa caaaaaactc 

1451 ggcacgttca tcatacgcgg acaagcgggt tacaccgttg cacgcgacaa 

1501 tgccgatgtc ccctcggggc tgatgttccg cagcggcggc gcgtcttccg 

1551 tgcgcggtta cgaacttga 

This corresponds to the amino acid sequence <SEQ ID 1 194; ORF 286.ng>: 

g286.pep 

1 MQNTGTMMIK PTALLLPALF FFPHAYAPA A DLSENKAAGF ALFKSKSPDT 

51 ESVKLKPKFP VRIDTQDSEI KDMVEEHLPL ITQQQEEVLD KEQTGFLAEE 

101 APDNVKTMLR SKGYFSSKVS LTEKDGAYTV HITPGPRTKI ANVGVAILGD 

151 ILSDGNLAEY YRNALENWQQ PVGSDFDQDS WENSKTSVLG AVTRKGYPLA 

201 KLGNTRAAVN PDTATADLNV VVDSGRPIAF GDFEITGTQR YPEQTVSGLA 

251 RFQPGTPYDL DLLLDFQQAL EQNGHYSGAS VQADFDRLPR GPRPRQSQRN 

301 RGQTPQTRNR HPPRFGIRFG RQNRLRLLQP LQQRLYRLGR LGYGQIRNHA 

351 CRRHQPAAQL SGQLLDKQRF LQPFDHPKPR KTRLLRRHLV CARPRGHRCQ 

4 01 AGGGISRRRP ENPRLGCRFG QQPRHDADRL LETPAAQQRA APRKRPLPRR 
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4 51 QNRDDFGHIP VLHRANPHLC PRRLFLHARK QKTRHVHHTR TSGLHRCTRQ 

501 CRCPLGADVP QRRRVFRARL RT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 195>: 

m286.seq 

1 ATGCACGACA CCCGTACCAT GATGATCAAA CCGACCGCCC TGCTCCTGCC 

51 GGCTTTATTT TTCTTTCCGC ACGCATACGC GCCTGCCGCC GACCTTTCCG 

101 AAAACAAGGC GGCGGGTTTC GCATTGTTCA AAAACAAAAG CCCCGACACC 

151 GAATCAGTCA AATTAAAACC CAAATTCCCC GTCCTCATCG ACACGCAGGA 

201 CAGTGAAATC AAAGATATGG TCGAAGAACA CCTGCCGCTC ATCACGCAGC 

251 AGCAGGAAGA AGTATTGGAC AAGGAACAGA CGGGCTTCCT CGCCGAAGAA 

301 GCGCCGGACA ACGTTAAAAC GATGCTCCGC AGCAAAGGCT ATTTCAGCAG . 

351 CAAAGTCAGC CTGACGGAAA AAGACGGAGC TTATACGGTA CACATCACAC 

4 01 CGGGCCCGCG CACCAAAATC GCCAACGTCG GCGTCGCCAT CCTCGGCGAC 

4 51 ATCCTTTCAG ACGGCAACCT CGCCGAATAC TACCGCAACG CGCTGGAAAA 

501 CTGGCAGCAG CCGGTAGGCA GCGATTTCGA TCAGGACAGT TGGGAAAACA 

551 GCAAAACTTC CGTCCTCGGC GCGGTAACGC GCAAAGCCTA CCCGCTTGCC 

601 AAGCTCGGCA ATACGCAGGC GGCCGTCAAC CCCGATACCG CCACCGCCGA 

651 TTTGAACGTC GTCGTGGACA GCGGCCGCCC CATCGCCTTC GGCGACTTTG 

7 01 AAATCACCGG CACACAGCGT TACCCCGAAC AAATCGTCTC CGGCCTTGCG 

7 51 CGTTTCCAGC CCGGTATGCC GTACGACCTC GACCTGCTGC TCGACTTCCA 

801 ACAGGCGCTC GAACAAAACG GGCATTATTC CGGCGCGTCC GTACAAGCCG 

851 ACTTCGACCG CCTCCAAGGC GACCGCGTCC CCGTCAAAGT CAGCGTAACC 

901 GAGGTCAAAC GCCACAAACT CGAAACCGGC ATCCGCCTCG ATTCGGAATA 

951 CGGTTTGGGC GGCAAAATCG CCTACGACTA TTACAACCTC TTCAACAAAG 

1001 GCTATATCGG TTCGGTCGTC TGGGATATGG ACAAATACGA AACCACGCTT 

1051 GCCGCCGGCA TCAGCCAGCC GCGCAACTAT CGGGGCAACT ACTGGACAAG 

1101 CAACGTTTCC TACAACCGTT CGACCACCCA AAACCTCGAA AAACGCGCCT 

1151 TCTCCGGCGG CGTCTGGTAT GTGCGCGACC GCGCGGGCAT CGATGCCAGG 

1201 CTGGGGGCGG AATTTCTCGC AGAAGGCCGG AAAATCCCCG GCTCGGCTGT 

1251 CGATTTGGGC AACAGCCACG CCACGATGCT GACCGCCTCT TGGAAACGCC 

1301 AGCTGCTCAA CAACGTGCTG CATCCCGAAA ACGGCCATTA CCTCGACGGC 

1351 AAAATCGGTA CGACTTTGGG CACATTCCTG TCCTCCACCG CGCTGATCCG 

14 01 CACCTCTGCC CGTGCAGGTT ATTTCTTCAC GCCCGAAAAC AAAAAACTCG 

14 51 GCACGTTCAT CATACGCGGA CAAGCGGGTT ACACCGTTGC CCGCGACAAT 

1501 GCCGACGTTC CTTCAGGGCT GATGTTCCGC AGCGGCGGCG CGTCTTCCGT 

1551 GCGCGGTTAC GAACTCGACA GCATCGGACT TGCCGGCCCG AACGGATCGG 

1601 TCCTGCCCGA ACGCGCCCTC CTGGTGGGCA GCCTGGAATA CCAACTGCCG 

1651 TTTACGCGCA CCCTTTCCGG CGCGGTGTTC CACGATATGG GCGATGCCGC 

1701 CGCCAATTTC AAACGTATGA AGCTGAAACA CGGTTCGGGA CTGGGCGTGC 

17 51 GCTGGTTCAG CCCGCTTGCG CCGTTTTCCT TCGACATCGC CTACGGGCAC 

1801 AGCGATAAGA AAATCCGCTG GCACATCAGC TTGGGAACGC GCTTCTAA 

This corresponds to the amino acid sequence <SEQ ID 1 196; ORF 286>: 

m286.pep 

1 MHDTRTMMIK PTALLLPALF FFPHAYAPA A DLSENKAAGF ALFKNKSPDT 

51 ESVKLKPKFP VLIDTQDSEI KDMVEEHLPL ITQQQEEVLD KEQTGFLAEE 

101 APDNVKTMLR SKGYFSSKVS LTEKDGAYTV HITPGPRTKI ANVGVAILGD 

151 ILSDGNLAEY YRNALENWQQ PVGSDFDQDS WENSKTSVLG AVTRKAYPLA 

201 KLGNTQAAVN PDTATADLNV VVDSGRPIAF GDFEITGTQR YPEQIVSGLA 

251 RFQPGMPYDL DLLLDFQQAL EQNGHYSGAS VQADFDRLQG DRVPVKVSVT 

301 EVKRHKLETG IRLDSEYGLG GKIAYDYYNL FNKGYIGSVV WDMDKYETTL 

351 • AAGISQPRNY RGNYWTSNVS YNRSTTQNLE KRAFSGGVWY VRDRAGIDAR 

401 LGAEFLAEGR KIPGSAVDLG NSHATMLTAS WKRQLLNNVL HPENGHYLDG 

4 51 KIGTTLGTFL SSTALIRTSA RAG YF FT PEN KKLGTFIIRG QAGYTVARDN 

501 ADVPSGLMFR SGGASSVRGY ELDSIGLAGP NGSVLPERAL LVGSLEYQLP 

551 FTRTLSGAVF HDMGDAAANF KRMKLKHGSG LGVRWFSPLA PFSFDIAYGH 

601 SDKKIRWHIS LGTRF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m286/g286 95.9% identity in 293 aa overlap 
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10 20 30 40 50 60 

m286.pep MHDTRTMMIKPTALLLPALFFFPHAYAPAADLSENKAAGFALFKNKSPDTESVKLKPKFP 
I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I : I I I I I I I I I 1 I I I I I 
g 28 6 MQNTGTMMIKPTALLLPALFFFPHAYAPAADLSENKAAGFALFKSKSPDTESVKLKPKFP 

10 20 30 40 50 60 



70 80 90 100 110 120 

m28 6.pep VLIDTQDSEIKDMVEEHLPLITQQQEEVLDKEQTGFLAEEAPDNVKTMLRSKGYFSSKVS 
t I I I I ! I I I I I I I I I I I I I I I I i I ! I I I I I I I I I I I I I ! I I I 11 II I I M I I I I I I I I I 
g 28 6 VRIDTQDSEIKDMVEEHLPLITQQQEEVLDKEQTGFLAEEAPDNVKTMLRSKGYFSSKVS 

70 80 90 100 110 120 



130 140 150 160 170 180 

m286.pep LTEKDGAYTVHITPGPRTKIANVGVAILGDILSDGNLAEYYRNALENWQQPVGSDFDQDS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I 
g 28 6 LTEKDGAYTVHITPGPRTKIANVGVAILGDILSDGNLAEYYRNALENWQQPVGSDFDQDS 

130 140 150 160 170 180 



190 200 210 220 230 240 

m28 6 . pep WENSKTSVLGAVTRKAYPLAKLGNTQAAVNPDTATADLNVVVDSGRPIAFGDFEITGTQR 
I I I I I I I I I II I I I I : I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I 
g 28 6 WENSKTSVLGAVTRKGYPLAKLGNTRAAVNPDTATADLNVVVDSGRPIAFGDFEITGTQR 

190 200 210 220 230 240 

250 260 270 280 290 299 

m286.pep YPEQIVSGLARFQPGMPYDLDLLLDFQQALEQNGHYSGASVQADFDRL-QGDRVPVKVSV 

I I I I I I I I I I i I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I : I I 
g28 6 YPEQTVSGLARFQPGTPYDLDLLLDFQQALEQNGHYSGASVQADFDRLPRGPRPRQSQRN 

250 260 270 280 290 300 



300 310 320 330 340 350 359 

m286 .pep TEVKRHKLETGIRLDSEYGLGGKIAYDYYNLFNKGYIGSVVWDMDKYETTLAAGISQPRN 

g286 RGQTPQTRNRHPPRFGIRFGRQNRLRLLQPLQQRLYRLGRLGYGQIRNHACRRHQPAAQL 

310 320 330 340 350 360 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 197>: 

a286 . seq 

1 AT GC AC GACA CCCGTACCAT GATGATTAAA CCGACCGCCC TGCTCCTGCC 

51 GGCTTTATTT TTCTTTCCGC ACGCATACGC GCCTGCCGCC GACCTTTCCG 

101 AAAACAAGGC GGCGGGTTTC GCATTGTTCA AAAACAAAAG CCCCGACACC 

151 GAATCAGTTA AATTAAAACC CAAATTCCCC GTCCGCATCG ACACGCAGGA 

201 TAGTGAAATC AAAGATATGG TCGAAGAACA CCTGCCGCTC ATCACGCAGC 

251 AGCAGGAAGA AGTATTGGAC AAGGAACAGA CGGGCTTCCT CGCCGAAGAA 

301 GCACCGGACA ACGTTAAAAC AATGCTCCGC AGCAAAGGCT ATTTCAGCAG 

351 CAAAGTCAGC CTGACGGAAA AAGACGGAGC TTATACGGTA CACATCACAC 

4 01 CGGGCCCGCG CACCAAAATC GCCAACGTCG GCGTCGCCAT CCTCGGCGAC 

4 51 ATCCTTTCAG ACGGCAACCT CGCCGAATAC TACCGCAACG CGCTGGAAAA 

501 CTGGCAGCAG CCGGTAGGCA GTGATTTCGA TCAGGACAGT TGGGAAAACA 

551 GCAAAACTTC CGTCCTCGGC GCGGTAACGC GCAAAGCCTA CCCGCTTGCC 

601 AAGCTCGGCA ACACCCGGGC GGCCGTCAAC CCCGATACCG CCACCGCCGA 

651 TTTGAACGTC GTCGTGGACA GCGGCCGCCC CATCGCCTTC GGCGACTTTG 

701 AAATTACCGG CACGCAGCGT TACCCCGAAC AAATCGTCTC CGGCTTGGCG 

751 CGCTTCCAAC CGGGCACGCC CTACGACCTC GACCTGCTGC TCGACTTCCA 

801 ACAGGCGCTC GAACAAAACG GGCATTATTC .CGGCGCGTCC GTACAAGCCG 

851 ACTTCGACCG CCTCCAAGGC GACCGCGTCC CCGTCAAAGT CAGCGTAACC 

901 GAGGTCAAAC GCCACAAGCT CGAAACCGGC ATCCGCCTCG ATTCGGAATA 

951 CGGTTTGGGC GGCAAAATCG CCTACGACTA TTACAACCTC TTCAACAAAG 

1001 GCTATATCGG TTCGGTCGTC TGGGATATGG ACAAATACGA AACCACGCTT 

1051 GCCGCCGGCA TCAGCCAGCC GCGCAACTAT CGGGGCAACT ACTGGACAAG 

1101 CAACGTTTCC TACAACCGTT CGACCACCCA AAACCTCGAA AAACGCGCCT 

1151 TCTCCGGCGG CATCTGGTAT GTGCGCGACC GCGCGGGCAT CGATGCCAGG 

1201 CTGGGGGCGG AGTTTCTCGC AGAAGGCCGG AAAATCCCCG GCTCGGATAT 

1251 CGATTTGGGC AACAGCCACG CCACGATGCT GACCGCCTCT TGGAAACGCC 

1301 AGCTGCTCAA CAACGTGCTG CATCCCGAAA ACGGCCATTA CCTCGACGGC 
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1351 AAAATCGGTA CGACTTTGGG CGCATTCCTG TCCTCCACCG CGCTGATCCG 

14 01 CACCTCTGCC CGCGCAGGTT ATTTCTTCAC GCCCGAAAAC AAAAAACTCG 

14 51 GCACGTTCAT CATACGCGGA CAAGCGGGTT ACACCGTTGC CCGCGACAAT 

1501 GCCAACGTTC CTTCAGGGCT GATGTTCCGC AGCGGCGGCG CGTCTTCCGT 

1551 GCGCGGTTAC GAACTCGACA GCATCGGGCT TGCCGGCCCG AACGGATCGG 

1601 TCCTGCCCGA ACGCGCCCTC TTGGTGGGCA GCCTGGAATA CCAACTGCCG 

1651 TTTACGCGCA CCCTTTCCGG CGCGGTGTTC CACGATATGG GCGACGCCGC 

1701 CGCCAATTTC AAACGTATGA AGCTGAAACA CGGTTCGGGA CTGGGCGTGC 

1751 GCTGGTTCAG CCCGCTCGCG CCGTTTTCCT TCGACATCGC CTACGGGCAC 

1801 AGCGACAAGA AAATCCGCTG GCACATCAGC TTGGGAACGC GCTTCTAA 

This corresponds to the amino acid sequence <SEQ ID 1 198; ORP 286.a>: 

a286.pep 

1 MHDTRTMMIK PTALLLPALF FFPHAYAPA A DLSENKAAGF ALFKNKSPDT 

51 ESVKLKPKFP VRIDTQDSEI KDMVEEHLPL ITQQQEEVLD KEQTGFLAEE 

101 APDNVKTMLR SKGYFSSKVS LTEKDGAYTV HITPGPRTKI ANVGVAILGD 

151 ILSDGNLAEY YRNALENWQQ PVGSDFDQDS WENSKTSVLG AVTRKAYPLA 

201 KLGNTRAAVN PDTATADLNV VVDSGRPIAF GDFEITGTQR YPEQIVSGLA 

251 RFQPGTPYDL DLLLDFQQAL EQNGHYSGAS VQADFDRLQG DRVPVKVSVT 

301 EVKRHKLETG IRLDSEYGLG GKIAYDYYNL FNKGYIGSVV WDMDKYETTL 

351 AAGISQPRNY RGNYWTSNVS YNRSTTQNLE KRAFSGGIWY VRDRAGIDAR 

4 01 LGAEFLAEGR KIPGSDIDLG NSHATMLTAS WKRQLLNNVL HPENGHYLDG 

451 KIGTTLGAFL SSTALIRTSA RAGYFFT PEN KKLGTF1IRG QAGYTVARDN 

501 ANVPSGLMFR SGGASSVRGY ELDSIGLAGP NGSVLPERAL LVGSLEYQLP 

551 FTRTLSGAVF HDMGDAAANF KRMKLKHGSG LGVRWFSPLA PFSFDIAYGH 

601 SDKKIRWHIS LGTRF* 

m286/a286 98.7% identity in 615 aa overlap 

10 20 30 40 50 60 

m2 8 6 . pep MHDTRTMMIKPTALLLPALFFFPHAYAPAADLSENKAAGFALFKNKSPDTESVKLKPKFP 
I I I I I I I ! I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I i I I I I I I 
a 28 6 MHDTRTMMIKPTALLLPALFFFPHAYAPAADLSENKAAGFALFKNKSPDTESVKLKPKFP 

10 20 30 40 50 60 

70 80 90 100 110 120 

m28 6 . pep VLIDTQDSEIKDMVEEHLPLITQQQEEVLDKEQTGFLAEEAPDNVKTMLRSKGYFSSKVS 
I I I I ! I I I I I I I I i I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I M I I ! I I I I I I I I I 
a28 6 VRIDTQDSEIKDiyiVEEHLPLITQQQEEVLDKEQTGFLAEEAPDNVKTMLRSKGYFSSKVS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m28 6 . pep LTEKDGAYTVHITPGPRTKIANVGVAILGDILSDGNLAEY YRNALENWQQ PVGSDFDQDS 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a28 6 LTEKDGAYTVHI T PGPRTKI ANVGVAILGD I LSDGNLAEYYRNALENWQQ PVGSDFDQDS 

130 140 '■" 150 160 170 180 

190 200 210 220 230 240 

ra28 6.pep WENSKTSVLGAVTRKAYPLAKLGNTQAAVNPDTATADLNVWDSGRPIAFGDFEITGTQR 
I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I ! I I I 1 I I I I I I I I t I I I I I t I I t I 
a28 6 WENSKTSVLGAVTRKAYPLAKLGNTRAAVNPDTATADLNVWDSGRPIAFGDFEITGTQR 

190 200 210 220 230 240 

250 260 270 280 290 300 

m2 8 6 . pep YPEQIVSGLARFQPGMPYDLDLLLDFQQALEQNGHYSGASVQADFDRLQGDRVPVKVSVT 
I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I 1 I I I I I I I I I I I I I 
a28 6 YPEQIVSGLARFQPGTPYDLDLLLDFQQALEQNGHYSGASVQADFDRLQGDRVPVKVSVT 

250 260 270 280 290 300 

310 320 330 340 350 360 

m286.pep EVKRHKLETG IRLDSEYGLGGKIAYDYYNLFNKGYIGSVVWDMDKYETTLAAG I SQPRNY 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I II I I I I I 1 I I I I I I I 
a28 6 EVKRHKLETG IRLDSEYGLGGKIAYDYYNLFNKGYIGSVVWDMDKYETTLAAG I SQPRNY 

310 320 330 340 350 360 

370 380 3.90 400 410 420 
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m286.pep RGNYWTSNVSYNRSTTQNLEKRAFSGGVWYVRDRAGIDARLGAE FLAEGRKIPGSAVDLG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I i 
a 286 RGNYWTSNVSYNRSTTQNLEKRAFSGGIWYVRDRAGIDARLGAEFLAEGRKIPGSDIDLG 

370 380 390 400 410 420 

430 440 450 4 60 470 480 

m286 . pep NSHATMLTASWKRQLLNNVLHPENGHYLDGKIGTTLGTFLSSTALIRTSARAGYFFTPEN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 
a 28 6 NSHATMLTASWKRQLLNNVLHPENGHYLDGKIGTTLGAFLSSTALIRTSARAGYFFTPEN 

430 440 450 460 470 480 

490 500 510 520 530 540 

m28 6.pep KKLGTFIIRGQAGYTVARDNADVPSGLMFRSGGASSVRGYELDSIGLAGPNGSVLPERAL 
I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J 
a 28 6 KKLGTFIIRGQAGYTVARDNANVPSGLMFRSGGASSVRGYELDSIGLAGPNGSVLPERAL 

490 500 510 520 530 540 

550 560 570 580 590 600 

m28 6.pep LVGSLEYQLPFTRTLSGAVFHDMGDAAAN FKRMKLKHGSGLGVRWFSPLAPFSFDIAYGH 

I I I I I I I I I I I I I I I I II t I I I I I I I ! I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

a286 LVGSLEYQLPFTRTLSGAVFHDMGDAAANFKRMKLKHGSGLGVRWFSPLAPFSFDIAYGH 

550 560 570 580 590 600 

610 

m28 6 . pep S DKK I RWH I SLGTRFX 

III Mill III Mill 
a286 SDKKI RWH I SLGTRFX 

610 



The following partial DNA sequence was 

g287 . seq 

1 atgtttaaac gcagtgtgat 

51 ctgtgggggc ggcggtggcg 

101 cgtcaaaacc ggccgccccc 

151 ctgccgaaag aaaagaaaga 

201 cgatacgcag gacgcaaccg 

251 tttcggcaga aaatacaggc 

301 aaaaatgaag acgcgggggc 

351 atccgcaaat caaacaggga 

401 cccccgcgtc aaaccctgcc 

451 acgaacgtgg gcaattctgt 

501 gttgacccac tgtaaaggcg 

551 aagaagcacc gtcaaaatca 

601 attaagcgat ataaaaaaga 

651 tgctgacagg gtaaaaaagg 

701 cggacaaacc acctactcgt 

751 gagattccgc tgattcccgt 

801 ggaagcggtc agcctgacgg 

851 ggaattaccg gtatctgact 

901 tatgccctcc gtgtgcaagg 

951 cacggccgtg tacaacggcg 

1001 gtccgtaccc gtccggaggc 

1051 aaatctgtgg acggcattat 

1101 gcaaaaattc aaagccgcca 

1151 cggaaaatgg cggcggggat 

1201 gaggaagtgg cgggaaaata 

1251 cggattcggc gtgtttgccg 



identified in N. gonorrhoeae <SEQ ID 1 199>: 



tgcaatggct 
gatcgcccga 
gttgttgctg 
tgaggaggca 
ccggagaagg 
aatggcggtg 
gcaaaatgat 
acaaccaacc 
cctgcgaatg 
tgtgattgac 
attcttgtaa 
gaatttgaaa 
cgagcaacgg 
atggaactaa 
tctgcacggt 
caatcaggcc 
ggcattccgg 
tacggggcgg 
cgaaccggca 
aagtgctgca 
aggtttgccg 
cgacagcggc 
tcgatggaaa 
gtttccggaa 
cagctatcgc 
gcaaaaaaga 



tgtatttttc 
tgtcaagtcg 
aaaatgccgg 
gcgggcggtg 
cagccaagat 
cggcaacaac 
atgccgcaaa 
cgccggttct 
gcggtagcga 
ggaccgtcgc 
tggtgataat 
aattaagtga 
gagaattttg 
caaatatatc 
cgaggaggtc 
gatacgctga 
caatatcttc 
aaaaattgcc 
aaaggcgaaa 
tttccatatg 
caaaagtcga 
gatgatttgc 
cggctttaag 
ggttttacgg 
ccgacagatg 
tcgggattga 



ccctttcagc 
gcggacacgc 
ggaaggggtg 
cgccgcaagc 
atggcggcag 
ggacaacccc 
atgccgccga 
tcagattccg 
ttttggaagg 
aaaatataac 
ttattggatg 
tgaagaaaaa 
tcggtttggt 
atcttctata 
gcttccggcc 
ttgtggatgg 
gcgcccgaag 
cggcggatcg 
tgcttgttgg 
gaaaacggcc 
tttcggcagc 
atatgggtac 
gggacttgga 
cccggccggc 
ctgaaaaggg 



This corresponds to the amino acid sequence <SEQ ID 1200; ORF 287.ng>: 

g287.pep 

1 MFKRSVIAMA CIFPLSA CGG GGGGSPDVKS ADTPSKPAAP VVAENAGEGV 

51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP 

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR 



WO 99/57280 



PCT/US99/09346 



671 



151 TNVGNSWID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLSDEEK 

201 IKRYKKDEQR ENFVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA 

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS 

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS 

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG 

4 01 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1201>: 

m287.seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGGCAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

201 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC GGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGGATG 

4 01 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 

4 51 GACGGAATGC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA ATGCCGGCAA 

501 " TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 

551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC 

601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC 

651 GCAAAATATA ACGTTGACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA 

7 01 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT 

7 51 GATGCAGACA AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT 

801 TGTCGGTTTG GTTGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA 

851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT 

901 GCACGGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA 

951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC 

1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC 

1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTCAAGGCGA 

1101 ACCGGCAAAA GGCGAAATGC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG 

1151 TACTGCATTT CCATACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG 

1201 TTTGCCGCAA AAGTCGATTT CGGCAGCAAA TCTGTGGACG GCATTATCGA 

1251 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAATTCAAA GCCGCCATCG 

1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT 

1351 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG 

14 01 CTATCGCCCG ACAGATGCGG AAAAGGGCGG ATTCGGCGTG TTTGCCGGCA 

14 51 AAAAAGAGCA GGATTGA 

This corresponds to the amino acid sequence <SEQ ID 1202; ORF 287>: 

m287 .pep 

1 MFKRSVIAMA CIFALSA CGG GGGGSPDVKS ADTLSKPAAP VVSEKETEAK 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

401 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 

4 51 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m287/g287 70.1% identity in 499 aa overlap 

10 20 30 40 49 

m287.pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSE KETEA 

I I I I I I I I I I I I I I I I j I I I I II I I I I I I I I I 11111111:1 I : I I 

g287 MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 
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50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 

I I I I : I I : : : I I I I I I I ! I I I I I I I I I : I : II I I I M I I I I I I I I I I 
g 2 8 7 AGGAPQADTQD- - ATAGEGSQDMAAVS AENTGNGGAATT DN PKNEDAGAQN DMPQNAA — 

70 80 90 100 110 

110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 



g287 



170 180 190 200 210 220 229 

m287 .pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 
: : I I I : I I I I I I I I I I I I I I I I I I I I I : I I I : : : : I : I : I I I I I I I 1 I I I I I I I I I 
g287 -ESANQTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSVVIDGPSQNITLTHCKGDS 
120 130 140 150 160 170 



230 240 250 260 270 280 289 

m287 .pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYI IFYKP 

I : I : I : I I I I : I II I II I I I : I j : I I I I : : : I I I I I I I I : I I : i I I I I 
g287 CNGDNLLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTD 
180 190 200 210 220 230 



290 300 310 320 330 340 349 

m287 .pep KPTSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 
II : I I I I I I I II I I I : I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I 11 I I I I 

g 287 KPPT RSARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

240 250 260 270 280 290 



350 360 370 380 390 400 409 

m287 . pep YGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGS 
I I I II I I I I I I I 1 I I I I I I I I I I I I : I : I I I I I I I! I I I I II I I I I : I I I I I i I I I i I 
g 287 YGAEKLPGGSYALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS 

300 310 320 330 340 350 



410 420 430 440 450 460 469 

m287 . pep KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : M I I I : I I I I I I I I I I I II I I I 
g 287 KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 

360 370 380 390 400 410 

470 480 489 

m287 .pep PTDAEKGGFGVFAGKKEQDX 
I M II I II MIMI I 
g2 8 7 PTDAEKGGFGVFAGKKDRDX 

4 20 4 30 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1203>: 

a287.seq 

1 ATGTTTAAAC GCAGTGTGAT TGCAATGGCT TGTATTGTTG CCCTTTCAGC 

51 CTGTGGGGGC GGCGGTGGCG GATCGCCCGA TGTTAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTACTG AAGATGTCGG GGAAGAGGTG 

151 CTGCCGAAAG AAAAGAAAGA TGAGGAGGCG GTGAGTGGTG CGCCGCAAGC 

201 CGATACGCAG GACGCAACCG CCGGAAAAGG CGGTCAAGAT ATGGCGGCAG 

251 TTTCGGCAGA AAATACAGGC AATGGCGGTG CGGCAACAAC GGATAATCCC 

301 GAAAATAAAG ACGAGGGACC GCAAAATGAT ATGCCGCAAA ATGCCGCCGA 

351 TACAGATAGT TCGACACCGA ATCACACCCC TGCACCGAAT ATGCCAACCA 

4 01 G AG AT AT GGG AAACCAAGCA CCGGATGCCG GGGAATCGGC ACAACCGGCA 

4 51 AACCAACCGG ATATGGCAAA TGCGGCGGAC GGAATGCAGG GGGACGATCC 

501 GTCGGCAGGG GAAAATGCCG GCAATACGGC AGATCAAGCT GCAAATCAAG 

551 CTGAAAACAA TCAAGTCGGC GGCTCTCAAA ATCCTGCCTC TTCAACCAAT 

601 CCTAACGCCA CGAATGGCGG CAGCGATTTT GGAAGGATAA ATGTAGCTAA 

651 TGGCATCAAG CTTGACAGCG GTTCGGAAAA TGTAACGTTG ACACATTGTA 

701 AAGACAAAGT ATGCGATAGA GATTTCTTAG ATGAAGAAGC ACCACCAAAA 

7 51 TCAGAATTTG AAAAATTAAG TGATGAAGAA AAAATTAATA AATATAAAAA 
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801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 



AGACGAGCAA 
AGAATGGAAC 
TCTTCATCTG 
GGCCGAGATG 
ATGGGGAAGC 
GAAGGGAATT 
ATCGTATGCC 
CGGGCACGGC 
GGCCGTCCGT 
CAGCAAATCT 
GTACGCAAAA 
TGGACGGAAA 
CGGCGAAGAA 
AGGGCGGATT 



CGAGAGAATT 
TAACAAATAT 
CGCGATTCAG 
CCGCTGATTC 
GGTCAGCCTG 
ACCGGTATCT 
CTCAGTGTGC 
CGTGTACAAC 
CCCCGTCCGG 
GTGGACGGCA 
ATTCAAAGCC 
ATGGCGGCGG 
GTGGCGGGAA 
CGGCGTGTTT 



TTGTCGGTTT 
GTCATCATTT 
GCGTTCTGCA 
CCGTCAATCA 
ACGGGGCATT 
GACTTACGGG 
AAGGCGAACC 
GGCGAAGTGC 
AGGCAGGTTT 
TTATCGACAG 
GTTATCGATG 
GGATGTTTCC 
AATACAGCTA 
GCCGGCAAAA 



GGTTGCTGAC 
ATAAAGACAA 
CGGTCGAGGC 
GGCGGATACG 
CCGGCAATAT 
GCGGAAAAAT 
GGCAAAAGGC 
TGCATTTCCA 
GCCGCAAAAG 
CGGCGATGAT 
GAAACGGCTT 
GGAAGGTTTT 
TCGCCCGACA 
AAGAGCAGGA 



AGGGTAGAAA 
GTCCGCTTCA 
GGTCGCTTCC 
CTGATTGTCG 
CTTCGCGCCC 
TGTCCGGCGG 
GAAATGCTTG 
TATGGAAAAC 
TCGATTTCGG 
TTGCATATGG 
TAAGGGGACT 
ACGGCCCGGC 
GATGCGGAAA 
TTGA 



This corresponds to the amino acid sequence <SEQ ID 1204; ORF 287.a>: 

a287.pep 

MFKRSVIAMA CIVALSACGG GGGGSPDVKS ADTLSKPAAP VVTEDVGEEV 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 

m287/a287 



m287 .pep 
a287 

m287 .pep 
a287 

m287 .pep 
a287 

m287 .pep 
a287 



LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 

ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 

NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNPASSTN 

PNATNGGSDF GRINVANGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 

SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS 

SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 

EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN 

GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT 

WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 

77.2% identity in 501 aa overlap 

10 20 30 40 49 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

Mtlllllllll I I ! I I I I I II I I I I I II I I I I tl I I I I I I: I I: II 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPMPVVTEDVGEEVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 
MM : j I : : : I : I I I I I I I M I I I I I I : I : I I t : I : I I II II I II I I I 
VSGAPQADTQ — DATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADT 
70 80 90 100 110 

110 120 130 140 150 160 169 

DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 

| || | || || I I II : : I III II I I M II I I i I I I I I II II I I M I I I M : I II I I I 
DSSTPNHTPAPNMPTRDMGNQAPDAGESAQP7^NQPDMANAADGMQGDDPSAG-ENAGNTA 
120 130 140 150 160 170 

170 180 190 200 210 220 229 

AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 

MINI I I I : : i I : : I : : I I : I I I I : I I I : : : I I I : : I : I : I : I I I I I I 
DQAANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKV 
180 190 200 210 220 230 



230 240 250 260 270 280 289 

m287.pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYI I FYKP 

| : MUM: II I I I I I I I : M : : I I I I : :: I I I I I I I I : : I I : I : I : I I 
a 287 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQREN FVGLVADRVEKNGTNKYVIIYKD 

240 250 260 270 280 290 

290 300 310 320 330 340 

m287 . pep KP — TSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 

I : I I I I II M I II M I I I I I I I I I I I I M II I I I I I I I I I I I I I I M I M M M M 
a 287 KSASSSSARFRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRY 

300 310 320 330 340 350 
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350 360 370 380 390 400 

m287 pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

I I II I I i I I I I I I I I I I I I I I I I I I I I : I I I I I I I Ill I : I I I I I I I I I 

a2 87 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
3 60 370 380 390 4 00 410 

410 420 430 440 450 460 

ra287 pep GSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS 

I | | | I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I : I I I I I : I I I I I I Ml 

a 287 GSKSVDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS 
420 430 440 450 460 470 

470 480 489 

m287 .pep YR PT DAEKGG FGV FAGKKEQDX 

I I I I I I I I I I I I I I I I I I I I I I 
a 2 8 7 YR PT DAEKGG FGV FAGKKEQ DX 

480 490 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1205>: 

g288.seq 

1 atgcacaccg gacaggcggt aagccgggtt ctgtctcgga cagtcattcc 

51 tctaggcata ccgttgccgg tatgctcaag caacctaccc gaacgctcgg 

101 cgggcagcgt cattgcgttc tgtttggtct tgctccgaat ggggtttggc 

151 ctgccgcata ttgttaccaa atgcgcggtg cgcccttacc gcaccttttc 

201 acccttgcct gtgctgccaa agcagccatc ggcggttttg ctttctgttc 

251 cactttccgt cgcgttaccg cgcccggccg ttaaccggca ttctaccctg 

301 cggagcccgg actttcctcc ccgtatgcct tacgcgatac gcggcgactg 

351 tctgcccgtc ccgtgtgcgg cgcggattat aacacgaaac gcaaaaatgc 

401 cgtctgaaac ggtacaggtt tcagacggca tacagcctaa actacacacc 

4 51 ctgtttcagg ctggcttcga tgaagccgtc caagtcgccg tccaatacgg 

501 ctttgtggtt gccgacttcg tagcctgtac gcaagtcttt gatgcgtga 

This corresponds to the amino acid sequence <SEQ ID 1206; ORF 288.ng>: 

g288.pep 

1 MHTGQAVSRV LSRTVIPLGI PLPVCSSNLP ERSA GSVIAF CLVLLRMGFG 

51 LPHIVTKCAV RPYRTFSPLP VLPKQPSAVL LSVPLSVALP RPAVNRHSTL 

101 RSPDFPPRMP YAIRGDCLPV PCAARIITRN AKMPSETVQV SDGIQPKLHT 

151 LFQAGFDEAV QVAVQYGFVV ADFVACTQVF DA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1207>: 

m288.seq 

1 ATGCACACCG GACAGGCGGT AAGCCGGGTT CTGTCTCGGA CAGTCATTCC 

51 TCTAGGCATA CCGTTACCGG TATGCTCAAG CAACCTACCC GAACGCTCGG 

101 CGGGCAGCGT CATTGCGTTC TGTTTGGTCT TGCTCCGAAT GGGGTTTGGC 

151 CTGCCGCATA TTGTTACCAA ATGCGCGGTG CGCCCTTACC GCACCTTTTC 

201 ACCCTTACCT GTGCTGCCAA AGCAGCCATC GGCGGTTTTG CTTTCTGTTC 

251 CACTTTCCGT CGCGTTACCG CGCCCGGCCG TTAACCGGCA TTCTACCCTG 

301 CGGAGCCCGG ACTTTCCTCC CCGTATGCCT TACGCGATAC GCGGCGACTG 

351 TCTGCCCGTC CCGTGTGCGG CGCGGATTAT AACACGAAAC ACAAAAATGC 

4 01 CGTCTGAAAC GGTACAGGTT TCAGACGGCA TACAGCCTAA ACTACACGCC 

4 51 CTGTTTCAGG CTGGCTTCGA TGAAGCCGTC CAAGTCGCCA TCCAATACGG 

501 CTTTGGTGTT GCCGACTTCG TAGCCTGTAC GCAAGTCTTT GATACGTGA 

This corresponds to the amino acid sequence <SEQ ID 1208; ORF 288>: 

m288 .pep 

1 MHTGQAVSRV LSRTVIPLGI PLPVCSSNLP ERSA GSVIAF CLVLLRMGFG 

51 LPHIVTKCAV RPYRTFSPLP VLPKQPSAVL LSVPLSVALP RPAVNRHSTL 

101 RSPDFPPRMP YAIRGDCLPV PCAARIITRN TKMPSETVQV SDGIQPKLHA 

151 LFQAGFDEAV QVAIQYGFGV ADFVACTQVF DT+ 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

m288/g288 97.8% identity in 181 aa overlap 

10 20 30 40 50 60 

m288.pep MHTGQAVSRVLSRTVIPLGIPLPVCSSNLPERSAGSVIAFCLVLLRMGFGLPHIVTKCAV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I M I I 

g 288 MHTGQAVSRVLSRTVIPLGIPLPVCSSNLPERSAGSVIAFCLVLLRMGFGLPHIVTKCAV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m288 . pep RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFPPRMPYAIRGDCLPV 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 ! 1 1 

g 288 RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFPPRMPYAIRGDCLPV 

70 80 90 100 110 120 

130 140 150 160 170 180 

m288.pep PCAARIITRNTKMPSETVQVSDGIQPKLHALFQAGFDEAVQVAIQYGFGVADFVACTQVF 
I | | | I I I I I I : I I I I I I I I M I I I I M I I : I I I I II I I I I I II : I I I I II I I I I I II I 1 
g 288 PCAARI ITRNAKMPSETVQVSDGIQPKLHTLFQAGFDEAVQVAVQYGFVVADFVACTQVF 

130 140 150 160 170 180 



m288.pep DTX 
I : I 

g288 DAX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1209>: 

a288. seq 

1 ATGCACACCG GACAGGCGGT AAGCCGGGTT CTGTCTCGGA CAGTCATTCC 

51 TCTAGGCATA CCGTTGCCGG TATGCTCAAG CAACCTACCC GAACGCTCGG 

101 CGGGCAGCGT CATTGCGTTC TGTTTGGTCT TGCTCCGAAT GGGGTTTGGC 

151 CTGCCGCATA TTGTTACCAA ATGCGCGGTG CGCCCTTACC GCACCTTTTC 

201 ACCCTTGCCT GTGCTGCCAA AGCAGCCATC GGCGGTTTTG CTTTCTGTTC 

251 CACTTTCCGT CGCGTTACCG CGCCCGGCCG TTAACCGGCA TTCTACCCTG 

301 CGGAGCCCGG ACTTTCCTCC CCGTATGCCT TACGCGATAC GCGGCGACTG 

351 TCTGCCCGTC CCGTGTGCGG CGCGGATTAT AACACGAAAC GCAAAAATGC 

4 01 CGTCTGAAAC GGTACAGGTT TCAGACGGCA TACAGCCTAA ACTACACGCC 

4 51 CTGTTTCAGG CTGGCTTCGA TAAAGCCGTC CAAGTCGCCG TCCAATACGG 

501 CTTTGGTGTT GCCGACTTCG TAGCCTGTGC GCAAGTCTTT AATGCGTGA 



This corresponds to the amino acid sequence <SEQ ID 1210; ORF 288. a>: 

a288 .pep 

1 MHTGQAVSRV LSRTVIPLGI PLPVCSSNLP ERSAG SVIAF CLVLLRMGFG 

51 LPHIVTKCAV RPYRTFSPLP VLPKQPSAVL LSVPLSVALP RPAVNRHSTL 

101 RSPDFPPRMP YAIRGDCLPV PCAARI ITRN AKMPSETVQV SDGIQPKLHA 

151 LFQAGFDKAV QVAVQYGFGV ADFVACAQVF NA* 



m288/a288 97.2% identity in 181 aa overlap 

10 20 30 40 50 60 

m288.pep MHTGQAVSRVLSRTVIPLGIPLPVCSSNLPERSAGSVIAFCLVLLRMG FGLPHIVTKCAV 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I i I I 
a288 MHTGQAVSRVLSRTVIPLGIPLPVCSSNLPERSAGSVIAFCLVLLRMG FGLPHIVTKCAV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m288 . pep RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFPPRMPYAIRGDCLPV 
I | I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a288 RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFPPRMPYAIRGDCLPV 

70 80 90 100 110 120 



m288 .pep 



130 140 150 160 170 180 

PCAARI ITRNTKMPSETVQVSDGIQPKLHALFQAGFDEAVQVAIQYGFGVADFVACTQVF 
I | I I I I I I I I : I I I I M I I I I I I I I I I I I I I I I I I I I : i I I I I : I I I M I I I I I II : I I I 
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a2 88 PCAARIITRNAKMPSETVQVSDGIQPKLHALFQAGFDKAVQVAVQYGFGVADFVACAQVF 

130 140 150 160 170 180 



m288.pep DTX 
a288 NAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121 1>: 

g290.seq 

1 atggcaaaaa tgatgaaatg ggcggctgtt gcggcggtcg cggcggcagc 

51 ggtttggggc ggatggtctt atctgaagcc cgaaccgcag gctgcttata 

101 ttacggaagc ggtcaggcgc ggcgatatca gccggacggt ttccgcgacg 

151 ggcgagattt cgccgtccaa cctggtatcg gtcggcgcgc aggcttcggg 

201 gcagattaaa aagctttatg tcaaactcgg gcaacaggtc aaaaagggcg 

251 atttgattgc ggaaatcaat tcgaccacgc agaccaacac gatcgatatg 

301 gaaaaatcca aattggaaac gtatcaggcg aagctggtgt ccgcacagat 

351 tgcattgggc agcgcggaaa aaaaatataa gcgtcaggcg gcgttgtgga 

401 aggatgatgc gacctctaaa gaagatttgg aaagcgcgca ggatgcgctt 

451 gccgccgcca aagccaatgt tgccgagttg aaggctttaa tcagacagag 

501 caaaatttcc atcaataccg ccgagtcgga tttgggctac acgcgcatta 

551 ccgcgacgat ggacggcacg gtggtggcga ttcccgtgga agaggggcag 

601 actgtgaacg cggcgcagtc tacgccgacg attgtccaat tggcgaatct 

651 ggatatgatg ttgaacaaaa tgcagattgc cgagggcgat attaccaagg 

701 tgaaggcggg gcaggatatt tcgtttacga ttttgtccga accggatacg 

751 ccgattaagg cgaagctcga cagcgtcgac cccgggctga ccacgatgtc 

801 gtcgggcggc tacaacagca gtacggatac ggcttccaat gcggtctatt 

851 attatgcccg ttcgtttgtg ccgaatccgg acggcaaact cgccacgggg 

901 atgacgacgc agaatacggt tgaaatcgac ggtgtgaaaa atgtgttgct 

951 tattccgtcg ctgaccgtga aaaatcgcgg cggcaaggcg ttcgtacgcg 

1001 tgttgggtgc ggacggcaag gcagtggaac gcgaaatccg gaccggtatg 

1051 aaagacagta tgaataccga agtgaaaagc gggttgaaag agggggacaa 

1101 agtggtcatc tccgaaataa ccgccgccga gcagcaggaa agcggcgaac 

1151 gcgccctagg cggcccgccg cgccgataa 

This corresponds to the amino acid sequence <SEQ ID 1212; ORF 290.ng>: 

g290.pep 

1 MAKMMKWAAV AAVAAA AVWG GWSYLKPEPQ AAYITEAVRR GDISRTVSAT 

51 GEISPSNLVS VGAQASGQIK KLYVKLGQQV KKGDLIAEIN STTQTNTIDM 

101 EKSKLETYQA KLVSAQIALG SAEKKYKRQA ALWKDDATSK EDLESAQDAL 

151 AAAKANVAEL KALIRQSKIS INTAESDLGY TRITATMDGT VVAIPVEEGQ 

201 TVNAAQSTPT IVQLANLDMM LNKMQIAEGD ITKVKAGQDI SFTILSEPDT 

251 PIKAKLDSVD PGLTTMSSGG YNSSTDTASN AVYYYARSFV PNPDGKLATG 

301 MTTQNTVEID GVKNVLLIPS LTVKNRGGKA FVRVLGADGK AVEREIRTGM 

351 KDSMNTEVKS GLKEGDKVVI SEITAAEQQE SGERALGGPP RR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1213>: 

m290. seq (partial) 

1 . . GTATCGGTCG GCGCGCAGGC ATCGGGGCAG ATTAAGATAC TTTATGTCAA 

51 ACTCGGGCAA CAGGTTAAAA AGGGCGATTT GATTGCGGAA ATCAATTCGA 

101 CCTCGCAGAC CAATACGCTC AATACGGAAA AATCCAAGTT GGAAACGTAT 

151 CAGGCGAAGC TGGTGTCGGC ACAGATTGCA TTGGGCAGCG CGGAGAAGAA 

201 ATATAAGCGT CAGGCGGCGT TATGGAAGGA AAACGCGACT TCCAAAGAGG 

251 ATTTGGAAAG CGCGCAGGAT GCGTTTGCCG CCGCCAAAGC CAATGTTGCC 

301 GAGCTGAAGG CTTTAATCAG AC AG AG C AAA ATTTCCATCA ATACCGCCGA 

351 GTCGGAATTG GGCTACACGC GCATTACCGC AACGATGGAC GGCACGGTGG 

4 01 TGGCGATTCT CGTGGAAGAG GGGCAGACTG TGAACGCGGC GCAGTCTACG 

4 51 CCGACGATTG TCCAATTGGC GAATCTGGAT ATGATGTTGA ACAAAATGCA 

501 GATTGCCGAG GGCGATATTA CCAAGGTGAA GGCGGGGCAG GATATTTCGT 

551 TTACGATTTT GTCCGAACCG GATACGCCGA TTAAGGCGAA GCTCGACAGC 

601 GTCGACCCCG GGCTGACCAC GATGTCGTCG GGCGGTTACA ACAGCAGTAC 

651 GGATACGGCT TCCAATGCGG TCTACTATTA TGCCCGTTCG TTTGTGCCGA 
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701 ATCCGGACGG CAAACTCGCC 

7 51 ATCGACGGCG TGAAAAATGT 

801 TCGCGGCGGC AAGGCGTTTG 

851 CGGAACGCGA AATCCGGACC 

901 AAAAGCGGGT TGAAAGAGGG 

951 CGCCGAGCAA CAGGAAAGCG 

1001 GATAA 



677 

ACGGGGATGA CGACGCAGAA TACGGTTGAA 
GCTGATTATT CCGTCGCTGA CCGTGAAAAA 
TGCGCGTGTT GGGTGCGGAC GGCAAGGCGG 
GGTATGAGAG ACAGTATGAA TACCGAAGTA 
GGACAAAGTG GTCATCTCCG AAATAACCGC 
GCGAACGCGC CCTAGGCGGC CCGCCGCGCC 



This corresponds to the amino acid sequence <SEQ ID 1214; ORF 290>: 

m290 .pep (partial) 

1 . . VSVGAQASGQ IKILYVKLGQ QVKKGDLIAE INSTSQTNTL NTEKSKLETY 

51 QAKLVSAQIA LGSAEKKYKR QAALWKENAT SKEDLESAQD AFAAAKANVA 

101 ELKALIRQSK ISINTAESEL GYTRITATMD GTVVAILVEE GQTVNAAQST 

151 PTIVQLANLD MMLNKMQIAE GDITKVKAGQ DISFTILSEP DTPIKAKLDS 

201 VDPGLTTMSS GGYNSSTDTA SNAVYYYARS FVPNPDGKLA TGMTTQNTVE 

251 IDGVKNVLII PSLTVKNRGG KAFVRVLGAD GKAAEREIRT GMRDSMNTEV 

301 KSGLKEGDKV VISEITAAEQ QESGERALGG PPRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m29O/g290 96.1% identity in 334 aa overlap 

10 20 30 

VSVGAQASGQIKILYVKLGQQVKKGDLIAE 
t I I I II I I I II I I I I I I I I I II I I I I I I I 
PQAAYITEAVRRGDISRTVSATGEISPSNLVSVGAQASGQIKKLYVKLGQQVKKGDLIAE 
30 40 50 60 70 80 



m290.pep 
g290 



40 50 60 70 80 90 

m290.pep INSTSQTNTLNTEKSKLETYQAKLVSAQIALGSAEKKYKRQAALWKENATSKEDLESAQD 

1111:1111:: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g 2 90 INSTTQTNTIDMEKSKLETYQAKLVSAQIALGSAEKKYKRQAALWKDDATSKEDLESAQD 
90 100 110 120 130 140 

100 110 120 130 140 150 

m290 . pep AFAAAKANVAELKALIRQSKISINTAESELGYTRITATMDGTVVAILVEEGQTVNAAQST 
I : | | I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g 290 ALAAAKANVAELKALIRQSKISINTAESDLGYTRITATMDGTVVAIPVEEGQTVNAAQST 
150 160 170 180 190 200 

160 170 180 190 200 210 

m290.pep PTIVQLANLDMMLNKMQIAEGDITKVKAGQDISFTILSEPDTPIKAKLDSVDPGLTTMSS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I M I I I M I I I I I I I I I I I I I I I I I I 
g 290 PTIVQLANLDMMLNKMQIAEGDITKVKAGQDISFTILSEPDTPIKAKLDSVDPGLTTMSS 
210 220 230 240 250 260 

220 230 240 250 260 270 

m290 . pep GGYNSSTDTASNAVYYYARSFVPNPDGKLATGMTTQNTVEIDGVKNVLI I PSLTVKNRGG 

I | | I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I 1 I I I I I II I 
g 290 GGYNSSTDTASNAVYYYARSFVPNPDGKLATGMTTQNTVEIDGVKNVLLI PSLTVKNRGG 

270 280 290 300 310 320 

280 290 300 310 320 330 

m290 . pep KAFVRVLGADGKAAEREIRTGMRDSMNTEVKSGLKEGDKVVISEITAAEQQESGERALGG 
M I I I I I I I I I ! I : I I I I I I I i : I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I ! I I I ! 
g 2 9 0 KAFVRVLGADGKAVERE IRTGMKDSMNTEVKSGLKEGDKVVI SEI TAAEQQESGERALGG 

330 340 350 360 370 380 



m290.pep PPRRX 
I I I I I 

g290 PPRRX 
3 90 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1215>: 

a290.seq 

1 ATGGCAAAAA TGATGAAATG GGCGGCTGTT GCGGCGGTCG CGGCGGCAGC 

51 GGTTTGGGGC GGATGGTCTT ATCTGAAGCC CGAGCCGCAG GCTGCTTATA 

101 TTACGGAAAC GGTCAGGCGC GGCGACATCA GCCGGACGGT TTCTGCAACA 

151 GGGGAGATTT CGCCGTCCAA CCTGGTATCG GTCGGCGCGC AGGCATCGGG 

201 GCAGATTAAG AAACTTTATG TCAAACTCGG GCAACAGGTT AAAAAGGGCG 

251 ATTTGATTGC GGAAATCAAT TCGACCTCGC AGACCAATAC GCTCAATACG 

301 GAAAAATCCA AATTGGAAAC GTATCAGGCG AAGCTGGTGT CGGCACAGAT 

351 TGCATTGGGC AGCGCGGAGA AGAAATATAA GCGTCAGGCG GCGTTGTGGA 

4 01 AGGATGATGC GACCGCTAAA GAAGATTTGG AAAGCGCACA GGATGCGCTT 

4 51 GCCGCCGCCA AAGCCAATGT TGCCGAGCTG AAGGCTCTAA TCAGACAGAG 

501 CAAAATTTCC ATCAATACCG CCGAGTCGGA ATTGGGCTAC ACGCGCATTA 

551 CCGCAACGAT GGACGGCACG GTGGTGGCGA TTCTCGTGGA AGAGGGGCAG 

601 ACTGTGAACG CGGCGCAGTC TACGCCGACG ATTGTCCAAT TGGCGAATCT 

651 GGATATGATG TTGAACAAAA TGCAGATTGC CGAGGGCGAT ATTACCAAGG 

701 TGAAGGCGGG GCAGGATATT TCGTTTACGA TTTTGTCCGA ACCGGATACG 

7 51 CCGATTAAGG CGAAGCTCGA CAGCGTCGAC CCCGGGCTGA CCACGATGTC 

801 GTCGGGCGGC TACAACAGCA GTACGGATAC GGCTTCCAAT GCGGTCTACT 

851 ATTATGCCCG TTCGTTTGTG CCGAATCCGG ACGGCAAACT CGCCACGGGG 

901 ATGACGACGC AGAATACGGT TGAAATCGAC GGTGTGAAAA ATGTGCTGAT 

951 TATTCCGTCG CTGACCGTGA AAAATCGCGG CGGCAGGGCG TTTGTGCGCG 

1001 TGTTGGGTGC AGACGGCAAG GCGGCGGAAC GCGAAATCCG GACCGGTATG 

1051 AGAGACAGTA TGAATACCGA AGTAAAAAGC GGGTTGAAAG AGGGGGACAA 

1101 AGTGGTCATC TCCGAAATAA CCGCCGCCGA GCAGCAGGAA AGCGGCGAAC 

1151 GCGCCCTAGG CGGCCCGCCG CGCCGATAA 



This corresponds to the amino acid sequence <SEQ ID 1216; ORF 290.a>: 

a2 90.pep 

1 MAKMMKWAAV AAVAAA AVWG GWSYLKPEPQ AAYITETVRR GDISRTVSAT 

51 GEISPSNLVS VGAQASGQIK KLYVKLGQQV KKGDLIAEIN STSQTNTLNT 

101 EKSKLETYQA KLVSAQIALG SAEKKYKRQA ALWKDDATAK EDLESAQDAL 

151 AAAKANVAEL KALIRQSKIS INTAESELGY TRITATMDGT VVAILVEEGQ 

201 TVNAAQSTPT IVQLANLDMM LNKMQIAEGD ITKVKAGQDI SFTILSEPDT 

251 PIKAKLDSVD PGLTTMSSGG YNSSTDTASN AVYYYARSFV PNPDGKLATG 

301 MTTQNTVEID GVKNVLIIPS LTVKNRGGRA FVRVLGADGK AAEREIRTGM 

351 RDSMNTEVKS GLKEGDKWI SEITAAEQQE SGERALGGPP RR* 



m290/a290 98.2% identity in 334 aa overlap 

10 20 30 

m2 90 . pep VSVGAQASGQIKILYVKLGQQVKKGDLIAE 

t I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
a 290 PQAAYITETVRRGDISRTVSATGEISPSNLVSVGAQASGQIKKLYVKLGQQVKKGDLIAE 
30 40 50 60 70 80 

40 50 60 70 80 90 

m290 . pep INSTSQTNTLNTEKSKLETYQAKLVSAQIALGSAEKKYKRQTVALWKENATSKEDLESAQD 

I | | | | | I I I I I I I I I I I I I I ( I I I I I I I I I I I I M I I I I I I I I I I I I I : I I I i I I II I 
a 290 INSTSQTNTLNTEKSKLETYQAKLVSAQIALGSAEKKYKRQAALWKDDATAKEDLESAQD 

90 100 110 120 130 140 

100 110 120 130 140 150 

m290 . pep AFAAAKANVAELKALIRQSKISINTAESELGYTRITATMDGTVVAILVEEGQTVNAAQST 
I : I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M I t I I I I I I I I I I I I I ! I I I I I I I 
a 290 ALAAAKANVAELKALIRQSKI S INTAESELGYTRITATMDGTVVAI LVEEGQTVNAAQST 

150 160 170 180 190 200 

160 170 180 190 200 210 

m290 . pep PTIVQLANLDMMLNKMQIAEGDITKVKAGQDISFTILSEPDTPIKAKLDSVDPGLTTMSS 
| | | | | I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
a290 PTIVQLANLDMMLNKMQIAEGDITKVKAGQDISFTILSEPDTPIKAKLDSVDPGLTTMSS 
210 220 230 240 250 260 



220 230 240 250 260 270 



WO 99/57280 



PCT/US99/09346 



679 

m2 90 pep GGYNS STDTASNAVYYYARS FVPNPDGKLATGMTTQNTVEI DGVKNVLI I PSLTVKNRGG 

I I I I I ! I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I II I I I I II I I 

a2 90 GGYNSSTDTASNAVYYYARSFVPNPDGKLATGMTTQNTVEIDGVKNVLI IPSLTVKNRGG 

270 280 290 300 310 320 

280 290 300 310 320 330 

m290 pep KAFVRVLGADGKAAEREIRTGMRDSMNTEVKSGLKEGDKVVISEITAAEQQESGERALGG 

: | | | | I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I II I I I I I 
a2 90 RAFVRVLGADGKAAEREIRTGMRDSMNTEVKSGLKEGDKVVISEITAAEQQESGERALGG 
330 340 350 360 370 380 



m2 90.pep PPRRX 
I I I I I 

a290 PPRRX 
390 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121 7>: 

g292.seq 

1 atgaaaacca agttaatcaa aatcttgacc ccctttaccg tcctgccgct 

51 gctggcttgc gggcaaacgc ccgtttccaa tgccaacgcc gaatccgccg 

101 tcaaagccga atccgccggc aaatccgttg ccgcttcttt gaaagcgcgt 

151 ttggaaaaaa cctattccgc ccaagatttg aaagtgttga gcgtcagcga 

201 aacaccggtc aaaggcattt acgaagtcgt cgtcagcggc aggcagatta 

251 tctacaccga tgccgaaggc ggctatatgt tcgtcggcga actcatcaac 

301 atcgacacgc gcaaaaacct gaccgaagaa cgcgccgccg atttgaacaa 

351 aatcgacttc gcctccctgc ctttggacaa agccatcaaa gaagtacgcg 

401 gcaacggcaa gctgaaagtc gccgtcttct ccgaccccga ttgtccgttc 

451 tgcaaacgct tggaacatga gtttgaaaaa atgaccgacg tgacggttta 

501 cagctttatg atgcccattg ccggcctgca cccagatgcc gcgcgcaagg 

551 cgcaaatctt atggtgtcag cccgaccgtg ccaaagcgtg gacggattgg 

601 atgcgtaaag gcaaattccc ggtcggcggc agcatctgcg acaatcccgt 

651 cgcggaaacc acttccttgg gcgaacagtt cggcttcaac ggcacgccga 

701 cccttcgtct tccccaacgg gcgcacccaa agcggttaca gcccgatgcc 

751 ccaactggag gaaatcatcc gcaaaaacca gcagtaaacc cgcaatga 

This corresponds to the amino acid sequence <SEQ ID 1218; ORF 292.ng>: 

g292.pep 

1 MKTKLIKILT PFTVLPLLA C GQTPVSNANA ESAVKAESAG KSVAASLKAR 

51 LEKTYSAQDL KVLSVSETPV KGIYEVVVSG RQIIYTDAEG GYMFVGELIN 

101 IDTRKNLTEE RAADLNKIDF ASLPLDKAIK EVRGNGKLKV AVFSDPDCPF 

151 CKRLEHEFEK MTDVTVYSFM MPIAGLHPDA ARKAQILWCQ PDRAKAWTDW 

201 MRKGKFPVGG SICDNPVAET TSLGEQFGFN GTPTLRLPQR AHPKRLQPDA 

251 PTGGNHPQKP AVNPQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1219>: 

m292.seq 

1 ATGAAAACCA AGTTAATCAA AATCTTGACC CCCTTTACCG TCCTCCCGCT 

51 GCTGGCTTGC GGGCAAACGC CCGTTTCCAA TGCCAACGCC GAACCCGCCG 

101 TCAAAGCCGA GTCCGCCGGC AAATCCGTTG CCGCCTCTTT GAAAGCGCGT 

151 TTGGAAAAAA CCTATTCCGC CCAAGATTTG AAAGTGTTGA GCGTCAGCGA 

201 AACACCGGTC AAAGGCATTT ACGAAGTCGT CGTCAGCGGC AGGCAGATTA 

251 TCTACACCGA TGCCGAAGGC GGCTATATGT TCGTCGGCGA ACTCATCAAC 

301 ATCGACACGC GCAAAAACCT GACCGAAGAA CGCGCCGCCG ATTTGAACAA 

351 AATCGACTTC GCCTCCCTGC CTTTGGACAA AGCCATCAAA GAAGTGCGCG 

4 01 GCAACGGCAA GCTGAAAGTC GCCGTCTTCT CCGACCCCGA TTGTCCGTTC 

4 51 TGCAAACGCT TGGAACACGA GTTTGAAAAA ATGACCGACG TGACGGTTTA 

501 CAGCTTTATG ATGCCCATTG CCGGCCTGCA CCCCGATGCC GCGCGCAAGG 

551 CGCAAATCTT ATGGTGTCAG CCCGACCGCG CCAAAGCGTG GACGGATTGG 

601 ATGCGTAAAG GCAAATTCCC GGTCGGCGGC AGCATCTGCG ACAATCCCGT 

651 CGCGGAAACC ACTTCCTTGG GCGAACAATT CGGCTTCAAC GGCACGCCGA 

701 CCCTCGTCTT CCCCAACGGG CGCAGCCAAA GCGGCTACAG CCCGATGCCC 
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7 51 CAACTGGAGG AAATCATCCG CAAAAATCAA TAA 

This corresponds to the amino acid sequence <SEQ ID 1220; ORF 292>: 

m2 92.pep 

1 MKTKLIKILT PFTVLPLLA C GQTPVSNANA EPAVKAESAG KSVAASLKAR 

51 LEKTYSAQDL KVLSVSETPV KGIYEVVVSG RQIIYTDAEG GYMFVGELIN 

101 IDTRKNLTEE RAADLNKIDF ASLPLDKAIK EVRGNGKLKV AVFSDPDCPF 

151 CKRLEHEFEK MTDVTVYSFM MPIAGLHPDA ARKAQILWCQ PDRAKAWTDW 

201 MRKGKFPVGG SICDNPVAET TSLGEQFG FN GTPTLVFPNG RSQSGYSPMP 

251 QLEEIIRKNQ * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m292/g292 98.7% identity in 238 aa overlap 

10 20 30 40 50 60 

MKTKLIKILT PFTVLPLLACGQTPVSNANAEPAVKAESAGKSVAASLKARLEKTYSAQDL 

I | | | | | I I I I I I I I I I I I I I I I I I I I I i I I I I ! I I I I I I I I I I I I I I I I I I I i I I I I I I 

MKTKLIKILTPFTVLPLLACGQTPVSNANAESAVKAESAGKSVAASLKARLEKTYSAQDL 
10 20 30 40 50 60 

70 80 90 100 110 120 

KVLSVSETPVKGIYEWVSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 

I | | | I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 

KVLSVSETPVKGIYEVVVSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ASLPLDKAIKEVRGNGKLKVAVFSDPDCPFCKRLEHEFEKMTDVTVYSFMMPIAGLHPDA 

| | | | | | | | | | | I I | I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 

ASLPLDKAIKEVRGNGKLKVAVFSDPDCPFCKRLEHEFEKMT DVTVYSFMMPIAGLHPDA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ARKAQILWCQPDRAKAWTDWMRKGKFPVGGSICDN PVAETTSLGEQFG FN GTPTLVFPNG 

| | | | | | | | | | | I I I ! I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I : I : 
ARKAQILWCQPDRAKAWTDWMRKGKFPVGGSICDN PVAETTSLGEQFGFNGTPTLRLPQR 
190 200 210 220 230 . 240 

250 260 
RSQSGYSPMPQLEEI IRKNQX 

AH PKRLQPDAPTGGNH PQKPAVN PQX 
250 260 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1221>: 

a292.seq 

1 ATGAAAACCA AGTTAATCAA AATCTTGACC CCCTTTACCG TCCTCCCGCT 

51 GCTGGCTTGC GGGCAAACGC CCGTTTCCAA TGCCAACGCC GAACCCGCCG 

101 TCAAAGCCGA GTCCGCCGGC AAATCCGTTG CCGCCTCTTT GAAAGCGCGT 

151 TTGGAAAAAA CCTATTCCGC CCAAGATTTG AAAGTGTTGA GCGTCAGCGA 

201 AACACCGGTC AAAGGCATTT ACGAAGTCGT CGTCAGCGGC AGGCAGATTA 

251 TCTACACCGA TGCCGAAGGC GGCTATATGT TCGTCGGCGA ACTCATCAAC 

301 ATCGACACGC GCAAAAACCT GACCGAAGAA CGCGCCGCCG ATTTGAACAA 

351 AATCGACTTC GCCTCCCTGC CTTTGGACAA AGCCATCAAA GAAGTGCGCG 

4 01 GCAACGGCAA GCTGAAAGTC GCCGTCTTCT CCGACCCCGA TTGTCCGTTC 

4 51 TGCAAACGCT TGGAACACGA GTTTGAAAAA ATGACCGACG TGACGGTTTA 

501 CAGCTTTATG ATGCCCATTG CCGGCCTGCA CCCCGATGCC GCGCGCAAGG 

551 CGCAAATCTT ATGGTGTCAG CCCGACCGCG CCAAAGCGTG GACGGATTGG 

601 ATGCGTAAAG GCAAATTCCC GGTCGGCGGC AGCATCTGCG ACAATCCCGT 

651 CGCGGAAACC ACTTCCTTGG GCGAACAATT CGGCTTCAAC GGCACGCCGA 

701 CCCTCGTCTT CCCCAACGGG CGCAGCCAAA GCGGCTACAG CCCGATGCCC 

7 51 CAACTGGAGG AAATCATCCG CAAAAATCAA TAA 



m292.pep 
g292 

m2 92 .pep 
g292 

m292 .pep 
g292 

m292.pep 
g292 

m292.pep 
g292 
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This corresponds to the amino acid sequence <SEQ ID 1222; ORF 292. a>: 

a292 .pep 

1 MKTKLIKILT PFTVLPLLA C GQTPVSNANA EPAVKAESAG KSVAASLKAR 

51 LEKTYSAQDL KVLSVSETPV KGIYEVVVSG RQIIYTDAEG GYMFVGELIN 

101 IDTRKNLTEE RAADLNKIDF ASLPLDKAIK EVRGNGKLKV AVFSDPDCPF 

151 CKRLEHEFEK MTDVTVYSFM MPIAGLHPDA ARKAQILWCQ PDRAKAWTDW 

201 MRKGKFPVGG SICDNPVAET TSLGEQFGFN GTPTLVFPNG RSQSGYSPMP 

251 QLEEIIRKNQ * 

m292/a292 100.0% identity in 260 aa overlap 

10 20 30 40 50 60 

MKTKLIKILT PFTVLPLLACGQTPVSNANAEPAVKAESAGKSVAASLKARLEKTYSAQDL 

| | | | | | | | I I I I I I 1 I I I I I I 1 I i I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
MKTKLIKILTPFTVLPLLACGQTPVSNANAEPAVKAESAGKSVAASLKARLEKTYSAQDL 
10 20 30 40 50 60 

70 80 90 100 110 120 

KVLSVSETPVKGIYEVVVSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 

I | | | | | I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 

KVLSVSETPVKGIYEVVVSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 
70 80 90 100 110 120 

130 140 150 160 170 180 

ASLPLDKAIKEVRGNGKLKVAVFSDPDCPFCKRLEHEFEKMTDVTVYSFMMPIAGLHPDA 

( I | I M I ft I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I 
ASLPLDKAIKEVRGNGKLKVAVFSDPDCPFCKRLEHE FEKMTDVTVYSFMMPIAGLHPDA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ARKAQILWCQPDRAKAWTDWMRKGKFPVGGSICDNPVAETTSLGEQFGFNGTPTLVFPNG 

| | | | | | | | | | | | I I I I I I I 1 I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
ARKAQILWCQPDRAKAWTDWMRKGKFPVGGSICDNPVAETTSLGEQFGFNGTPTLVFPNG 
190 200 210 220 230 240 

250 260 
RSQSGYSPMPQLEEIIRKNQX 

I I I I I I I I I I I I I I I I I I I I I 
RSQSGYSPMPQLEEIIRKNQX 
250 260 



m292.pep 
a292 

m292 .pep 
a292 

m2 92.pep 
a292 

m2 92 .pep 
a292 

m2 92.pep 
a292 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1223>: 

g294 . seq (partial) 

1 atgcgtatta cctgtgcgcc gatgtcgctt ttgtcggcgg cagtctggtc 

51 ggttcgggct gtcagaacat catcgaaccg ctttcctgcg gcgttacgac 

101 gatattcggc ttttcgacct acaatttttc cgaagcctgc cggcacgcct 

151 tggcatcggg tgcggcggtt caagtcgaat cggcggacgc gtggcgtgaa 

201 gccgttgaaa aaaccttatc tggcgagggg ggcggaatgc agatgcaggc 

251 gcgcgtggac ggctttatcg cacaacatcg cggagcgggc gcgagaatcg 

301 ccgaggcggt gcgggaagcg gtatgcggac atcgggggcg atagtgatac 

351 aatccgtatc cgagttttcc ggttggagca tcgtatgagt atttatgccg 

401 tcgcgcacat catccacctg tattgcgcca ccgcctttgt cggcggcgtg 

451 ttttttgaag tgctggtttt gtccgtcctg catacgggac gggtgtcgcg 

501 cgaggcgcgg cgcgaagtgg aaaaggcaat gtcttaccgc gccgtcaggg 

551 tgatgccgtt tgcggtcgga ctgctgttcg ccaggggaac tctagagtcg 

601 actgcagcag catgccctc. . . 

This corresponds to the amino acid sequence <SEQ ID 1224; ORF 294.ng>: 

g294.pep (partial) 

1 MRITCAPMSL LSAAV WSVRA VRTSSNRFPA ALRRYSAFRP TIFPKPAGTP 

51 WHRVRRFKSN RRTRGVKPLK KPYLARGAEC RCRRAWTALS HNIAERARES 

101 PRRCGKRYAD IGGDSDTIRI RVFRLEHRMS IYAVAHIIHL Y CATAFVGGV 

151 FFEVLVLS VL HTGRVSREAR REVEKAMSYR AVRVMPFAVG LLFARGT LES 

201 TAAACP. . - - 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1225>: 
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m294 . seq 








1 


ATGCGTATTA 


CCi G 1 GGGCG 


GAI bl LGG 1 1 


51 


GATTCGGGTT 


GTCAGAACAT 


GAI LGAACCG 


101 


GATATTCGGC 


TTTTCAACCT 


AC AAl 1 I llC 


151 


TGGCATCGGG 


TGCGGCGG 1 1 


C AAG 1 LGAA 1 


201 


GCCGTTGAAA 


AAACG 1 IA1L 




251 


GCGCGTGGAC 


GGC1 I 1 AIULj 


LHL/iHLn J. L,vj 


301 


/-» /~> /*• t\ r~ f~ r* f r- 


la Ij o rt/i o o 




351 


AAl LtbiAlt 


^Lj/4Vj1 111 


bill bOnuLn 


401 




tul l^rtlV«.llJ 


TATTGfGCTA 




TTTTTTfiAAG 


TGCTGGTTTT 


GTCCGTCCTG 


501 


CGAGGCGCGG 


CGCGAAGTGG 


AAAAGGCAAT 


551 


TGATGCCGTT 


TGTGGTCGGA 


CTGCTGTTCG 


601 


GCAAACCGCT 


ATCTTTCTAT 


ATTGGGCGAA 


651 


TACGATGCTG 


ACGCTGAAAA 


TCCTGTTGGC 


701 


TCGCCATCGC 


CGTCGTCAAA 


ATGGCGCGTT 


751 


TCGAAATACA 


TACACGCCGT 


CGTCTTTACC 


801 


TTTGGCAAAA 


GCGATGTTTT 


ATATCAGCTG 



TTGTCGGCGG CAGTCTGGTC 
CTTTCCTGCG GCGTTCCGAC 
CGAAGCCTGC CGACACGCCT 
CGGCGGATGC GTGGCGGGAA 
GGGGGGATGC AGATGCAGGC 
CGGAGCGGGC GCGAGAATCG 
ATCGGGGGCG ATAGTGATAC 
TCGTATGAGT ATTTATGCCG 
TTGCCTTTGT CGGCGGCGTG 
CATACGGGAC GGGTGTCGCG 
GTCTTACCGC GCCGTCAGGG 
CCAGCGGCAT CGTGATGGCG 
CCGTTTGCCA CTTCCTTCGG 
GTTCAGCGTA TTGGCGCACT 



This corresponds to the amino acid sequence <SEQ ID 1226; ORF 294>: 

m294 .pep 

1 MRITCAPMSL LSAAV WSIRV VRTSSNRFPA AFRRYSAFQP TIFPKPADTP 

51 WHRVRRFKSN RRMRGGKPLK KPYRPRGGGC RCRRAWTALS HNIAERARES 

101 PRRCGKRYAD IGGDSDTIRI RVFRLEHRMS IYAVAHIVHL Y CAIAFVGGV 

151 FFEVLVLS VL HTGRVSREAR REVEKAMSYR AVRVMPFVVG LLFASGI VMA 

201 ANRYLSILGE PFATSFGTM L TLKILLAFSV LAHFAI AVVK MARSTLTVGW 

251 SKY IHAVVFT HMLLIVFLAK AMFYISW* 

g294/m294 92.3% identity in 196 aa overlap 

10 20 30 40 50 60 

g294 .pep MRITCAPMSLLSAAVWSVRAVRTSSNRFPAALRRYSAFRPTIFPKPAGTPWHRVRRFKSN 
| | | | | | | | | | | | I I I I I : I : I I I I I I I I I I I : II I I I I : I I I I I I I I I I I I I I I I 1 i I I 
m2 9 4 MRI TC APMS LLS AAVWS I RVVRTS SNRFPAAFRRYSAFQPT I FPKP ADTPWHRVRRFKSN 

10 20 30 40 50 60 

70 80 90 100 110 120 

g294 pep RRTRGVKPLKKPYLARGAECRCRRAWTALSHNIAERARESPRRCGKRYADIGGDSDTIRI 
|j if | I 1 I I I I II: I I I I I I I I I I! I I I I I I I I It II I I I I I I I I I 1 I I I I I I I I 
m 2 94 RRMRGGKPLKKPYRPRGGGCRCRRAWTALSHNIAERARESPRRCGKRYADIGGDSDTIRI 

70 80 90 100 110 120 

130 140 150 160 170 180 

g294 pep RVFRLEHRMSIYAVAHIIHLYCATAFVGGVFFEVLVLSVLHTGRVSREARREVEKAMSYR 

I I I I I I I I I I I I I I I I I : 1 I I I I IE I I I I I I I I I I I I I I i i I I I 1 I II I ( I I I I 

m2 94 RVFRLEHRMS I YAVAHIVHLYCAIAFVGGVFFEVLVLSVLHTGRVSREARREVEKAMSYR 

130 140 150 160 170 180 

190 200 
g294 .pep AVRVMPFAVGLLFARGTLESTAAACP 
I I I i I I I : I 1 I I I I I 

m2 94 AVRVMPFVVGLLFASGIVMAANRYLSILGEPFATSFGTMLTLKILLAFSVLAHFAIAVVK 
190 200 210 220 230 240 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1227>: 

a294 . seq 

1 ATGCGTATTA CCTGTGCGCC GATGTCGCTT TTGTCGGCGG CAGTCTGGTC 

51 GATTCGGGCT GTCAGAACAT CATCGAACCG CTTTCCTGCG GCGTTCCGAC 

101 GATATTCGGC TTTTCGACCT ACAATTTTTC CGAAGCCTGC CGGCACGCCT 

151 TGGCATCGGG TGCGGCGGTT CAAGTCGAAT CGGCGGACGC GTGGCGGGAA 

201 GCCGTTGAAA AAAACTTATC GTCCGAGGAG GGCGGAATGC AGATGCAGGC 

251 GCGCGCGGAC GGCTTTATCG CACAACATCG CGGAGCGGGC GCGAGAATCG 

301 CCGAGGCGGT ACGGGAAGCG GTATGCGGAC ATCGGGGACG ATAGTGATAC 

351 AATCCGTATC CGAGTTTTCC GGTTGGAGTA CCGTATGAGT ATTTATGCCG 

4 01 TCGCGCACAT CGTCCACCTG TATTGCGCCA TCGCCTTTGT CGGCGGCGTG 

4 51 TTTTTTGAAG TGCTGGTTTT GTCCGTCCTG CATACGGGAC GGGTGTCGTG 

501 CGAGGCGCGG CGCGAAGTGG AAAAGGCAAT GTCTTACCGC GCCGTCAGGG 

551 TGATGCCGTT TGTGGTCGGA CTGCTGTTCG CCAGCGGCAT CGTGATGGCG 

601 GCAAACCGCT ATCTTTCTAT ATTGGGCGAA CCGTTTGCCA CTTCCTTCGG 

651 TACGATGCTG ACGCTGAAAA TCCTGTTGGC GTTCAGCGTG TTGGCGCACT 
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